Ira M. Hall
- Assistant Professor, Biochemistry and Molecular Genetics
Bioinformatics; Genome Variation; Genomic Instability; Cancer Genome Evolution
Research in my laboratory is focused on understanding the mechanistic origins and functional consequences of genome variation in mammals. We are primarily interested in structural variation (SV), which includes duplications, deletions, inversions, insertions, translocations and other forms of genomic rearrangement. These large scale genetic differences comprise a significant fraction of inherited human genetic variation, are increasingly found to underlie sporadic disease, and are a driving force in the evolution of cancer genomes. Somatically-acquired structural variants may also contribute to phenotypes that emerge during development and aging. Our understanding of structural variation has been, and remains to be, limited by technology. Due to the profound experimental and computational challenges associated with performing unbiased genome-wide experiments, many fundamental questions regarding the origin and impact of structural variation remain unanswered.
We are currently pursuing three general lines of investigation. First, we are developing methods to map and interpret genome variation from next-generation sequence data. These include algorithms to map structural variation by paired-end mapping, split-read mapping and depth of coverage analysis, as well as a suite of software tools for comparing raw sequence data, annotations and variant calls. We are also developing computational methods for optimal split-read alignment, haplotype-aware local assembly, and breakpoint visualization. Finally, we are working on experimental methods to map SV in small populations of cells, which will allow for a direct assessment of genetic diversity in somatic lineages.
Second, we are using these methods to tackle basic questions regarding structural variation in "normal" germline and somatic genomes. While our initial work on these topics has used the laboratory mouse as model, the recent explosion of human genome sequence data has allowed us to transition most of our work to human. Questions that we are interested in include: what are the mechanisms that generate new SV? To what extent are these mechanisms under cellular control, or affected by stress, and thus variable depending on genetic background or environmental conditions? What is the contribution of structurally unstable and/or hyper-variable loci to natural variation? How prevalent is de novo SV in different somatic lineages and single somatic cells, and to what extent can this phenomenon account for traits that emerge during the course of development or aging? Finally, we are studying the process of tumor genome evolution by comparing the patterns and mechanisms of genomic rearrangement in cancer cells relative to those found in normal cells. We are mapping and assembling structural variant breakpoints from hundreds of human whole-genome sequence datasets, comparing the breakpoint properties of germline versus somatically-acquired structural variation, and attempting to understand the mutational forces that shape highly rearranged tumor genomes. It is our hope that these experiments will lead to new insights into tumor genome evolution and perhaps suggest novel genome-based diagnostic tests that correlate with malignancy.
- Faust G, Hall I. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics (Oxford, England). 2014;30(17): 2503-5. PMID: 24812344 | PMCID: PMC4147885
- Malhotra A, Lindberg M, Faust G, Leibowitz M, Clark R, Layer R, Quinlan A, Hall I. Breakpoint profiling of 64 cancer genomes reveals numerous complex rearrangements spawned by homology-independent mechanisms. Genome research. 2013;23(5): 762-76. PMID: 23410887 | PMCID: PMC3638133
- Faust G, Hall I. YAHA: fast and flexible long-read alignment with optimal breakpoint detection. Bioinformatics (Oxford, England). 2012;28(19): 2417-24. PMID: 22829624 | PMCID: PMC3463118
- Mell J, Hall I, Redfield R. Defining the DNA uptake specificity of naturally competent Haemophilus influenzae cells. Nucleic acids research. 2012;40(17): 8536-49. PMID: 22753031 | PMCID: PMC3458573
- Keene K, Quinlan A, Hou X, Hall I, Mychaleckyj J, Onengut-Gumuscu S, Concannon P. Evidence for two independent associations with type 1 diabetes at the 12q13 locus. Genes and immunity. 2011;13(1): 66-70. PMID: 21850031 | PMCID: PMC3285513
- Quinlan A, Boland M, Leibowitz M, Shumilina S, Pehrson S, Baldwin K, Hall I. Genome sequencing of mouse induced pluripotent stem cells reveals retroelement stability and infrequent DNA rearrangement during reprogramming. Cell stem cell. 2011;9(4): 366-73. PMID: 21982236 | PMCID: PMC3975295
- Quinlan A, Hall I. Characterizing complex structural variation in germline and somatic genomes. Trends in genetics : TIG. 2011;28(1): 43-53. PMID: 22094265 | PMCID: PMC3249479
- Quinlan A, Clark R, Sokolova S, Leibowitz M, Zhang Y, Hurles M, Mell J, Hall I. Genome-wide mapping and assembly of structural variant breakpoints in the mouse genome. Genome research. 2010;20(5): 623-35. PMID: 20308636 | PMCID: PMC2860164
- Quinlan A, Hall I. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics (Oxford, England). 2010;26(6): 841-2. PMID: 20110278 | PMCID: PMC2832824