in progress
Long read sequencing data from 3,622 Icelanders identified a median of 22,636 SVs per individual (insertios: 13,353; deletions: 9,474) Beyter and coll, 2021.
Inversions
Within-population
SNP analysis in >1400 seaweed flies identified known and candidate genomic inversions (Mérot and coll, 2021).
SV analysis of 31 diploid assemblies of Theobroma cacao showed relaxed selection and increased genetic load in inversions and other types of variants Hämälä and coll., 2021.
?Stefansson and coll, 2005 found an inversion in the human genome and calculated that it appeared ~3 million years ago, before the speciation of Homo sapiens. Low-copy repeated sequences were found in the vicinity. Donelly and coll., 2010 gave a radically lower time estimate of 13,600 to 108,400 years, but an ancient origin was again supported by Steinberg and coll in 2012.
Between-species
~1100 small-scale inversions are estimated to have happened between S. cerevisiae and C. albicans (Seoighe and coll., 2000). The authors propose that “successive multigene inversions” have “distrupted” the “precise arrangement” of “genes that are adjacent in one species are in the same neighborhood”. A “bias toward small inversions” is reported.
A collection of gene arrangements where each pair is related is related by a single large-scale inversion was found by Dobzhansky and Sturtevant in D. pseudoobscura.
In the Drosophila pseudoobscura / persimilis species complex, chromosomal inversions suppress crossovers not only inside the inversion but also close to it (within 1–2 Mb)” ([[Machado, Haselkorn and Noor (2007)|biblio/17179068])].
The overall rate of breakage in the D. melanogaster / D. yakuba lineage calculated by (?Ranz and coll., 2007) is 0.0183/Mb/Myr. The authors “propose that the presence of inverted duplications associated with inversion breakpoint regions is the result of staggered breaks, either isochromatid or chromatid, and that this, rather than ectopic exchange between inverted repetitive sequences, is the prevalent mechanism for the generation of inversions in the melanogaster species group”.
Flanking an inversion in D. buzzati between two Galileo elements, the exchange of Target-Site Duplication sequences was observed by Cáceres and coll., (1999).
Ectopic recombination of a Galileo element may have caused a recent large-scale inversion in D. buzzati (Delprat and coll, 2009).
Kent and coll., 2003 reported 2 inversions per Mbp in human/mouse comparisons, median length 814.
In bacteria, rearrangements have been reported to generate a X-shaped pattern in dot-plots by Eisen and coll (2000) and Tillier and Collins (2000). Both reports suggest that the patterns are generated by inversions centered on origins and termini. The prevalence of this phenomenon was confirmed by D'Iorio and Dewar, 2022. The state of the art in bacterial genomics in 2000 was reviewed by Hughes.
A mixture of X-shaped dot-plot pattern and scrambling was reported by Zivanovic and coll. (2002) in a study of Pyrococcus genomes. Scrambling in Thermococcales appears to be fast enough that the usual compositional biases on both sides of the origin of replication do not have time to establish (Cossu and coll., 2015).
Mechanism
Staggered breaks (?Ranz and coll., 2007) caused by inverted duplications. The duplicated sequnces may be coding genes or pseudogenes, not just transposable elements.
(Delprat and coll, 2009) proposed that “Chromosomal inversions may be generated by transposons when two ends that are not part of the same transposon participate in an aberrant transposition event to a new site”. They give an example where the two transposons are identical copies on sister chromatids in G2 phase.
Indels
Indel-seq (Min and coll., 2023) shows insertions at double-strand breaks originating from donor regions possibly single-stranded by transcription (R-loops) or repair. Proximity and contact appear to be important.
Translocations
Translocations are frequent in Clarkia (Lewis and Raven, 1958).
Software
- NanoSV (Cretu Stancu and coll., 2017) uses nanopore long reads aligned to a reference genome with last-split
Liu X, Liu W, Lenstra JA, Zheng Z, Wu X, Yang J, Li B, Yang Y, Qiu Q, Liu H, Li K, Liang C, Guo X, Ma X, Abbott RJ, Kang M, Yan P, Liu J.
Nat Commun. 2023 Sep 19;14(1):5617. doi:10.1038/s41467-023-41220-x
Evolutionary origin of genomic structural variations in domestic yaks.
Min J, Zhao J, Zagelbaum J, Lee J, Takahashi S, Cummings P, Schooley A, Dekker J, Gottesman ME, Rabadan R, Gautier J.
Mol Cell. 2023 Jul 20;83(14):2434-2448.e7. doi: 10.1016/j.molcel.2023.06.016
Mechanisms of insertions at a DNA double-strand break.
Zivanovic Y, Lopez P, Philippe H, Forterre P.
Nucleic Acids Res. 2002 May 1;30(9):1902-10. doi:10.1093/nar/30.9.1902
Pyrococcus genome comparison evidences chromosome shuffling-driven evolution.
Matteo Cossu, Violette Da Cunha, Claire Toffano-Nioche, Patrick Forterre, Jacques Oberto
Biochimie. 2015 Nov;118:313-21. doi:10.1016/j.biochi.2015.07.008
Comparative genomics reveals conserved positioning of essential genomic clusters in highly rearranged Thermococcales chromosomes
Hughes D.
Genome Biol. 2000;1(6):REVIEWS0006. doi: 10.1186/gb-2000-1-6-reviews0006.
Evaluating genome dynamics: the constraints on rearrangements within bacterial genomes.
Review reporting that “almost all rearrangements conserve the axis of chromosome replication”.
Tillier ER, Collins RA
Nat Genet. 2000 Oct;26(2):195-7. doi:10.1038/79918
Genome rearrangement by replication-directed translocation.
D'Iorio M, Dewar K.
Life Sci Alliance. 2022 Oct 19;6(1):e202201434. doi:10.26508/lsa.202201434
Replication-associated inversions are the dominant form of bacterial chromosome structural variation.
Eisen JA, Heidelberg JF, White O, Salzberg SL.
Genome Biol. 2000;1(6):RESEARCH0011. doi:10.1186/gb-2000-1-6-research0011
Evidence for symmetric chromosomal inversions around the replication origin in bacteria.
Ranz JM, Maurin D, Chan YS, von Grotthuss M, Hillier LW, Roote J, Ashburner M, Bergman CM.
PLoS Biol. 2007 Jun;5(6):e152. doi:10.1371/journal.pbio.0050152
Principles of genome evolution in the Drosophila melanogaster species group.
Cáceres M, Ranz JM, Barbadilla A, Long M, Ruiz A.
Science. 1999 Jul 16;285(5426):415-8. doi:10.1126/science.285.5426.415
Generation of a widespread Drosophila inversion by a transposable element.
Brooks AN, Hughes AL, Clauder-Münster S, Mitchell LA, Boeke JD, Steinmetz LM.
Science. 2022 Mar 4;375(6584):1000-1005. doi: 10.1126/science.abg0162
Transcriptional neighborhoods regulate transcript isoform lengths and expression levels.
Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D.
Proc Natl Acad Sci U S A. 2003 Sep 30;100(20):11484-9. doi:10.1073/pnas.1932072100
Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes.
“A chained alignment [is] an ordered sequence of traditional pairwise nucleotide alignments (“blocks”) separated by larger gaps, some of which may be simultaneous gaps in both species. [...] intervening DNA in one species that does not align with the other because it is locally inverted or has been inserted in by lineage-specific translocation or duplication is skipped”
“The chains are then put into a list sorted with the highest-scoring chain first. [...] each iteration taking the next chain off of the list, throwing out the parts of the chain that intersect with bases already covered by previously taken chains, and then marking the bases that are left in the chain as covered. [...] If a chain covers bases that are in a gap in a previously taken chain, it is marked as a child of the previous chain. In this way, a hierarchy of chains is formed that we call a net.”
“To be considered syntenic, a chain has to either have a very high score itself or be embedded in a larger chain, on the same chromosome, and come from the same region as the larger chain. Thus, inversions and tandem duplications are considered syntenic.”
“We define the (human) span of a chain to be the distance in bases in the human genome from the first to the last human base in the chain, including gaps, and we define the size of the chain as the number of aligning bases in it, not including gaps.”
Delprat A, Negre B, Puig M, Ruiz A.
PLoS One. 2009 Nov 18;4(11):e7883. doi:10.1371/journal.pone.0007883
The transposon Galileo generates natural chromosomal inversions in Drosophila by ectopic recombination.
Steinberg KM, Antonacci F, Sudmant PH, Kidd JM, Campbell CD, Vives L, Malig M, Scheinfeldt L, Beggs W, Ibrahim M, Lema G, Nyambo TB, Omar SA, Bodo JM, Froment A, Donnelly MP, Kidd KK, Tishkoff SA, Eichler EE.
Nat Genet. 2012 Jul 1;44(8):872-80. doi:10.1038/ng.2335
Structural diversity and African origin of the 17q21.31 inversion polymorphism.
Donnelly MP, Paschou P, Grigorenko E, Gurwitz D, Mehdi SQ, Kajuna SL, Barta C, Kungulilo S, Karoma NJ, Lu RB, Zhukova OV, Kim JJ, Comas D, Siniscalco M, New M, Li P, Li H, Manolopoulos VG, Speed WC, Rajeevan H, Pakstis AJ, Kidd JR, Kidd KK.
Am J Hum Genet. 2010 Feb 12;86(2):161-71. doi:10.1016/j.ajhg.2010.01.007
The distribution and most recent common ancestor of the 17q21 inversion in humans.
Stefansson H, Helgason A, Thorleifsson G, Steinthorsdottir V, Masson G, Barnard J, Baker A, Jonasdottir A, Ingason A, Gudnadottir VG, Desnica N, Hicks A, Gylfason A, Gudbjartsson DF, Jonsdottir GM, Sainz J, Agnarsson K, Birgisdottir B, Ghosh S, Olafsdottir A, Cazier JB, Kristjansson K, Frigge ML, Thorgeirsson TE, Gulcher JR, Kong A, Stefansson K.
Nat Genet. 2005 Feb;37(2):129-37. doi:10.1038/ng1508
A common inversion under selection in Europeans.
Machado CA, Haselkorn TS, Noor MA.
Genetics. 2007 Mar;175(3):1289-306. doi:10.1534/genetics.106.064758
Evaluation of the genomic extent of effects of fixed inversion differences on intraspecific variation and interspecific gene flow in Drosophila pseudoobscura and D. persimilis.
Dobzhansky T, Sturtevant AH
Genetics. 1938 Jan;23(1):28-64. doi:10.1093/genetics/23.1.28
Inversions in the Chromosomes of Drosophila Pseudoobscura.
Proc Natl Acad Sci U S A. 2000 Dec 19;97(26):14433-7. doi:10.1073/pnas.240462997
Seoighe C, Federspiel N, Jones T, Hansen N, Bivolarovic V, Surzycki R, Tamse R, Komp C, Huizar L, Davis RW, Scherer S, Tait E, Shaw DJ, Harris D, Murphy L, Oliver K, Taylor K, Rajandream MA, Barrell BG, Wolfe KH.
Prevalence of small inversions in yeast gene order evolution
Hämälä T, Wafula EK, Guiltinan MJ, Ralph PE, dePamphilis CW, Tiffin P.
Proc Natl Acad Sci U S A. 2021 Aug 31;118(35):e2102914118. doi:10.1073/pnas.2102914118
Genomic structural variants constrain and facilitate adaptation in natural populations of Theobroma cacao, the chocolate tree.
Mérot C, Berdan E, Cayuela H, Djambazian H, Ferchaud AL, Laporte M, Normandeau E, Ragoussis J, Wellenreuther M, Bernatchez L.
Mol Biol Evol. 2021 May 7:msab143. doi:10.1093/molbev/msab143
Locally adaptive inversions modulate genetic variation at different geographic scales in a seaweed fly.
“We sampled the seaweed fly Coelopa frigida along a bioclimatic gradient stretching across 10° of latitude, a salinity gradient and a range of heterogeneous, patchy habitats. We generated a chromosome-level genome assembly to analyse 1,446 low-coverage whole genomes collected along those gradients.“ “Analyses of more than 1,400 whole genomes of C. frigida flies revealed four large chromosomal inversions affecting a large fraction of the genome (36.1Mb, 15%), and three low recombining genomic regions.” “Among the 124,701 candidate SNPs identified by the GWAS, more than 99.8% were located in Cf-Inv(1)” “At a finer geographic scale, outlier SNPs associated with wrackbed abiotic characteristics (depth, temperature and salinity) were strongly enriched in the inverted region Cf-Inv(1) with an odds ratio of 5, including outliers with very strong support”
Satou Y, Sato A, Yasuo H, Mihirogi Y, Bishop J, Fujie M, Kawamitsu M, Hisata K, Satoh N.
Genome Biol Evol. 2021 Jun 8;13(6):evab068. doi:10.1093/gbe/evab068
Chromosomal Inversion Polymorphisms in Two Sympatric Ascidian Lineages.
“Chromosomal-level assemblies for the two type-B individuals, sampled at Roscoff (France; specimen R) and Plymouth (England; specimen P)” “For each specimen, we built contigs from long Nanopore reads with the NECAT assembler and polished obtained contigs with Illumina reads using the Nextpolish software [and removed overlaping contigs] using the purge_dups software [...]. Both of these contig sets were 140 Mb in length (table 1).” “Although the heterozygosity rate of type-A animals has been estimated to be 1.1–1.2% (Dehal et al. 2002; Satou et al. 2012), those of specimens R and P were 3.0% and 3.6%, respectively.”
“Exons are highly conserved between type-A and type-B animals, whereas intergenic regions are highly varied between the two types.” [Supplemental material shows 97% similarity in aligned regions between R and P and 94% similarity between R/P and A]
“To identify translocations, we searched the genomic sequences for blocks containing three or more genes that were not found in the positions expected from the type-A genome. We found two small interchromosomal translocations between the genomes of specimen P and type A, [but] we did not find such translocations between the genomes of specimen R and type A.”
“Similarly, to identify inversions, we searched the genomic sequences for blocks containing three or more genes that were mapped in the reverse direction to the order in the type-A genome. We found 21 and 20 inversions in the genomes of specimens R and P, respectively (supplementary fig. S7, Supplementary Material online). Among them, 15 sites were common, and the remaining 11 sites were specific to specimen R or P (supplementary fig. S10 and tables S3 and S4, Supplementary Material online). Thus, our data indicate that there are structural variations not only between type-A and type-B animals but also between specimens R and P.”
“We found R-derived haplotype contigs that were structurally different from the chromosome of specimen R, but the same as both the specimen-P and the type-A chromosomes. [In “an inversion in chromosome 7 [that] contained the largest number of genes over an ∼900-kb region”] genomic PCR using four primers flanking the junctional sites demonstrated that specimen R was indeed heterozygous” “We also confirmed experimentally an inversion on chromosome 3 [...]. A PCR analysis demonstrated that specimen P was indeed heterozygous in this region.” “we manually inspected genomic alignments of the two type-B specimens against the type-A genome, and found nine additional inversions.” “To understand how prevalent the inversions we found were in type-B populations, we collected eight wild specimens at Roscoff and Plymouth, RO1–RO4 and PL1–PL4, respectively, and performed genomic PCR [that demonstrated that] type-B animals indeed have inversion polymorphisms.” “The inversions we found in the present study are relatively small (3 kb ∼ 873 kb)” “our data highlight the likely importance of inversions in speciation of Ciona, an invertebrate with mating governed by interactions of aquatic gametes”.
Beyter D, Ingimundardottir H, Oddsson A, Eggertsson HP, Bjornsson E, Jonsson H, Atlason BA, Kristmundsdottir S, Mehringer S, Hardarson MT, Gudjonsson SA, Magnusdottir DN, Jonasdottir A, Jonasdottir A, Kristjansson RP, Sverrisson ST, Holley G, Palsson G, Stefansson OA, Eyjolfsson G, Olafsson I, Sigurdardottir O, Torfason B, Masson G, Helgason A, Thorsteinsdottir U, Holm H, Gudbjartsson DF, Sulem P, Magnusson OT, Halldorsson BV, Stefansson K.
Nat Genet. 2021 Jun;53(6):779-786. doi:10.1038/s41588-021-00865-4
Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits.
Wang J, Wurm Y, Nipitwattanaphon M, Riba-Grognuz O, Huang YC, Shoemaker D, Keller L.
Nature. 2013 Jan 31;493(7434):664-8. doi:10.1038/nature11832
A Y-like social chromosome causes alternative colony organization in fire ants.
Sun J, Chen C, Miyamoto N, Li R, Sigwart JD, Xu T, Sun Y, Wong WC, Ip JCH, Zhang W, Lan Y, Bissessur D, Watsuji TO, Watanabe HK, Takaki Y, Ikeo K, Fujii N, Yoshitake K, Qiu JW, Takai K, Qian PY.
Nat Commun. 2020 Apr 8;11(1):1657. doi: 10.1038/s41467-020-15522-3.
The Scaly-foot Snail genome and implications for the origins of biomineralised armour.
Schaeffer SW
Genetics. 2018 Sep;210(1):3-13. doi:10.1534/genetics.118.301084
Muller "Elements" in Drosophila: How the Search for the Genetic Basis for Speciation Led to the Birth of Comparative Genomics.
Proc Natl Acad Sci U S A. 1917 Sep;3(9):555-8 doi:10.1073/pnas.3.9.555
Sturtevant AH
Genetic Factors Affecting the Strength of Linkage in Drosophila.
Catacchio CR, Maggiolini FAM, D'Addabbo P, Bitonto M, Capozzi O, Lepore Signorile M, Miroballo M, Archidiacono N, Eichler EE, Ventura M, Antonacci F.
Genome Res. 2018 Jun;28(6):910-920. doi:10.1101/gr.234831.118
Inversion variants in human and primate genomes.
Pettersson ME, Rochus CM, Han F, Chen J, Hill J, Wallerman O, Fan G, Hong X, Xu Q, Zhang H, Liu S, Liu X, Haggerty L, Hunt T, Martin FJ, Flicek P, Bunikis I, Folkvord A, Andersson L.
Genome Res. 2019 Nov;29(11):1919-1928. doi:10.1101/gr.253435.119
A chromosome-level assembly of the Atlantic herring genome-detection of a supergene and other signals of selection.
Puddu F, Herzog M, Selivanova A, Wang S, Zhu J, Klein-Lavi S, Gordon M, Meirman R, Millan-Zambrano G, Ayestaran I, Salguero I, Sharan R, Li R, Kupiec M, Jackson SP.
Nature. 2019 Sep;573(7774):416-420. doi:10.1038/s41586-019-1549-9
Genome architecture and stability in the Saccharomyces cerevisiae knockout collection.
Loss of short chromosomes and gains of long chromosomes, changes in rDNA and telomere repeats, ...
Linardopoulou EV, Williams EM, Fan Y, Friedman C, Young JM, Trask BJ.
Nature. 2005 Sep 1;437(7055):94-100 doi:10.1038/nature04029
Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication.
Yue JX, Li J, Aigrain L, Hallin J, Persson K, Oliver K, Bergström A, Coupland P, Warringer J, Lagomarsino MC, Fischer G, Durbin R, Liti G.
Nat Genet. 2017 Jun;49(6):913-924. doi:10.1038/ng.3847
Contrasting evolutionary genome dynamics between domesticated and wild yeasts.
Yeast subtelomeres accumulate structural variants.
“For each subtelomere, we located its proximal boundary on the basis of the sudden loss of synteny conservation and demarcated its distal boundary by the telomere-associated core X and Y′ elements”
“All previously defined essential genes in S. cerevisiae S288C28 fell into the chromosomal cores, whereas all previously described subtelomeric duplication blocks in S288C were fully enclosed in our defined S288C subtelomeres. Furthermore, the genes from our defined subtelomeres showed 36.6-fold higher CNV accumulation than those from the cores”
“subtelomeric one-to-one orthologs also showed significantly higher nonsynonymous-to-synonymous substitution rate ratio (dN/dS) than those from the cores in the S. cerevisiae–S. cerevisiae and S. cerevisiae–S. paradoxus comparisons”
“rapid evolution of subtelomeres can substantially alter the gene repertoire and generate novel recombinants with adaptive potential”
Ghavi-Helm Y, Jankowski A, Meiers S, Viales RR, Korbel JO, Furlong EEM.
Nat Genet. 2019 Aug;51(8):1272-1282. doi:10.1038/s41588-019-0462-3
Highly rearranged chromosomes reveal uncoupling between genome topology and gene expression.
Long E, Evans C, Chaston J, Udall JA.
G3 (Bethesda). 2018 Oct 3;8(10):3247-3253. doi:10.1534/g3.118.200631
Genomic Structural Variations Within Five Continental Populations of Drosophila melanogaster.
Compares breakpoint position with a +-3 bp tolerance. Uses Assemblytics.
Cretu Stancu M, van Roosmalen MJ, Renkens I, Nieboer MM, Middelkamp S, de Ligt J, Pregno G, Giachino D, Mandrile G, Espejo Valle-Inclan J, Korzelius J, de Bruijn E, Cuppen E, Talkowski ME, Marschall T, de Ridder J, Kloosterman WP.
Nat Commun. 2017 Nov 6;8(1):1326. doi:10.1038/s41467-017-01343-4
Mapping and phasing of structural variation in patient genomes using nanopore sequencing.