Analyze Diet
Communications biology2018; 1; 197; doi: 10.1038/s42003-018-0199-z

Improved reference genome for the domestic horse increases assembly contiguity and composition.

Abstract: Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes is improved from 2.33 Gb in EquCab2 to 2.41 Gb in EquCab3. Contiguity has also been improved nearly 40-fold with a contig N50 of 4.5 Mb and scaffold contiguity enhanced to where all but one of the 32 chromosomes is comprised of a single scaffold.
Publication Date: 2018-11-16 PubMed ID: 30456315PubMed Central: PMC6240028DOI: 10.1038/s42003-018-0199-zGoogle Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
  • Journal Article

Summary

This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.

The research article presents an improved reference genome for domestic horses called EquCab3, which provides a more contiguous and comprehensive genetic map than the previous version, EquCab2.

Background

  • The genomic reference for domestic horses, EquCab2, was introduced in 2007. Despite being of good quality, it had several limitations typically seen in first-generation Sanger assemblies.
  • In 2014, the equine genomics research community initiated a project to enhance the horse’s reference sequence. Their aim was to build on the robust foundation of EquCab2 by integrating new short-read data, long-read data, and proximity ligation data.

Introduction of EquCab3

  • The researchers created EquCab3, an advanced reference genome for domestic horses that provides higher assembly contiguity and composition.
  • The count of non-N bases in the incorporated chromosomes in EquCab3 is 2.41 Gb, up from 2.33 Gb in EquCab2, indicating an increased genetic map coverage.

Improved Contiguity

  • The researchers enhanced the contiguity nearly 40-fold, with a contig N50 of 4.5 Mb. The term “contig N50” is a statistical measure of the average length of a set of sequences (contigs).
  • Scaffold contiguity was also increased to the extent that all but one of the 32 chromosomes is made up of a single scaffold. Scaffolds are sets of overlapping DNA pieces that together represent a consensus region of DNA

Significance of the Study

  • This research will significantly benefit the field of equine genomics by providing a more accurate, complete, and usable genomic reference for the domestic horse.
  • The enhanced genome map can be instrumental for biological research and practical applications ranging from evolutionary studies to equine health management.

Cite This Article

APA
Kalbfleisch TS, Rice ES, DePriest MS, Walenz BP, Hestand MS, Vermeesch JR, O Connell BL, Fiddes IT, Vershinina AO, Saremi NF, Petersen JL, Finno CJ, Bellone RR, McCue ME, Brooks SA, Bailey E, Orlando L, Green RE, Miller DC, Antczak DF, MacLeod JN. (2018). Improved reference genome for the domestic horse increases assembly contiguity and composition. Commun Biol, 1, 197. https://doi.org/10.1038/s42003-018-0199-z

Publication

ISSN: 2399-3642
NlmUniqueID: 101719179
Country: England
Language: English
Volume: 1
Pages: 197
PII: 197

Researcher Affiliations

Kalbfleisch, Theodore S
  • Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, 40292, USA. ted.kalbfleisch@louisville.edu.
Rice, Edward S
  • Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
DePriest, Michael S
  • Department of Biochemistry and Molecular Genetics, School of Medicine, University of Louisville, Louisville, KY, 40292, USA.
Walenz, Brian P
  • Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
Hestand, Matthew S
  • Center for Human Genetics, Katholieke University Leuven (KU Leuven), 3000, Leuven, Belgium.
Vermeesch, Joris R
  • Center for Human Genetics, Katholieke University Leuven (KU Leuven), 3000, Leuven, Belgium.
O Connell, Brendan L
  • Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
  • Medical and Molecular Genetics, Oregon Health and Science University, Portland, OR, 97239, USA.
Fiddes, Ian T
  • Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
  • 10x Genomics, Inc., Pleasanton, CA, 94566, USA.
Vershinina, Alisa O
  • Department of Ecology and Evolutionary Biology, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
Saremi, Nedda F
  • Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
Petersen, Jessica L
  • Department of Animal Science, University of Nebraska - Lincoln, Lincoln, NE, 68583-0908, USA.
Finno, Carrie J
  • Department of Population Health and Reproduction, University of California, Davis, CA, 95616, USA.
Bellone, Rebecca R
  • Department of Population Health and Reproduction, University of California, Davis, CA, 95616, USA.
  • Veterinary Genetics Laboratory, University of California, Davis, CA, 95616, USA.
McCue, Molly E
  • Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, 55108, USA.
Brooks, Samantha A
  • UF Genetics Institute, Department of Animal Sciences, University of Florida, Gainesville, FL 32611, USA.
Bailey, Ernest
  • Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Lexington, KY, 40546, USA.
Orlando, Ludovic
  • Centre for GeoGenetics, Natural History Museum of Denmark, 1350K, Copenhagen, Denmark.
  • Laboratoire d'Anthropobiologie Moléculaire et d'Imagerie de Synthèse UMR 5288, Université de Toulouse, CNRS, Université Paul Sabatier, Toulouse, France.
Green, Richard E
  • Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, 95064, USA.
Miller, Donald C
  • Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, 14853, USA.
Antczak, Douglas F
  • Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, 14853, USA.
MacLeod, James N
  • Gluck Equine Research Center, Department of Veterinary Science, University of Kentucky, Lexington, KY, 40546, USA.

Grant Funding

  • K01 OD015134 / NIH HHS
  • L40 TR001136 / NCATS NIH HHS

Conflict of Interest Statement

I.T.F. is an employee of 10× Genomics, Inc. R.E.G. is a co-founder and scientific adviser of Dovetail Genomics, LLC. The other authors declare no competing interests.

References

This article includes 55 references
  1. Outram AK, Stear NA, Bendrey R, Olsen S, Kasparov A, Zaibert V, Thorpe N, Evershed RP. The earliest horse harnessing and milking.. Science 2009 Mar 6;323(5919):1332-5.
    doi: 10.1126/science.1168594pubmed: 19265018google scholar: lookup
  2. Wade CM, Giulotto E, Sigurdsson S, Zoli M, Gnerre S, Imsland F, Lear TL, Adelson DL, Bailey E, Bellone RR, Blöcker H, Distl O, Edgar RC, Garber M, Leeb T, Mauceli E, MacLeod JN, Penedo MC, Raison JM, Sharpe T, Vogel J, Andersson L, Antczak DF, Biagi T, Binns MM, Chowdhary BP, Coleman SJ, Della Valle G, Fryc S, Guérin G, Hasegawa T, Hill EW, Jurka J, Kiialainen A, Lindgren G, Liu J, Magnani E, Mickelson JR, Murray J, Nergadze SG, Onofrio R, Pedroni S, Piras MF, Raudsepp T, Rocchi M, Røed KH, Ryder OA, Searle S, Skow L, Swinburne JE, Syvänen AC, Tozaki T, Valberg SJ, Vaudin M, White JR, Zody MC, Lander ES, Lindblad-Toh K. Genome sequence, comparative analysis, and population genetics of the domestic horse.. Science 2009 Nov 6;326(5954):865-7.
    doi: 10.1126/science.1178158pmc: PMC3785132pubmed: 19892987google scholar: lookup
  3. Coleman SJ, Zeng Z, Wang K, Luo S, Khrebtukova I, Mienaltowski MJ, Schroth GP, Liu J, MacLeod JN. Structural annotation of equine protein-coding genes determined by mRNA sequencing.. Anim Genet 2010 Dec;41 Suppl 2:121-30.
  4. Vanderman KS, Tremblay M, Zhu W, Shimojo M, Mienaltowski MJ, Coleman SJ, MacLeod JN. Brother of CDO (BOC) expression in equine articular cartilage.. Osteoarthritis Cartilage 2011 Apr;19(4):435-8.
    doi: 10.1016/j.joca.2011.01.011pubmed: 21262369google scholar: lookup
  5. Schaefer RJ, Schubert M, Bailey E, Bannasch DL, Barrey E, Bar-Gal GK, Brem G, Brooks SA, Distl O, Fries R, Finno CJ, Gerber V, Haase B, Jagannathan V, Kalbfleisch T, Leeb T, Lindgren G, Lopes MS, Mach N, da Câmara Machado A, MacLeod JN, McCoy A, Metzger J, Penedo C, Polani S, Rieder S, Tammen I, Tetens J, Thaller G, Verini-Supplizi A, Wade CM, Wallner B, Orlando L, Mickelson JR, McCue ME. Developing a 670k genotyping array to tag ~2M SNPs across 24 horse breeds.. BMC Genomics 2017 Jul 27;18(1):565.
    doi: 10.1186/s12864-017-3943-8pmc: PMC5530493pubmed: 28750625google scholar: lookup
  6. Petersen JL, Mickelson JR, Rendahl AK, Valberg SJ, Andersson LS, Axelsson J, Bailey E, Bannasch D, Binns MM, Borges AS, Brama P, da Câmara Machado A, Capomaccio S, Cappelli K, Cothran EG, Distl O, Fox-Clipsham L, Graves KT, Guérin G, Haase B, Hasegawa T, Hemmann K, Hill EW, Leeb T, Lindgren G, Lohi H, Lopes MS, McGivney BA, Mikko S, Orr N, Penedo MC, Piercy RJ, Raekallio M, Rieder S, Røed KH, Swinburne J, Tozaki T, Vaudin M, Wade CM, McCue ME. Genome-wide analysis reveals selection for important traits in domestic horse breeds.. PLoS Genet 2013;9(1):e1003211.
  7. McCue ME, Bannasch DL, Petersen JL, Gurr J, Bailey E, Binns MM, Distl O, Guérin G, Hasegawa T, Hill EW, Leeb T, Lindgren G, Penedo MC, Røed KH, Ryder OA, Swinburne JE, Tozaki T, Valberg SJ, Vaudin M, Lindblad-Toh K, Wade CM, Mickelson JR. A high density SNP array for the domestic horse and extant Perissodactyla: utility for association mapping, genetic diversity, and phylogeny studies.. PLoS Genet 2012 Jan;8(1):e1002451.
  8. Bellone RR. Pleiotropic effects of pigmentation genes in horses.. Anim Genet 2010 Dec;41 Suppl 2:100-10.
  9. Bellone RR, Holl H, Setaluri V, Devi S, Maddodi N, Archer S, Sandmeyer L, Ludwig A, Foerster D, Pruvost M, Reissmann M, Bortfeldt R, Adelson DL, Lim SL, Nelson J, Haase B, Engensteiner M, Leeb T, Forsyth G, Mienaltowski MJ, Mahadevan P, Hofreiter M, Paijmans JL, Gonzalez-Fortes G, Grahn B, Brooks SA. Evidence for a retroviral insertion in TRPM1 as the cause of congenital stationary night blindness and leopard complex spotting in the horse.. PLoS One 2013;8(10):e78280.
  10. Brooks SA, Gabreski N, Miller D, Brisbin A, Brown HE, Streeter C, Mezey J, Cook D, Antczak DF. Whole-genome SNP association in the horse: identification of a deletion in myosin Va responsible for Lavender Foal Syndrome.. PLoS Genet 2010 Apr 15;6(4):e1000909.
  11. Staiger EA, Tseng CT, Miller D, Cassano JM, Nasir L, Garrick D, Brooks SA, Antczak DF. Host genetic influence on papillomavirus-induced tumors in the horse.. Int J Cancer 2016 Aug 15;139(4):784-92.
    doi: 10.1002/ijc.30120pubmed: 27037728google scholar: lookup
  12. Sarkar S, Bailey E, Go YY, Cook RF, Kalbfleisch T, Eberth J, Chelvarajan RL, Shuck KM, Artiushin S, Timoney PJ, Balasuriya UB. Allelic Variation in CXCL16 Determines CD3+ T Lymphocyte Susceptibility to Equine Arteritis Virus Infection and Establishment of Long-Term Carrier State in the Stallion.. PLoS Genet 2016 Dec;12(12):e1006467.
  13. Bellone RR, Liu J, Petersen JL, Mack M, Singer-Berk M, Drögemüller C, Malvick J, Wallner B, Brem G, Penedo MC, Lassaline M. A missense mutation in damage-specific DNA binding protein 2 is a genetic risk factor for limbal squamous cell carcinoma in horses.. Int J Cancer 2017 Jul 15;141(2):342-353.
    doi: 10.1002/ijc.30744pubmed: 28425625google scholar: lookup
  14. Gaunitz C, Fages A, Hanghøj K, Albrechtsen A, Khan N, Schubert M, Seguin-Orlando A, Owens IJ, Felkel S, Bignon-Lau O, de Barros Damgaard P, Mittnik A, Mohaseb AF, Davoudi H, Alquraishi S, Alfarhan AH, Al-Rasheid KAS, Crubézy E, Benecke N, Olsen S, Brown D, Anthony D, Massy K, Pitulko V, Kasparov A, Brem G, Hofreiter M, Mukhtarova G, Baimukhanov N, Lõugas L, Onar V, Stockhammer PW, Krause J, Boldgiv B, Undrakhbold S, Erdenebaatar D, Lepetz S, Mashkour M, Ludwig A, Wallner B, Merz V, Merz I, Zaibert V, Willerslev E, Librado P, Outram AK, Orlando L. Ancient genomes revisit the ancestry of domestic and Przewalski's horses.. Science 2018 Apr 6;360(6384):111-114.
    doi: 10.1126/science.aao3297pubmed: 29472442google scholar: lookup
  15. Schubert M, Jónsson H, Chang D, Der Sarkissian C, Ermini L, Ginolhac A, Albrechtsen A, Dupanloup I, Foucal A, Petersen B, Fumagalli M, Raghavan M, Seguin-Orlando A, Korneliussen TS, Velazquez AM, Stenderup J, Hoover CA, Rubin CJ, Alfarhan AH, Alquraishi SA, Al-Rasheid KA, MacHugh DE, Kalbfleisch T, MacLeod JN, Rubin EM, Sicheritz-Ponten T, Andersson L, Hofreiter M, Marques-Bonet T, Gilbert MT, Nielsen R, Excoffier L, Willerslev E, Shapiro B, Orlando L. Prehistoric genomes reveal the genetic foundation and cost of horse domestication.. Proc Natl Acad Sci U S A 2014 Dec 30;111(52):E5661-9.
    doi: 10.1073/pnas.1416991111pmc: PMC4284583pubmed: 25512547google scholar: lookup
  16. Librado P, Der Sarkissian C, Ermini L, Schubert M, Jónsson H, Albrechtsen A, Fumagalli M, Yang MA, Gamba C, Seguin-Orlando A, Mortensen CD, Petersen B, Hoover CA, Lorente-Galdos B, Nedoluzhko A, Boulygina E, Tsygankova S, Neuditschko M, Jagannathan V, Thèves C, Alfarhan AH, Alquraishi SA, Al-Rasheid KA, Sicheritz-Ponten T, Popov R, Grigoriev S, Alekseev AN, Rubin EM, McCue M, Rieder S, Leeb T, Tikhonov A, Crubézy E, Slatkin M, Marques-Bonet T, Nielsen R, Willerslev E, Kantanen J, Prokhortchouk E, Orlando L. Tracking the origins of Yakutian horses and the genetic basis for their fast adaptation to subarctic environments.. Proc Natl Acad Sci U S A 2015 Dec 15;112(50):E6889-97.
    doi: 10.1073/pnas.1513696112pmc: PMC4687531pubmed: 26598656google scholar: lookup
  17. Librado P, Gamba C, Gaunitz C, Der Sarkissian C, Pruvost M, Albrechtsen A, Fages A, Khan N, Schubert M, Jagannathan V, Serres-Armero A, Kuderna LFK, Povolotskaya IS, Seguin-Orlando A, Lepetz S, Neuditschko M, Thèves C, Alquraishi S, Alfarhan AH, Al-Rasheid K, Rieder S, Samashev Z, Francfort HP, Benecke N, Hofreiter M, Ludwig A, Keyser C, Marques-Bonet T, Ludes B, Crubézy E, Leeb T, Willerslev E, Orlando L. Ancient genomic changes associated with domestication of the horse.. Science 2017 Apr 28;356(6336):442-445.
    doi: 10.1126/science.aam5298pubmed: 28450643google scholar: lookup
  18. Rebolledo-Mendez J, Hestand MS, Coleman SJ, Zeng Z, Orlando L, MacLeod JN, Kalbfleisch T. Comparison of the Equine Reference Sequence with Its Sanger Source Data and New Illumina Reads.. PLoS One 2015;10(6):e0126852.
  19. Hestand MS, Kalbfleisch TS, Coleman SJ, Zeng Z, Liu J, Orlando L, MacLeod JN. Annotation of the Protein Coding Regions of the Equine Genome.. PLoS One 2015;10(6):e0124375.
  20. Raudsepp T, Gustafson-Seabury A, Durkin K, Wagner ML, Goh G, Seabury CM, Brinkmeyer-Langford C, Lee EJ, Agarwala R, Stallknecht-Rice E, Schäffer AA, Skow LC, Tozaki T, Yasue H, Penedo MC, Lyons LA, Khazanehdari KA, Binns MM, MacLeod JN, Distl O, Guérin G, Leeb T, Mickelson JR, Chowdhary BP. A 4,103 marker integrated physical and comparative map of the horse genome.. Cytogenet Genome Res 2008;122(1):28-36.
    doi: 10.1159/000151313pmc: PMC2587302pubmed: 18931483google scholar: lookup
  21. Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. The MaSuRCA genome assembler.. Bioinformatics 2013 Nov 1;29(21):2669-77.
  22. Miller JR, Delcher AL, Koren S, Venter E, Walenz BP, Brownley A, Johnson J, Li K, Mobarry C, Sutton G. Aggressive assembly of pyrosequencing reads with mates.. Bioinformatics 2008 Dec 15;24(24):2818-24.
  23. Putnam NH, O'Connell BL, Stites JC, Rice BJ, Blanchette M, Calef R, Troll CJ, Fields A, Hartley PD, Sugnet CW, Haussler D, Rokhsar DS, Green RE. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage.. Genome Res 2016 Mar;26(3):342-50.
    doi: 10.1101/gr.193474.115pmc: PMC4772016pubmed: 26848124google scholar: lookup
  24. Marks P, Garcia S, Barrio AM, Belhocine K, Bernate J, Bharadwaj R, Bjornson K, Catalanotti C, Delaney J, Fehr A, Fiddes IT, Galvin B, Heaton H, Herschleb J, Hindson C, Holt E, Jabara CB, Jett S, Keivanfar N, Kyriazopoulou-Panagiotopoulou S, Lek M, Lin B, Lowe A, Mahamdallie S, Maheshwari S, Makarewicz T, Marshall J, Meschi F, O'Keefe CJ, Ordonez H, Patel P, Price A, Royall A, Ruark E, Seal S, Schnall-Levin M, Shah P, Stafford D, Williams S, Wu I, Xu AW, Rahman N, MacArthur D, Church DM. Resolving the full spectrum of human genome variation using Linked-Reads.. Genome Res 2019 Apr;29(4):635-645.
    pmc: PMC6442396pubmed: 30894395doi: 10.1101/gr.234443.118google scholar: lookup
  25. Burns EN, Bordbari MH, Mienaltowski MJ, Affolter VK, Barro MV, Gianino F, Gianino G, Giulotto E, Kalbfleisch TS, Katzman SA, Lassaline M, Leeb T, Mack M, Müller EJ, MacLeod JN, Ming-Whitfield B, Alanis CR, Raudsepp T, Scott E, Vig S, Zhou H, Petersen JL, Bellone RR, Finno CJ. Generation of an equine biobank to be used for Functional Annotation of Animal Genomes project.. Anim Genet 2018 Dec;49(6):564-570.
    doi: 10.1111/age.12717pmc: PMC6264908pubmed: 30311254google scholar: lookup
  26. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.. Bioinformatics 2015 Oct 1;31(19):3210-2.
    doi: 10.1093/bioinformatics/btv351pubmed: 26059717google scholar: lookup
  27. Fiddes IT, Armstrong J, Diekhans M, Nachtweide S, Kronenberg ZN, Underwood JG, Gordon D, Earl D, Keane T, Eichler EE, Haussler D, Stanke M, Paten B. Comparative Annotation Toolkit (CAT)-simultaneous clade and personal genome annotation.. Genome Res 2018 Jul;28(7):1029-1038.
    doi: 10.1101/gr.233460.117pmc: PMC6028123pubmed: 29884752google scholar: lookup
  28. Berlin K, Koren S, Chin CS, Drake JP, Landolin JM, Phillippy AM. Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.. Nat Biotechnol 2015 Jun;33(6):623-30.
    doi: 10.1038/nbt.3238pubmed: 26006009google scholar: lookup
  29. Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R, Bernstein B, Bender MA, Groudine M, Gnirke A, Stamatoyannopoulos J, Mirny LA, Lander ES, Dekker J. Comprehensive mapping of long-range interactions reveals folding principles of the human genome.. Science 2009 Oct 9;326(5950):289-93.
    doi: 10.1126/science.1181369pmc: PMC2858594pubmed: 19815776google scholar: lookup
  30. Deng X, Ma W, Ramani V, Hill A, Yang F, Ay F, Berletch JB, Blau CA, Shendure J, Duan Z, Noble WS, Disteche CM. Bipartite structure of the inactive mouse X chromosome.. Genome Biol 2015 Aug 7;16(1):152.
    doi: 10.1186/s13059-015-0728-8pmc: PMC4539712pubmed: 26248554google scholar: lookup
  31. Myers EW, Sutton GG, Delcher AL, Dew IM, Fasulo DP, Flanigan MJ, Kravitz SA, Mobarry CM, Reinert KH, Remington KA, Anson EL, Bolanos RA, Chou HH, Jordan CM, Halpern AL, Lonardi S, Beasley EM, Brandon RC, Chen L, Dunn PJ, Lai Z, Liang Y, Nusskern DR, Zhan M, Zhang Q, Zheng X, Rubin GM, Adams MD, Venter JC. A whole-genome assembly of Drosophila.. Science 2000 Mar 24;287(5461):2196-204.
    doi: 10.1126/science.287.5461.2196pubmed: 10731133google scholar: lookup
  32. Koren S, Harhay GP, Smith TP, Bono JL, Harhay DM, Mcvey SD, Radune D, Bergman NH, Phillippy AM. Reducing assembly complexity of microbial genomes with single-molecule sequencing.. Genome Biol 2013;14(9):R101.
    doi: 10.1186/gb-2013-14-9-r101pmc: PMC4053942pubmed: 24034426google scholar: lookup
  33. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL. Versatile and open software for comparing large genomes.. Genome Biol 2004;5(2):R12.
    doi: 10.1186/gb-2004-5-2-r12pmc: PMC395750pubmed: 14759262google scholar: lookup
  34. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013;1303:3997v2.
  35. Wu TD, Watanabe CK. GMAP: a genomic mapping and alignment program for mRNA and EST sequences.. Bioinformatics 2005 May 1;21(9):1859-75.
    doi: 10.1093/bioinformatics/bti310pubmed: 15728110google scholar: lookup
  36. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform.. Bioinformatics 2009 Jul 15;25(14):1754-60.
  37. Eizenga J. SeqPrep2: tool for stripping adaptors and/or merging paired reads with overlap into single reads. GitHub 2011.
  38. Li H. Toolkit for processing sequences in FASTA/Q formats. GitHub 2012.
  39. Green RE, Malaspinas AS, Krause J, Briggs AW, Johnson PL, Uhler C, Meyer M, Good JM, Maricic T, Stenzel U, Prüfer K, Siebauer M, Burbano HA, Ronan M, Rothberg JM, Egholm M, Rudan P, Brajković D, Kućan Z, Gusić I, Wikström M, Laakkonen L, Kelso J, Slatkin M, Pääbo S. A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing.. Cell 2008 Aug 8;134(3):416-26.
    doi: 10.1016/j.cell.2008.06.021pmc: PMC2602844pubmed: 18692465google scholar: lookup
  40. Xu X, Arnason U. The complete mitochondrial DNA sequence of the horse, Equus caballus: extensive heteroplasmy of the control region.. Gene 1994 Oct 21;148(2):357-62.
    doi: 10.1016/0378-1119(94)90713-7pubmed: 7958969google scholar: lookup
  41. White JR, Roberts M, Yorke JA, Pop M. Figaro: a novel statistical method for vector sequence removal.. Bioinformatics 2008 Feb 15;24(4):462-7.
  42. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform.. Bioinformatics 2010 Mar 1;26(5):589-95.
  43. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative genomics viewer.. Nat Biotechnol 2011 Jan;29(1):24-6.
    doi: 10.1038/nbt.1754pmc: PMC3346182pubmed: 21221095google scholar: lookup
  44. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 2011;17:10–12.
    doi: 10.14806/ej.17.1.200google scholar: lookup
  45. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The Sequence Alignment/Map format and SAMtools.. Bioinformatics 2009 Aug 15;25(16):2078-9.
  46. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.. Genome Res 2010 Sep;20(9):1297-303.
    doi: 10.1101/gr.107524.110pmc: PMC2928508pubmed: 20644199google scholar: lookup
  47. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner.. Bioinformatics 2013 Jan 1;29(1):15-21.
  48. Kalbfleisch TS. Eq쪳 single nucleotide error correction. Github 2018.
  49. Morgulis A, Gertz EM, Schäffer AA, Agarwala R. A fast and symmetric DUST implementation to mask low-complexity DNA sequences.. J Comput Biol 2006 Jun;13(5):1028-40.
    doi: 10.1089/cmb.2006.13.1028pubmed: 16796549google scholar: lookup
  50. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments.. Genome Biol 2014 Mar 3;15(3):R46.
    doi: 10.1186/gb-2014-15-3-r46pmc: PMC4053813pubmed: 24580807google scholar: lookup
  51. Paten B, Earl D, Nguyen N, Diekhans M, Zerbino D, Haussler D. Cactus: Algorithms for genome multiple sequence alignment.. Genome Res 2011 Sep;21(9):1512-28.
    doi: 10.1101/gr.123356.111pmc: PMC3166836pubmed: 21665927google scholar: lookup
  52. Samtools. A JAVA API for high-throughput sequencing data (HTS) formats. GitHub 2016.
  53. Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging.. BMC Res Notes 2016 Feb 12;9:88.
    doi: 10.1186/s13104-016-1900-2pmc: PMC4751634pubmed: 26868221google scholar: lookup
  54. Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets.. Bioinformatics 2011 Mar 15;27(6):863-4.
  55. Kalbfleisch TS. Eq쪳 intermediate assemblies. CyVerse Data Commons 2018.
    doi: 10.7946/p20348google scholar: lookup

Citations

This article has been cited 128 times.