Analyze Diet
Microbial biotechnology2012; 6(2); 150-156; doi: 10.1111/1751-7915.12006

Tips and tricks for the assembly of a Corynebacterium pseudotuberculosis genome using a semiconductor sequencer.

Abstract: New sequencing platforms have enabled rapid decoding of complete prokaryotic genomes at relatively low cost. The Ion Torrent platform is an example of these technologies, characterized by lower coverage, generating challenges for the genome assembly. One particular problem is the lack of genomes that enable reference-based assembly, such as the one used in the present study, Corynebacterium pseudotuberculosis biovar equi, which causes high economic losses in the US equine industry. The quality treatment strategy incorporated into the assembly pipeline enabled a 16-fold greater use of the sequencing data obtained compared with traditional quality filter approaches. Data preprocessing prior to the de novo assembly enabled the use of known methodologies in the next-generation sequencing data assembly. Moreover, manual curation was proved to be essential for ensuring a quality assembly, which was validated by comparative genomics with other species of the genus Corynebacterium. The present study presents a modus operandi that enables a greater and better use of data obtained from semiconductor sequencing for obtaining the complete genome from a prokaryotic microorganism, C. pseudotuberculosis, which is not a traditional biological model such as Escherichia coli.
Publication Date: 2012-12-02 PubMed ID: 23199210PubMed Central: PMC3917457DOI: 10.1111/1751-7915.12006Google Scholar: Lookup
The Equine Research Bank provides access to a large database of publicly available scientific literature. Inclusion in the Research Bank does not imply endorsement of study methods or findings by Mad Barn.
  • Journal Article
  • Research Support
  • Non-U.S. Gov't

Summary

This research summary has been generated with artificial intelligence and may contain errors and omissions. Refer to the original study to confirm details provided. Submit correction.

This research focuses on the process and techniques used by scientists for assembling the genome of a bacterium, Corynebacterium pseudotuberculosis biovar equi, using next-generation sequencing technology, specifically the Ion Torrent platform.

Assembly of a Bacterial Genome

  • The study begins by acknowledging the importance of new sequencing platforms like the Ion Torrent, which have allowed for the decoding of complete prokaryotic genomes at a relatively low cost.
  • One challenge with this technology is lower sequencing coverage, making genome assembly more difficult, especially for organisms with genome sequences not already thoroughly available for reference-based assembly.
  • In this case, the team addressed the genome sequencing of Corynebacterium pseudotuberculosis biovar equi, a bacterium that causes harm and economic loss to the US equine industry.

Quality Treatment Strategy and Assembly Pipeline

  • The strategy used in this research allowed for a 16-fold increase in the usage of sequencing data, compared to the traditional quality filter approaches.
  • In the assembly pipeline, data pre-processing was carried out before the de novo assembly, which is the assembly of a genome from scratch without using any reference genome.
  • This step was crucial as this allowed for the application of known methodologies in the next-generation sequencing data assembly.

Manual Curation and Genome Validation

  • Manual curation emerged as an invaluable part of ensuring the quality of the assembly. This involves reviewing and correcting the automatic outputs of a genome assembly.
  • The accuracy and quality of the assembly were then validated through a process of comparative genomics, whereby the genome of the studied organism was cross-examined against genomes of other Corynebacterium species.

Novel Modus Operandi

  • The study presents a unique workflow for effectively using data from semiconductor sequencing to obtain the complete genome of prokaryotic microbes like Corynebacterium pseudotuberculosis.
  • This method can be particularly useful when dealing with organisms that are not traditional biological models like E. coli, where the genome is well-documented.

Cite This Article

APA
Ramos RT, Carneiro AR, Soares Sde C, dos Santos AR, Almeida S, Guimarães L, Figueira F, Barbosa E, Tauch A, Azevedo V, Silva A. (2012). Tips and tricks for the assembly of a Corynebacterium pseudotuberculosis genome using a semiconductor sequencer. Microb Biotechnol, 6(2), 150-156. https://doi.org/10.1111/1751-7915.12006

Publication

ISSN: 1751-7915
NlmUniqueID: 101316335
Country: United States
Language: English
Volume: 6
Issue: 2
Pages: 150-156

Researcher Affiliations

Ramos, Rommel Thiago Jucá
  • Institute of Biological Sciences, Federal University Pará, Belém, Pará, Brazil.
Carneiro, Adriana Ribeiro
    Soares, Siomar de Castro
      dos Santos, Anderson Rodrigues
        Almeida, Sintia
          Guimarães, Luis
            Figueira, Flávia
              Barbosa, Eudes
                Tauch, Andreas
                  Azevedo, Vasco
                    Silva, Artur

                      MeSH Terms

                      • Animals
                      • Computational Biology / methods
                      • Corynebacterium Infections / microbiology
                      • Corynebacterium Infections / veterinary
                      • Corynebacterium pseudotuberculosis / genetics
                      • Corynebacterium pseudotuberculosis / isolation & purification
                      • DNA, Bacterial / analysis
                      • Electrochemical Techniques / instrumentation
                      • Electrochemical Techniques / methods
                      • Equipment Design
                      • Genome, Bacterial / genetics
                      • Genomics / methods
                      • Horse Diseases / microbiology
                      • Horses
                      • Semiconductors
                      • Sequence Analysis, DNA / instrumentation
                      • Sequence Analysis, DNA / methods
                      • Software

                      References

                      This article includes 22 references
                      1. Alikhan NF, Petty NK, Ben Zakour NL, Beatson SA. BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons.. BMC Genomics 2011 Aug 8;12:402.
                        pmc: PMC3163573pubmed: 21824423doi: 10.1186/1471-2164-12-402google scholar: lookup
                      2. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool.. J Mol Biol 1990 Oct 5;215(3):403-10.
                        pubmed: 2231712doi: 10.1016/S0022-2836(05)80360-2google scholar: lookup
                      3. Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J. ACT: the Artemis Comparison Tool.. Bioinformatics 2005 Aug 15;21(16):3422-3.
                        pubmed: 15976072doi: 10.1093/bioinformatics/bti553google scholar: lookup
                      4. Cerdeira LT, Carneiro AR, Ramos RT, de Almeida SS, D'Afonseca V, Schneider MP, Baumbach J, Tauch A, McCulloch JA, Azevedo VA, Silva A. Rapid hybrid de novo assembly of a microbial genome using only short reads: Corynebacterium pseudotuberculosis I19 as a case study.. J Microbiol Methods 2011 Aug;86(2):218-23.
                        pubmed: 21620904doi: 10.1016/j.mimet.2011.05.008google scholar: lookup
                      5. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene identification with GLIMMER.. Nucleic Acids Res 1999 Dec 1;27(23):4636-41.
                        pmc: PMC148753pubmed: 10556321doi: 10.1093/nar/27.23.4636google scholar: lookup
                      6. Doherr MG, Carpenter TE, Hanson KM, Wilson WD, Gardner IA. Risk factors associated with Corynebacterium pseudotuberculosis infection in California horses.. Prev Vet Med 1998 Jun 30;35(4):229-39.
                        pubmed: 9689656doi: 10.1016/s0167-5877(98)00071-3google scholar: lookup
                      7. Dorella FA, Pacheco LG, Oliveira SC, Miyoshi A, Azevedo V. Corynebacterium pseudotuberculosis: microbiology, biochemical properties, pathogenesis and molecular studies of virulence.. Vet Res 2006 Mar-Apr;37(2):201-18.
                        pubmed: 16472520doi: 10.1051/vetres:2005056google scholar: lookup
                      8. Earl D, Bradnam K, St John J, Darling A, Lin D, Fass J, Yu HO, Buffalo V, Zerbino DR, Diekhans M, Nguyen N, Ariyaratne PN, Sung WK, Ning Z, Haimel M, Simpson JT, Fonseca NA, Birol İ, Docking TR, Ho IY, Rokhsar DS, Chikhi R, Lavenier D, Chapuis G, Naquin D, Maillet N, Schatz MC, Kelley DR, Phillippy AM, Koren S, Yang SP, Wu W, Chou WC, Srivastava A, Shaw TI, Ruby JG, Skewes-Cox P, Betegon M, Dimon MT, Solovyev V, Seledtsov I, Kosarev P, Vorobyev D, Ramirez-Gonzalez R, Leggett R, MacLean D, Xia F, Luo R, Li Z, Xie Y, Liu B, Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Yin S, Sharpe T, Hall G, Kersey PJ, Durbin R, Jackman SD, Chapman JA, Huang X, DeRisi JL, Caccamo M, Li Y, Jaffe DB, Green RE, Haussler D, Korf I, Paten B. Assemblathon 1: a competitive assessment of de novo short read assembly methods.. Genome Res 2011 Dec;21(12):2224-41.
                        pmc: PMC3227110pubmed: 21926179doi: 10.1101/gr.126599.111google scholar: lookup
                      9. Hernandez D, François P, Farinelli L, Osterås M, Schrenzel J. De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer.. Genome Res 2008 May;18(5):802-9.
                        pmc: PMC2336802pubmed: 18332092doi: 10.1101/gr.072033.107google scholar: lookup
                      10. Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, Finn RD, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Laugraud A, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Mulder N, Natale D, Orengo C, Quinn AF, Selengut JD, Sigrist CJ, Thimma M, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. InterPro: the integrative protein signature database.. Nucleic Acids Res 2009 Jan;37(Database issue):D211-5.
                        pmc: PMC2686546pubmed: 18940856doi: 10.1093/nar/gkn785google scholar: lookup
                      11. Kircher M, Kelso J. High-throughput DNA sequencing--concepts and limitations.. Bioessays 2010 Jun;32(6):524-36.
                        pubmed: 20486139doi: 10.1002/bies.200900181google scholar: lookup
                      12. Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes.. Nucleic Acids Res 2007;35(9):3100-8.
                        pmc: PMC1888812pubmed: 17452365doi: 10.1093/nar/gkm160google scholar: lookup
                      13. Mellmann A, Harmsen D, Cummings CA, Zentz EB, Leopold SR, Rico A, Prior K, Szczepanowski R, Ji Y, Zhang W, McLaughlin SF, Henkhaus JK, Leopold B, Bielaszewska M, Prager R, Brzoska PM, Moore RL, Guenther S, Rothberg JM, Karch H. Prospective genomic characterization of the German enterohemorrhagic Escherichia coli O104:H4 outbreak by rapid next generation sequencing technology.. PLoS One 2011;6(7):e22751.
                      14. Price AL, Jones NC, Pevzner PA. De novo identification of repeat families in large genomes.. Bioinformatics 2005 Jun;21 Suppl 1:i351-8.
                      15. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. InterProScan: protein domains identifier.. Nucleic Acids Res 2005 Jul 1;33(Web Server issue):W116-20.
                        pmc: PMC1160203pubmed: 15980438doi: 10.1093/nar/gki442google scholar: lookup
                      16. Ramos RT, Carneiro AR, Baumbach J, Azevedo V, Schneider MP, Silva A. Analysis of quality raw data of second generation sequencers with Quality Assessment Software.. BMC Res Notes 2011 Apr 18;4:130.
                        pmc: PMC3105940pubmed: 21501521doi: 10.1186/1756-0500-4-130google scholar: lookup
                      17. Rothberg JM, Hinz W, Rearick TM, Schultz J, Mileski W, Davey M, Leamon JH, Johnson K, Milgrew MJ, Edwards M, Hoon J, Simons JF, Marran D, Myers JW, Davidson JF, Branting A, Nobile JR, Puc BP, Light D, Clark TA, Huber M, Branciforte JT, Stoner IB, Cawley SE, Lyons M, Fu Y, Homer N, Sedova M, Miao X, Reed B, Sabina J, Feierstein E, Schorn M, Alanjary M, Dimalanta E, Dressman D, Kasinskas R, Sokolsky T, Fidanza JA, Namsaraev E, McKernan KJ, Williams A, Roth GT, Bustillo J. An integrated semiconductor device enabling non-optical genome sequencing.. Nature 2011 Jul 20;475(7356):348-52.
                        pubmed: 21776081doi: 10.1038/nature10242google scholar: lookup
                      18. Ruiz JC, D'Afonseca V, Silva A, Ali A, Pinto AC, Santos AR, Rocha AA, Lopes DO, Dorella FA, Pacheco LG, Costa MP, Turk MZ, Seyffert N, Moraes PM, Soares SC, Almeida SS, Castro TL, Abreu VA, Trost E, Baumbach J, Tauch A, Schneider MP, McCulloch J, Cerdeira LT, Ramos RT, Zerlotini A, Dominitini A, Resende DM, Coser EM, Oliveira LM, Pedrosa AL, Vieira CU, Guimarães CT, Bartholomeu DC, Oliveira DM, Santos FR, Rabelo ÉM, Lobo FP, Franco GR, Costa AF, Castro IM, Dias SR, Ferro JA, Ortega JM, Paiva LV, Goulart LR, Almeida JF, Ferro MI, Carneiro NP, Falcão PR, Grynberg P, Teixeira SM, Brommonschenkel S, Oliveira SC, Meyer R, Moore RJ, Miyoshi A, Oliveira GC, Azevedo V. Evidence for reductive genome evolution and lateral acquisition of virulence functions in two Corynebacterium pseudotuberculosis strains.. PLoS One 2011 Apr 18;6(4):e18551.
                      19. Soares SC, Abreu VA, Ramos RT, Cerdeira L, Silva A, Baumbach J, Trost E, Tauch A, Hirata R Jr, Mattos-Guaraldi AL, Miyoshi A, Azevedo V. PIPS: pathogenicity island prediction software.. PLoS One 2012;7(2):e30848.
                      20. Trost E, Ott L, Schneider J, Schröder J, Jaenicke S, Goesmann A, Husemann P, Stoye J, Dorella FA, Rocha FS, Soares Sde C, D'Afonseca V, Miyoshi A, Ruiz J, Silva A, Azevedo V, Burkovski A, Guiso N, Join-Lambert OF, Kayal S, Tauch A. The complete genome sequence of Corynebacterium pseudotuberculosis FRC41 isolated from a 12-year-old girl with necrotizing lymphadenitis reveals insights into gene-regulatory networks contributing to virulence.. BMC Genomics 2010 Dec 30;11:728.
                        pmc: PMC3022926pubmed: 21192786doi: 10.1186/1471-2164-11-728google scholar: lookup
                      21. Trost E, Al-Dilaimi A, Papavasiliou P, Schneider J, Viehoever P, Burkovski A, Soares SC, Almeida SS, Dorella FA, Miyoshi A, Azevedo V, Schneider MP, Silva A, Santos CS, Santos LS, Sabbadini P, Dias AA, Hirata R Jr, Mattos-Guaraldi AL, Tauch A. Comparative analysis of two complete Corynebacterium ulcerans genomes and detection of candidate virulence factors.. BMC Genomics 2011 Jul 30;12:383.
                        pmc: PMC3164646pubmed: 21801446doi: 10.1186/1471-2164-12-383google scholar: lookup
                      22. Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs.. Genome Res 2008 May;18(5):821-9.
                        pmc: PMC2336801pubmed: 18349386doi: 10.1101/gr.074492.107google scholar: lookup

                      Citations

                      This article has been cited 11 times.
                      1. Kremer FS, McBride AJA, Pinto LS. Approaches for in silico finishing of microbial genome sequences.. Genet Mol Biol 2017 Jul-Sep 01;40(3):553-576.
                      2. Viana MVC, Figueiredo H, Ramos R, Guimarães LC, Pereira FL, Dorella FA, Selim SAK, Salaheldean M, Silva A, Wattam AR, Azevedo V. Comparative genomic analysis between Corynebacterium pseudotuberculosis strains isolated from buffalo.. PLoS One 2017;12(4):e0176347.
                        doi: 10.1371/journal.pone.0176347pubmed: 28445543google scholar: lookup
                      3. Almeida S, Sousa C, Abreu V, Diniz C, Dorneles EM, Lage AP, Barh D, Azevedo V. Exploration of Nitrate Reductase Metabolic Pathway in Corynebacterium pseudotuberculosis.. Int J Genomics 2017;2017:9481756.
                        doi: 10.1155/2017/9481756pubmed: 28316974google scholar: lookup
                      4. Baraúna RA, Ramos RTJ, Veras AAO, de Sá PHCG, Guimarães LC, das Graças DA, Carneiro AR, Edman JM, Spier SJ, Azevedo V, Silva A. Genomic analysis of four strains of Corynebacterium pseudotuberculosis bv. Equi isolated from horses showing distinct signs of infection.. Stand Genomic Sci 2017;12:16.
                        doi: 10.1186/s40793-017-0234-6pubmed: 28163825google scholar: lookup
                      5. de Sá PC, Da Silva ML, Carneiro AR, Gomes JC, Dias LM, Alves JT, De Oliveira Veras AA, Baraúna RA, Das Graças DA, Matté MH, Sato MI, Hachich EM, Matté GR, Ramos RT, Silva A. Draft Genome Sequence of Non-O1 and Non-O139 Vibrio cholerae Strain VCC19.. Genome Announc 2014 Nov 6;2(6).
                        doi: 10.1128/genomeA.01094-14pubmed: 25377699google scholar: lookup
                      6. Dorella FA, Gala-Garcia A, Pinto AC, Sarrouh B, Antunes CA, Ribeiro D, Aburjaile FF, Fiaux KK, Guimarães LC, Seyffert N, El-Aouar RA, Silva R, Hassan SS, Castro TL, Marques WS, Ramos R, Carneiro A, de Sá P, Miyoshi A, Azevedo V, Silva A. Progression of 'OMICS' methodologies for understanding the pathogenicity of Corynebacterium pseudotuberculosis: the Brazilian experience.. Comput Struct Biotechnol J 2013;6:e201303013.
                        doi: 10.5936/csbj.201303013pubmed: 24688721google scholar: lookup
                      7. Henriques I, Jucá Ramos RT, Baraúna RA, de Sá PH, Marinho Almeida D, Carneiro AR, Barbosa S, Pereira A, Alves A, Saavedra MJ, Egas C, Silva A, Correia A. Draft Genome Sequence of Serratia fonticola UTAD54, a Carbapenem-Resistant Strain Isolated from Drinking Water.. Genome Announc 2013 Nov 27;1(6).
                        doi: 10.1128/genomeA.00970-13pubmed: 24285645google scholar: lookup
                      8. Carneiro AR, Jucá Ramos RT, Baraúna RA, de Sá PH, Marinho Almeida D, Barbosa S, Pereira A, Alves A, Egas C, Correia A, Henriques I, Silva A. Draft Genome Sequence of Serratia fonticola LMG 7882T Isolated from Freshwater.. Genome Announc 2013 Dec 26;1(6).
                        doi: 10.1128/genomeA.00971-13pubmed: 24265499google scholar: lookup
                      9. Veras AA, de Sá PH, Azevedo V, Silva A, Ramos RT. AutoAssemblyD: a graphical user interface system for several genome assemblers.. Bioinformation 2013;9(16):840-1.
                        doi: 10.6026/97320630009840pubmed: 24143057google scholar: lookup
                      10. Ramos RT, Carneiro AR, Caracciolo PH, Azevedo V, Schneider MP, Barh D, Silva A. Graphical contig analyzer for all sequencing platforms (G4ALL): a new stand-alone tool for finishing and draft generation of bacterial genomes.. Bioinformation 2013;9(11):599-604.
                        doi: 10.6026/97320630009599pubmed: 23888102google scholar: lookup
                      11. Soares SC, Silva A, Trost E, Blom J, Ramos R, Carneiro A, Ali A, Santos AR, Pinto AC, Diniz C, Barbosa EG, Dorella FA, Aburjaile F, Rocha FS, Nascimento KK, Guimarães LC, Almeida S, Hassan SS, Bakhtiar SM, Pereira UP, Abreu VA, Schneider MP, Miyoshi A, Tauch A, Azevedo V. The pan-genome of the animal pathogen Corynebacterium pseudotuberculosis reveals differences in genome plasticity between the biovar ovis and equi strains.. PLoS One 2013;8(1):e53818.
                        doi: 10.1371/journal.pone.0053818pubmed: 23342011google scholar: lookup