Chromosome-scale genome assembly and annotation of the tetraploid potato cultivar Diacol Capiro adapted to the Andean region

dc.contributor.authorPaula H. Reyes‐Herrera
dc.contributor.authorDiego A Delgadillo-Duran
dc.contributor.authorMirella Flores-Gonzalez
dc.contributor.authorLukas A. Mueller
dc.contributor.authorMarco Cristancho
dc.contributor.authorLuz Stella Barrero
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T14:26:12Z
dc.date.available2026-03-22T14:26:12Z
dc.date.issued2024
dc.descriptionCitaciones: 3
dc.description.abstractPotato (Solanum tuberosum) is an essential crop for food security and is ranked as the third most important crop worldwide for human consumption. The Diacol Capiro cultivar holds the dominant position in Colombian cultivation, primarily catering to the food processing industry. This highly heterozygous, autotetraploid cultivar belongs to the Andigenum group and it stands out for its adaptation to a wide variety of environments spanning altitudes from 1,800 to 3,200 meters above sea level. Here, a chromosome-scale assembly, referred to as DC, is presented for this cultivar. The assembly was generated by combining circular consensus sequencing with proximity ligation Hi-C for the scaffolding and represents 2.369 Gb with 48 pseudochromosomes covering 2,091 Gb and an anchor rate of 88.26%. The reference genome metrics, including an N50 of 50.5 Mb, a BUSCO (Benchmarking Universal Single-Copy Orthologue) score of 99.38%, and an Long Terminal Repeat Assembly Index score of 13.53, collectively signal the achieved high assembly quality. A comprehensive annotation yielded a total of 154,114 genes, and the associated BUSCO score of 95.78% for the annotated sequences attests to their completeness. The number of predicted NLR (Nucleotide-Binding and Leucine-Rich-Repeat genes) was 2107 with a large representation of NBARC (for nucleotide binding domain shared by Apaf-1, certain R gene products, and CED-4) containing domains (99.85%). Further comparative analysis of the proposed annotation-based assembly with high-quality known potato genomes, showed a similar genome metrics with differences in total gene numbers related to the ploidy status. The genome assembly and annotation of DC presented in this study represent a valuable asset for comprehending potato genetics. This resource aids in targeted breeding initiatives and contributes to the creation of enhanced, resilient, and more productive potato varieties, particularly beneficial for countries in Latin America.
dc.identifier.doi10.1093/g3journal/jkae139
dc.identifier.urihttps://doi.org/10.1093/g3journal/jkae139
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/46500
dc.language.isoen
dc.publisherGenetics Society of America
dc.relation.ispartofG3 Genes Genomes Genetics
dc.sourceColombian Corporation for Agricultural Research - AGROSAVIA
dc.subjectBiology
dc.subjectGenome
dc.subjectSequence assembly
dc.subjectCultivar
dc.subjectReference genome
dc.subjectGene
dc.subjectGenetics
dc.subjectGenome browser
dc.subjectChromosome
dc.subjectGene Annotation
dc.titleChromosome-scale genome assembly and annotation of the tetraploid potato cultivar Diacol Capiro adapted to the Andean region
dc.typearticle

Files