Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification

Yunzhi LinAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaChen YeAnhui Agricultural University School of Information and Computer, , Hefei, Anhui 230036, ChinaXingzhu LiAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaQinyao ChenAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaYing WuAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaFeng ZhangAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaRui PanAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaSijia ZhangAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaShuxia ChenAnhui Agricultural University School of Information and Computer, , Hefei, Anhui 230036, ChinaXu WangChinese Academy of Agricultural Sciences Agricultural Genomics Institute at Shenzhen, , Shenzhen, Guangdong 518124, ChinaShuo CaoChinese Academy of Agricultural Sciences Agricultural Genomics Institute at Shenzhen, , Shenzhen, Guangdong 518124, ChinaYingzhen WangAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaYi YueAnhui Agricultural University School of Information and Computer, , Hefei, Anhui 230036, ChinaYongsheng LiuAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, ChinaJunyang YueAnhui Agricultural University School of Horticulture, , Hefei, Anhui 230036, China
2023en
ABI

Аннотация

Abstract A high-quality genome is the basis for studies on functional, evolutionary, and comparative genomics. The majority of attention has been paid to the solution of complex chromosome structures and highly repetitive sequences, along with the emergence of a new ‘telomere-to-telomere (T2T) assembly’ era. However, the bioinformatic tools for the automatic construction and/or characterization of T2T genome are limited. Here, we developed a user-friendly web toolkit, quarTeT, which currently includes four modules: AssemblyMapper, GapFiller, TeloExplorer, and CentroMiner. First, AssemblyMapper is designed to assemble phased contigs into the chromosome-level genome by referring to a closely related genome. Then, GapFiller would endeavor to fill all unclosed gaps in a given genome with the aid of additional ultra-long sequences. Finally, TeloExplorer and CentroMiner are applied to identify candidate telomere and centromere as well as their localizations on each chromosome. These four modules can be used alone or in combination with each other for T2T genome assembly and characterization. As a case study, by adopting the entire modular functions of quarTeT, we have achieved the Actinidia chinensis genome assembly that is of a quality comparable to the reported genome Hongyang v4.0, which was assembled with the addition of manual handling. Further evaluation of CentroMiner by searching centromeres in Arabidopsis thaliana and Oryza sativa genomes showed that quarTeT is capable of identifying all the centromeric regions that have been previously detected by experimental methods. Collectively, quarTeT is an efficient toolkit for studies of large-scale T2T genomes and can be accessed at http://www.atcgn.com:8080/quarTeT/home.html without registration.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 2Использованных источников: 0