Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

Assessing the Impact of Data Preprocessing on Analyzing Next Generation Sequencing Data

Binsheng HeAcademician Workstation, Changsha Medical University, Changsha, ChinaRongrong ZhuVascular Surgery Department, Tsinghua University Affiliated Beijing Tsinghua Changgung Hospital, Beijing, ChinaHuandong YangDepartment of Gastrointestinal Surgery, Yidu Central Hospital of Weifang, Weifang, ChinaQingqing LuGeneis Beijing Co., Ltd., Beijing, ChinaWeiwei WangGeneis Beijing Co., Ltd., Beijing, ChinaLei SongGeneis Beijing Co., Ltd., Beijing, ChinaXue SunGeneis Beijing Co., Ltd., Beijing, ChinaGuandong ZhangGeneis Beijing Co., Ltd., Beijing, ChinaShijun LiDepartment of Pathology, Chifeng Municipal Hospital, Chifeng, ChinaJialiang YangAcademician Workstation, Changsha Medical University, Changsha, ChinaGeng TianGeneis Beijing Co., Ltd., Beijing, ChinaPingping BingAcademician Workstation, Changsha Medical University, Changsha, ChinaJidong LangGeneis Beijing Co., Ltd., Beijing, China
2020en
ABI

Аннотация

Data quality control and preprocessing are often the first step in processing next-generation sequencing (NGS) data of tumors. Not only can it help us evaluate the quality of sequencing data, but it can also help us obtain high-quality data for downstream data analysis. However, by comparing data analysis results of preprocessing with Cutadapt, FastP, Trimmomatic, and raw sequencing data, we found that the frequency of mutation detection had some fluctuations and differences, and human leukocyte antigen (HLA) typing directly resulted in erroneous results. We think that our research had demonstrated the impact of data preprocessing steps on downstream data analysis results. We hope that it can promote the development or optimization of better data preprocessing methods, so that downstream information analysis can be more accurate.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 2Использованных источников: 0