Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Обзорная статья

Applications of transformer-based language models in bioinformatics: a survey

Shuang ZhangCollege of Software, Nankai University , Tianjin 300350, ChinaRui FanCollege of Software, Nankai University , Tianjin 300350, ChinaYuti LiuCollege of Software, Nankai University , Tianjin 300350, ChinaShuang ChenCollege of Software, Nankai University , Tianjin 300350, ChinaQiao LiuDepartment of Statistics, Stanford University , Stanford, CA 94305, USAWanwen ZengCollege of Software, Nankai University , Tianjin 300350, China
2023en
ABI

Аннотация

Summary: The transformer-based language models, including vanilla transformer, BERT and GPT-3, have achieved revolutionary breakthroughs in the field of natural language processing (NLP). Since there are inherent similarities between various biological sequences and natural languages, the remarkable interpretability and adaptability of these models have prompted a new wave of their application in bioinformatics research. To provide a timely and comprehensive review, we introduce key developments of transformer-based language models by describing the detailed structure of transformers and summarize their contribution to a wide range of bioinformatics research from basic sequence analysis to drug discovery. While transformer-based applications in bioinformatics are diverse and multifaceted, we identify and discuss the common challenges, including heterogeneity of training data, computational expense and model interpretability, and opportunities in the context of bioinformatics research. We hope that the broader community of NLP researchers, bioinformaticians and biologists will be brought together to foster future research and development in transformer-based language models, and inspire novel bioinformatics applications that are unattainable by traditional methods. Supplementary information: online.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 2Использованных источников: 0