Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression

Thomas DerrienBioinformatics and Genomics, Centre for Genomic Regulation and UPF, 08003 Barcelona, Catalonia, SpainRory Johnson1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Giovanni Bussotti1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Andrea Tanzer1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Sarah Djebali1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Hagen Tilgner1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Grégory Guernec2INRA, UR1012 SCRIBE, IFR140, GenOuest, 35000 Rennes, France;David Martı́n1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Angelika Merkel1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;David G. Knowles1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Julien Lagarde1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Lavanya Veeravalli3Genome Institute of Singapore, Agency for Science, Technology and Research, Genome 138672, Singapore;Xiaoan Ruan3Genome Institute of Singapore, Agency for Science, Technology and Research, Genome 138672, Singapore;Yijun Ruan3Genome Institute of Singapore, Agency for Science, Technology and Research, Genome 138672, Singapore;Timo Lassmann4Riken Omics Science Center, Riken Yokohama Institute, Yokohama, Kanagawa 351-0198, Japan;Piero Carninci4Riken Omics Science Center, Riken Yokohama Institute, Yokohama, Kanagawa 351-0198, Japan;James Brown5Department of Statistics, University of California, Berkeley, California 94720, USA;Leonard Lipovich6Center for Molecular Medicine and Genetics, Wayne State University, Detroit, Michigan 48201, USA;José M. González7Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, United Kingdom;Mark Thomas7Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, United Kingdom;Carrie Davis8Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA;Ramin Shiekhattar9The Wistar Institute, Philadelphia, Pennsylvania 19104, USA;T Gingeras8Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA;Tim Hubbard7Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, United Kingdom;Cédric Notredame1Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, 08003 Barcelona, Catalonia, Spain;Jennifer Harrow7Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, United Kingdom;Roderic Guigó10Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, 08002 Barcelona, Catalonia, Spain
2012en
ABI

Аннотация

The human genome contains many thousands of long noncoding RNAs (lncRNAs). While several studies have demonstrated compelling biological and disease roles for individual examples, analytical and experimental approaches to investigate these genes have been hampered by the lack of comprehensive lncRNA annotation. Here, we present and analyze the most complete human lncRNA annotation to date, produced by the GENCODE consortium within the framework of the ENCODE project and comprising 9277 manually annotated genes producing 14,880 transcripts. Our analyses indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths. In contrast to protein-coding genes, however, lncRNAs display a striking bias toward two-exon transcripts, they are predominantly localized in the chromatin and nucleus, and a fraction appear to be preferentially processed into small RNAs. They are under stronger selective pressure than neutrally evolving sequences-particularly in their promoter regions, which display levels of selection comparable to protein-coding genes. Importantly, about one-third seem to have arisen within the primate lineage. Comprehensive analysis of their expression in multiple human organs and brain regions shows that lncRNAs are generally lower expressed than protein-coding genes, and display more tissue-specific expression patterns, with a large fraction of tissue-specific lncRNAs expressed in the brain. Expression correlation analysis indicates that lncRNAs show particularly striking positive correlation with the expression of antisense coding genes. This GENCODE annotation represents a valuable resource for future studies of lncRNAs.

Перевод пока недоступен

Идентификаторы

Цитирования и источники

Цитирований: 4Использованных источников: 0