List of long non-coding RNA databases

Last updated

This is a list of long noncoding RNA databases , which provide information about lncRNAs. [1] [2]

Long non-coding RNA databases

NameDescriptionReferences
deepBase Identification, expression, evolution and function of long non-coding RNAs (LncRNAs), small RNAs and circular RNAs from deep-sequencing data [3]
LNCipedia A comprehensive compendium of human long non-coding RNAs. [4] [5]
lncRNAdb The Reference Database For Functional Long Noncoding RNAs. [6]
LncRNAWiki A wiki-based, publicly editable and open-content platform for community curation of human long non-coding RNAs (lncRNAs) [7]
LncBook A comprehensive collection of 270,044 human lncRNAs and systematic curation of lncRNAs’ annotation by multi-omics data integration, function annotation and disease association [8]
MONOCLdb The MOuse NOnCode Lung database provides the annotations and expression profiles of mouse long non-coding RNAs (lncRNAs) involved in influenza and SARS-CoV infections. [9]
NONCODE An integrated knowledge database dedicated to ncRNAs, especially lncRNAs [10]
lncRNome A comprehensive searchable biologically oriented knowledgebase for long noncoding RNAs in Humans.
NRED A database of long noncoding RNA expression. [11]
C-It-Loci A tool to explore and to compare the expression profiles of conserved loci among various tissues in three organisms

[12]

MiTranscriptome A catalog of human long poly-adenylated RNA transcripts derived from computational analysis of high-throughput RNA-Seq data from over 6,500 samples, spanning diverse cancer and tissue types [13]
slncky Evolution Browser This site contains alignments and evolutionary metrics of conserved lncRNAs. [14]
Cancer LncRNA Census (CLC) Database of long-noncoding RNAs causally implicated in cancer through in vivo, in vitro and other evidence. [15]
BIGTranscriptome High-confidence of coding and noncoding transcriptomes assembled with hundreds of pseudo-stranded and stranded RNA-seq datasets. [16]
lncRNAKBA knowledgebase of tissue-specific functional annotation and trait association of long noncoding RNA [17]

Related Research Articles

Rfam is a database containing information about non-coding RNA (ncRNA) families and other structured RNA elements. It is an annotated, open access database originally developed at the Wellcome Trust Sanger Institute in collaboration with Janelia Farm, and currently hosted at the European Bioinformatics Institute. Rfam is designed to be similar to the Pfam database for annotating protein families.

<span class="mw-page-title-main">Small nucleolar SNORD12/SNORD106</span>

In molecular biology, the small nucleolar RNAs SNORD106 and SNORD12 are two related snoRNAs which belongs to the C/D class of small nucleolar RNAs (snoRNAs). Both contain the conserved C (UGAUGA) and D (CUGA) box sequence motifs

<span class="mw-page-title-main">Telomerase RNA component</span> NcRNA found in eukaryotes

Telomerase RNA component, also known as TR, TER or TERC, is an ncRNA found in eukaryotes that is a component of telomerase, the enzyme used to extend telomeres. TERC serves as a template for telomere replication by telomerase. Telomerase RNAs differ greatly in sequence and structure between vertebrates, ciliates and yeasts, but they share a 5' pseudoknot structure close to the template sequence. The vertebrate telomerase RNAs have a 3' H/ACA snoRNA-like domain.

<span class="mw-page-title-main">Long non-coding RNA</span> Non-protein coding transcripts longer than 200 nucleotides

Long non-coding RNAs are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), and other short RNAs. Given that some lncRNAs have been reported to have the potential to encode small proteins or micro-peptides, the latest definition of lncRNA is a class of RNA molecules of over 200 nucleotides that have no or limited coding capacity. Long intervening/intergenic noncoding RNAs (lincRNAs) are sequences of lncRNA which do not overlap protein-coding genes.

The Gene Wiki is a project within Wikipedia that aims to describe the relationships and functions of all human genes. It was established to transfer information from scientific resources to Wikipedia stub articles.

Therapeutic Target Database (TTD) is a pharmaceutical and medical repository constructed by the Innovative Drug Research and Bioinformatics Group (IDRB) at Zhejiang University, China and the Bioinformatics and Drug Design Group at the National University of Singapore. It provides information about known and explored therapeutic protein and nucleic acid targets, the targeted disease, pathway information and the corresponding drugs directed at each of these targets. Detailed knowledge about target function, sequence, 3D structure, ligand binding properties, enzyme nomenclature and drug structure, therapeutic class, and clinical development status. TTD is freely accessible without any login requirement at https://idrblab.org/ttd/.

miRBase

In bioinformatics, miRBase is a biological database that acts as an archive of microRNA sequences and annotations. As of September 2010 it contained information about 15,172 microRNAs. This number has risen to 38,589 by March 2018. The miRBase registry provides a centralised system for assigning new names to microRNA genes.

This microRNA database and microRNA targets databases is a compilation of databases and web portals and servers used for microRNAs and their targets. MicroRNAs (miRNAs) represent an important class of small non-coding RNAs (ncRNAs) that regulate gene expression by targeting messenger RNAs.

<span class="mw-page-title-main">LncRNAdb</span>

In bioinformatics, lncRNAdb is a biological database of Long non-coding RNAs The database focuses on those RNAs which have been experimentally characterised with a biological function. The database currently holds over 290 lncRNAs from around 60 species. Example lncRNAs in the database are HOTAIR and Xist.

This RNA modification databases are a compilation of databases and web portals and servers used for RNA modification. RNA modification occurs in all living organisms, and is one of the most evolutionarily conserved properties of RNAs. More than 100 different types of RNA modifications have been characterized across all living organisms. It can affect the activity, localization as well as stability of RNAs, and has been linked with human cancer and diseases.

The NONCODE database is a collection of expression and functional lncRNA data obtained from re-annotated microarray studies.

In molecular biology mir-425 microRNA is a short RNA molecule. MicroRNAs function to regulate the expression levels of other genes by several mechanisms.

In molecular biology mir-542 microRNA is a short RNA molecule. MicroRNAs function to regulate the expression levels of other genes by several mechanisms.

Competing endogenous RNAs hypothesis: ceRNAs regulate other RNA transcripts by competing for shared microRNAs. They are playing important roles in developmental, physiological and pathological processes, such as cancer. Multiple classes of ncRNAs and protein-coding mRNAs function as key ceRNAs (sponges) and to regulate the expression of mRNAs in plants and mammalian cells.

Single nucleotide polymorphism annotation is the process of predicting the effect or function of an individual SNP using SNP annotation tools. In SNP annotation the biological information is extracted, collected and displayed in a clear form amenable to query. SNP functional annotation is typically performed based on the available information on nucleic acid and protein sequences.

Model organism databases (MODs) are biological databases, or knowledgebases, dedicated to the provision of in-depth biological data for intensively studied model organisms. MODs allow researchers to easily find background information on large sets of genes, plan experiments efficiently, combine their data with existing knowledge, and construct novel hypotheses. They allow users to analyse results and interpret datasets, and the data they generate are increasingly used to describe less well studied species. Where possible, MODs share common approaches to collect and represent biological information. For example, all MODs use the Gene Ontology (GO) to describe functions, processes and cellular locations of specific gene products. Projects also exist to enable software sharing for curation, visualization and querying between different MODs. Organismal diversity and varying user requirements however mean that MODs are often required to customize capture, display, and provision of data.

Transcription factors are proteins that bind genomic regulatory sites. Identification of genomic regulatory elements is essential for understanding the dynamics of developmental, physiological and pathological processes. Recent advances in chromatin immunoprecipitation followed by sequencing (ChIP-seq) have provided powerful ways to identify genome-wide profiling of DNA-binding proteins and histone modifications. The application of ChIP-seq methods has reliably discovered transcription factor binding sites and histone modification sites.

<span class="mw-page-title-main">SHLD1</span> Protein-coding gene in the species Homo sapiens

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

References

  1. Rinn, J. L.; Chang, H. Y. (2012). "Genome Regulation by Long Noncoding RNAs". Annual Review of Biochemistry . 81: 145–166. doi:10.1146/annurev-biochem-051410-092902. PMC   3858397 . PMID   22663078.
  2. Martin, L.; Chang, H. Y. (2012). "Uncovering the role of genomic "dark matter" in human disease". Journal of Clinical Investigation . 122 (5): 1589–1595. doi:10.1172/JCI60020. PMC   3336981 . PMID   22546862.
  3. Zheng, LL; Li, JH; Wu, J; Sun, WJ; Liu, S; Wang, ZL; Zhou, H; Yang, JH; Qu, LH (4 January 2016). "deepBase v2.0: identification, expression, evolution and function of small RNAs, LncRNAs and circular RNAs from deep-sequencing data". Nucleic Acids Research. 44 (D1): D196–202. doi:10.1093/nar/gkv1273. PMC   4702900 . PMID   26590255.
  4. Volders, P. -J.; Helsens, K.; Wang, X.; Menten, B.; Martens, L.; Gevaert, K.; Vandesompele, J.; Mestdagh, P. (2012). "LNCipedia: A database for annotated human lncRNA transcript sequences and structures". Nucleic Acids Research . 41 (Database issue): D246–D251. doi:10.1093/nar/gks915. PMC   3531107 . PMID   23042674.
  5. Volders, P. J.; Verheggen, K; Menschaert, G; Vandepoele, K; Martens, L; Vandesompele, J; Mestdagh, P (2015). "An update on LNCipedia: A database for annotated human lncRNA sequences". Nucleic Acids Research. 43 (Database issue): D174–180. doi:10.1093/nar/gku1060. PMC   4383901 . PMID   25378313.
  6. Amaral, P. P.; Clark, M. B.; Gascoigne, D. K.; Dinger, M. E.; Mattick, J. S. (2010). "LncRNAdb: A reference database for long noncoding RNAs". Nucleic Acids Research. 39 (Database issue): D146–D151. doi:10.1093/nar/gkq1138. PMC   3013714 . PMID   21112873.
  7. Ma, Lina; Li, Ang; Zou, Dong; Xu, Xingjian; Xia, Lin; Yu, Jun; Bajic, Vladimir B.; Zhang, Zhang (2015). "LncRNAWiki: harnessing community knowledge in collaborative curation of human long non-coding RNAs". Nucleic Acids Research. 43 (Database issue): D187–D92. doi:10.1093/nar/gku1167. PMC   4383965 . PMID   25399417.
  8. Ma, Lina; Cao, Jiabao; Liu, Lin; Du, Qiang; Li, Zhao; Zou, Dong; Bajic, Vladimir B.; Zhang, Zhang (2019). "LncBook: a curated knowledgebase of human long non-coding RNAs". Nucleic Acids Research. 47 (Database issue): D128–D134. doi:10.1093/nar/gky960. PMC   6323930 . PMID   30329098.
  9. Josset L, Tchitchek N, Gralinski LE, Ferris MT, Eisfeld AJ, Green RR; et al. (2014). "Annotation of long non-coding RNAs expressed in collaborative cross founder mice in response to respiratory virus infection reveals a new class of interferon-stimulated transcripts". RNA Biol. 11 (7): 875–90. doi:10.4161/rna.29442. PMC   4179962 . PMID   24922324.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  10. Bu, D.; Yu, K.; Sun, S.; Xie, C.; Skogerbø, G.; Miao, R.; Xiao, H.; Liao, Q.; Luo, H.; Zhao, G.; Zhao, H.; Liu, Z.; Liu, C.; Chen, R.; Zhao, Y. (2011). "NONCODE v3.0: Integrative annotation of long noncoding RNAs". Nucleic Acids Research. 40 (Database issue): D210–D215. doi:10.1093/nar/gkr1175. PMC   3245065 . PMID   22135294.
  11. Dinger, M. E.; Pang, K. C.; Mercer, T. R.; Crowe, M. L.; Grimmond, S. M.; Mattick, J. S. (2009). "NRED: A database of long noncoding RNA expression". Nucleic Acids Research. 37 (Database issue): D122–D126. doi:10.1093/nar/gkn617. PMC   2686506 . PMID   18829717.
  12. Weirick, T.; David, J.; Stefanie, D.; Uchida, S. (2015). "C-It-Loci: a knowledge database for tissue-enriched loci". Bioinformatics . 31 (21): 3537–3543. doi: 10.1093/bioinformatics/btv410 . PMID   26163692.
  13. Iyer, Matthew K.; Niknafs, Yashar S.; Malik, Rohit; Singhal, Udit; Sahu, Anirban; Hosono, Yasuyuki; Barrette, Terrence R.; Prensner, John R.; Evans, Joseph R. (March 2015). "The landscape of long noncoding RNAs in the human transcriptome". Nature Genetics . 47 (3): 199–208. doi:10.1038/ng.3192. ISSN   1546-1718. PMC   4417758 . PMID   25599403.
  14. Chen, Jenny; Shishkin, Alexander A.; Zhu, Xiaopeng; Kadri, Sabah; Maza, Itay; Guttman, Mitchell; Hanna, Jacob H.; Regev, Aviv; Garber, Manuel (2016-02-02). "Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs". Genome Biology. 17: 19. doi:10.1186/s13059-016-0880-9. ISSN   1474-760X. PMC   4739325 . PMID   26838501.
  15. Carlevaro-Fita, Joana; Lanzós, Andrés; Feuerbach, Lars; Hong, Chen; Mas-Ponte, David; Skou Pedersen, Jakob; Johnson, Rory (2020). "Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis". Communications Biology. 3 (1): 56. doi: 10.1038/s42003-019-0741-7 . PMC   7002399 . PMID   32024996.
  16. You, Bo-Hyun; Yoon, Sang-Ho; Nam, Jin-Wu (2017-06-01). "High-confidence coding and noncoding transcriptome maps". Genome Research. 27 (6): 1050–1062. doi: 10.1101/gr.214288.116 . ISSN   1088-9051. PMC   5453319 . PMID   28396519.
  17. Seifuddin, Fayaz; Singh, Komudi; Suresh, Abhilash; Judy, Jennifer T.; Chen, Yun-Ching; Chaitankar, Vijender; Tunc, Ilker; Ruan, Xiangbo; Ping, Li; Chen, Yi; Cao, Haiming; Lee, Richard S.; Goes, Fernando S.; Zandi, Peter P.; Jafri, M. Saleet; Pirooznia, Mehdi (2020-10-05). "lncRNAKB, a knowledgebase of tissue-specific functional annotation and trait association of long noncoding RNA". Scientific Data . 7 (1) 326. doi: 10.1038/s41597-020-00659-z . PMC   7536183 . PMID   33020484.