Gene Wiki

The Gene Wiki is a project within Wikipedia that aims to describe the relationships and functions of all human genes. It was established to transfer information from scientific resources to Wikipedia stub articles. [1] [2] [3]

The Gene Wiki project also initiated publication of gene-specific review articles in the journal Gene, together with the editing of the gene-specific pages in Wikipedia. [4]

The Gene Wiki project in collaboration with the journal Gene was terminated in May 2022, ten years after the project's initiation. A report by the project's leaders summarizes the project's achievements. [5]

Project goals and scope

Number of gene articles

The human genome contains an estimated 20,000–25,000 protein-coding genes. [6] The goal of the Gene Wiki project is to create seed articles for every notable human gene, that is, every gene whose function has been assigned in the peer-reviewed scientific literature. Approximately half of human genes have an assigned function; the total number of articles seeded by the Gene Wiki project would therefore be expected to be in the range of 10,000–15,000. To date, approximately 11,000 articles have been created or augmented to include Gene Wiki project content. [citation needed]
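
The arithmetic behind this estimate can be checked with a short sketch. The gene-count bounds come from the text above; the function name and the reading of "approximately half" as a fraction between 0.5 and 0.6 are assumptions made here for illustration, not figures from the cited sources.

```python
def expected_article_range(genes_low, genes_high, frac_low, frac_high):
    """Return (low, high) bounds on the number of genes with assigned function."""
    return round(genes_low * frac_low), round(genes_high * frac_high)


# Estimated number of protein-coding genes, as quoted in the text.
GENES_LOW, GENES_HIGH = 20_000, 25_000

# Case 1: exactly one half of all genes have an assigned function.
exact_half = expected_article_range(GENES_LOW, GENES_HIGH, 0.5, 0.5)

# Case 2 (assumption): "approximately half" read as a fraction of 0.5 to 0.6.
approx_half = expected_article_range(GENES_LOW, GENES_HIGH, 0.5, 0.6)

print(exact_half)   # bounds for a fraction of exactly 0.5
print(approx_half)  # bounds for a fraction of 0.5 to 0.6
```

With a fraction of exactly one half, the bounds are 10,000 and 12,500. The stated upper bound of 15,000 is reached only if the fraction is allowed to rise to about 0.6 of the higher gene-count estimate.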

Expansion

Once seed articles have been established, the hope and expectation is that these will be annotated and expanded by editors ranging in experience from the lay audience to students to professionals and academics. [1]

Proteins encoded by genes

Only a small portion of the human genome actually encodes protein. Understanding the function of a gene that codes for a protein generally requires understanding the function of the corresponding protein. In addition to basic information about the gene, the project therefore also includes information about the protein encoded by the gene. The other portions of the genome, the non-coding DNA that was once called "junk" DNA because it had no apparent function, are now thought to have regulatory functions.

Gene Wiki generated content

Stubs for the Gene Wiki project are created by a bot and contain links to primary gene/protein databases.

Response

A 2020 study found that, between 2013 and 2017, the content that Gene Wiki contributed to Wikipedia continued to be developed through crowdsourced editing. [8]

References

  1. Huss JW, Orozco C, Goodale J, Wu C, Batalov S, Vickers TJ, Valafar F, Su AI (July 2008). "A Gene Wiki for Community Annotation of Gene Function". PLOS Biology. 6 (7): e175. doi:10.1371/journal.pbio.0060175. PMC 2443188. PMID 18613750.
  2. Huss JW, Orozco C, Goodale J, Wu C, Batalov S, Vickers TJ, Valafar F, Su AI. "SciVee Pubcast: A Gene Wiki for Community Annotation of Gene Function". Archived from the original on 2010-07-24. Retrieved 2008-11-19.
  3. Huss JW, Lindenbaum P, Martone M, et al. (January 2010). "The Gene Wiki: community intelligence applied to human gene annotation". Nucleic Acids Research. 38 (Database issue): D633–9. doi:10.1093/nar/gkp760. PMC 2808918. PMID 19755503.
  4. Tsueng G, Good BM, Ping P, Golemis E, Hanukoglu I, van Wijnen AJ, Su AI (2 May 2016). "Gene Wiki Reviews-Raising the quality and accessibility of information about the human genome". Gene. 592 (2): 235–8. doi:10.1016/j.gene.2016.04.053. PMC 5944608. PMID 27150585.
  5. van Wijnen AJ, Golemis E, Hanukoglu I, Tsui SK, Hu E, Ul-Hasan S, Joy J, Su AI, Tsueng G (July 2022). "A retrospective evaluation of a decade of Gene Wiki Reviews and their impact". Gene. 830: 146534. doi:10.1016/j.gene.2022.146534. PMID 35525475.
  6. Clamp M, Fry B, Kamal M, Xie X, Cuff J, Lin MF, Kellis M, Lindblad-Toh K, Lander ES (December 2007). "Distinguishing protein-coding and noncoding genes in the human genome". Proceedings of the National Academy of Sciences USA. 104 (49): 19428–33. doi:10.1073/pnas.0709013104. PMC 2148306. PMID 18040051.
  7. Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB (April 2004). "A gene atlas of the mouse and human protein-encoding transcriptomes". Proceedings of the National Academy of Sciences USA. 101 (16): 6062–7. Bibcode:2004PNAS..101.6062S. doi:10.1073/pnas.0400782101. PMC 395923. PMID 15075390.
  8. Zinovyev A, Czerwinska U, Cantini L, Barillot E, Frahm KM, Shepelyansky DL, Rzhetsky A (18 February 2020). "Collective intelligence defines biological functions in Wikipedia as communities in the hidden protein connection network". PLOS Computational Biology. 16 (2): e1007652. Bibcode:2020PLSCB..16E7652Z. doi:10.1371/journal.pcbi.1007652. PMC 7048313. PMID 32069277.
