Content | |
---|---|
Description | The Human Protein Atlas portal is a publicly available database with millions of high-resolution images showing the spatial distribution of proteins in normal human tissues and different cancer types, as well the sub cellular localisation in single cells. |
Organisms | Human |
Contact | |
Research center | KTH, UU, SciLifeLab, Sweden |
Primary citation | Uhlén M, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID 25613900. S2CID 802377. |
Access | |
Website | www |
Download URL | www |
Tools | |
Web | Advanced search, bulk retrieval/download |
Miscellaneous | |
Versioning | Yes |
Data release frequency | 12 months |
Version | 23 |
Curation policy | Yes – manual |
Bookmarkable entities | Yes – both individual protein entries and searches |
The Human Protein Atlas (HPA) is a Swedish-based program started in 2003 with the aim to map all the human proteins in cells, tissues and organs using integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics and systems biology. All the data in the knowledge resource is open access to allow scientists both in academia and industry to freely access the data for exploration of the human proteome. In June 2023, version 23 was launched where a new Interaction section was introduced containing human protein-protein interaction networks for more than 11,000 genes that will add new aspects in terms of protein function.
The resource now includes twelve separate sections with complementary information about all human proteins. All data has been updated on the approximately 5 million individual web pages. The Human Protein Atlas program has already contributed to several thousands of publications in the field of human biology and disease and was selected by the organization ELIXIR as a European core resource due to its fundamental importance for a wider life science community. The HPA consortium is funded by the Knut and Alice Wallenberg Foundation.
The Human Protein Atlas consists of twelve sections:
In addition to the twelve sections of HPA, exploring gene and protein expression, there are various features available at the HPA website to assist the research community, including integrated external resources, such as Metabolic Atlas, educational material and free downloadable data.
The Human Protein Atlas program was started in 2003 and funded by the non-profit organization Knut and Alice Wallenberg Foundation (KAW). The main site of the project is the Royal Institute of Technology (KTH), School of Engineering Sciences in Chemistry, Biotechnology and Health (Stockholm, Sweden). Additionally, the project involves research groups at Uppsala University, Karolinska Institutet, Chalmers University of Technology and Lund University, as well as several present and past international collaborations initiated with research groups in Europe, the United States, South Korea, China, and India. Professor Mathias Uhlén is the director of the program.
The research underpinning the start of the exploration of the whole human proteome in the Human Protein Atlas program was carried out in the late 1990s and early 2000s. A pilot study employing an affinity proteomics strategy using affinity-purified antibodies raised against recombinant human protein fragments was carried out for a chromosome-wide protein profiling of chromosome 21. [10] Other projects were also carried out to establish processes for parallel and automated affinity purification of mono-specific antibodies and their validation. [11] [12]
Antibodies and antigens, produced in the Human Protein Atlas workflow, are used in research projects to study potential biomarkers in various diseases, such as breast cancer, prostate cancer, colon cancer, diabetes, autoimmune diseases, ovarian cancer and renal failure. [13] [14] [15] [16] [17] [18]
Researchers involved with Human Protein Atlas projects, are sharing protocols and method details in an open-access group on protocols.io. [19] A large effort is put into validating the antibody reagents used for profiling of tissues and cells, and the HPA has implemented stringent antibody validation criteria as suggested by the International Working Group for Antibody Validation (IWGAV). [20] [21] [22]
The Human Protein Atlas program has participated in 9 EU research projects ENGAGE, PROSPECTS, BIO_NMD, AFFINOMICS, CAGEKID, EURATRANS, ITFoM, DIRECT and PRIMES.
The duodenum is the first section of the small intestine in most higher vertebrates, including mammals, reptiles, and birds. In mammals, it may be the principal site for iron absorption. The duodenum precedes the jejunum and ileum and is the shortest part of the small intestine.
Immunohistochemistry is a form of immunostaining. It involves the process of selectively identifying antigens (proteins) in cells and tissue, by exploiting the principle of antibodies binding specifically to antigens in biological tissues. Albert Hewett Coons, Ernest Berliner, Norman Jones and Hugh J Creech was the first to develop immunofluorescence in 1941. This led to the later development of immunohistochemistry.
Caspase 14 is an enzyme that in humans is encoded by the CASP14 gene. Orthologs of this gene also exist in other mammals, such as sirenians and cetaceans, though they are inactivated in these two clades. Curiously, manatees, which are sirenians, retain some functional CASP14 genes.
Receptor expression-enhancing protein 2 is a protein that in humans is encoded by the REEP2 gene.
Special AT-rich sequence-binding protein 2 (SATB2) also known as DNA-binding protein SATB2 is a protein that in humans is encoded by the SATB2 gene. SATB2 is a DNA-binding protein that specifically binds nuclear matrix attachment regions and is involved in transcriptional regulation and chromatin remodeling. SATB2 shows a restricted mode of expression and is expressed in certain cell nuclei. The SATB2 protein is mainly expressed in the epithelial cells of the colon and rectum, followed by the nuclei of neurons in the brain.
Cubilin is a protein that in humans is encoded by the CUBN gene.
The Human Proteome Project (HPP) is a collaborative effort coordinated by the Human Proteome Organization. Its stated goal is to experimentally observe all of the proteins produced by the sequences translated from the human genome.
The secretome is the set of proteins expressed by an organism and secreted into the extracellular space. In humans, this subset of the proteome encompasses 13-20% of all proteins, including cytokines, growth factors, extracellular matrix proteins and regulators, and shed receptors. The secretome of a specific tissue can be measured by mass spectrometry and its analysis constitutes a type of proteomics known as secretomics.
KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5, 5q15. It is a possible risk factor in Type II Diabetes, and associated with high levels of glucose in the blood. It is a relatively fast mutating gene, compared to other coding genes. There is however one region which is highly conserved across the species that have the gene, known as DUF4495. It is predicted to travel between the nucleus and the cytoplasm.
Mathias Uhlén is a Swedish scientist and Professor of Microbiology at Royal Institute of Technology (KTH), Stockholm. After a post-doc period at the EMBL in Heidelberg, Germany, he became professor in microbiology at KTH in 1988. His research is focused on protein science, antibody engineering and precision medicine and range from basic research in human and microbial biology to more applied research, including clinical applications. He is member of several academies and societies, including Royal Swedish Academy of Science (KVA), National Academy of Engineering (NAE) and the Swedish Academy of Engineering Science (IVA). Dr Uhlen was the Founding Director of the national infrastructure Science for Life Laboratory (SciLifeLab) from 2010 to 2015.
SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.
Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.
LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.
Glutamate-rich protein 4 is encoded by the gene ERICH4 and can be otherwise known as chromosome 19 open reading frame 69 (C19orf69). ERICH4 is highly conserved in mammals and exhibits overexpression in tissues of the kidneys, terminal ileum, and duodenum. The function of ERICH4 has yet to be well understood by the scientific community but is suggested to contribute to immune inflammatory responses.
RTP3 is a gene located on chromosome 3 in humans that encodes the RTP3 protein. Its expression is liver-restricted.
Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.
CCDC188 or coiled-coil domain containing protein is a protein that in humans is encoded by the CCDC188 gene.
Transmembrane Protein 144 (TMEM144) is a protein in humans encoded by the TMEM144 gene.
FAM131A is a protein that is encoded by the FAM131A gene in humans. Aliases for FAM131A include C3orf40, FLAT715, and PRO1378.
C19Orf81 is a protein which in humans is encoded by the gene C19Orf81. It is a rarely expressed protein found mainly in the testes, cerebellum and cerebral cortex.