Putative gene

Last updated October 12, 2025

A putative gene is an alignment segment of DNA that is believed to be a gene. Putative genes can share sequence similarities to already characterized genes and thus can be inferred to share a similar function, yet the exact function of putative genes remains unknown.^[1] Newly identified sequences are considered putative gene candidates when homologs of those sequences are found to be associated with the phenotype of interest.^[2]

Examples

Examples of studies involving putative genes include the discovery of 30 putative receptor genes found in rat vomeronasal organ (VNO)^[3] and the identification of 79 putative TATA boxes found in many plant genomes.^[4]

Practical importance

In order to define and characterize a biosynthetic gene cluster, all the putative genes within said cluster must first be identified and their functions must be characterized. This can be performed by complementation and knock out experiments. In the process of characterizing putative genes, the genome under study becomes increasingly well understood as more interactions can be identified.^[5] Identification of putative genes is necessary to study genomic evolution, as significant proportion of genomes make up larger families of related genes. Genomic evolution occurs by processes such as duplication of individual genes, genome segments, or entire genomes. These processes can result in loss of function, altered function, or gain of function, and have drastic affects on the phenotype.^[6]^[7]

DNA mutations outside of a putative gene can act by positional effect, in which they alter the gene expression. These alterations leave the transcription unit and promoter of the gene intact, but may involve distal promoters, enhancer/silencer elements, or the local chromatin environment. These mutations can be associated with diseases or disorders associated with the gene.

Identification

Putative genes can be identified by clustering large groups of sequences by patterns and arranging by mutual similarity^[8] or can be inferred by potential TATA boxes.^[9]

Putative genes can also be identified by recognizing differences between well-known gene clusters and gene clusters with a unique profiling.^[10]

Software tools have been developed in order to automatically identify putative genes. This is done by searching for gene families and testing the validity of uncharacterized genes by comparison to already identified genes.^[11]

Protein products can be identified and used to characterize the putative gene that codes for it.^[12]

References

↑ Alexandre S, Guyaux M, Murphy NB, Coquelet H, Pays A, Steinert M, Pays E (June 1988). "Putative genes of a variant-specific antigen gene transcription unit in Trypanosoma brucei". Molecular and Cellular Biology. 8 (6): 2367–78. doi:10.1128/mcb.8.6.2367. PMC 363435 . PMID 3405209.
↑ Mishima K, Hirao T, Tsubomura M, Tamura M, Kurita M, Nose M, et al. (April 2018). "Identification of novel putative causative genes and genetic marker for male sterility in Japanese cedar (Cryptomeria japonica D.Don)". BMC Genomics. 19 (1) 277. doi: 10.1186/s12864-018-4581-5 . PMC 5914023 . PMID 29685102.
↑ Dulac C, Axel R (October 1995). "A novel family of genes encoding putative pheromone receptors in mammals". Cell. 83 (2): 195–206. doi: 10.1016/0092-8674(95)90161-2 . PMID 7585937. S2CID 18784638.
↑ Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128 . PMID 3628002.
↑ Wawrzyn GT, Bloch SE, Schmidt-Dannert C (2012-01-01). "Discovery and characterization of terpenoid biosynthetic pathways of fungi". In Hopwood (ed.). Natural Product Biosynthesis by Microorganisms and Plants, Part A. Methods in Enzymology. Vol. 515. pp. 83–105. doi:10.1016/b978-0-12-394290-6.00005-7. ISBN 978-0-12-394290-6. PMID 22999171.
↑ Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 (2) S19. doi: 10.1186/1471-2105-7-S2-S19 . PMC 1683565 . PMID 17118140.
↑ Emery AE (2013). "Personal Memories of David Rimoin". Emery and Rimoin's Principles and Practice of Medical Genetics. Elsevier. p. i. doi:10.1016/b978-0-12-383834-6.11001-8. ISBN 978-0-12-383834-6.
↑ Aouf M, Liyanage L (2012-09-26). "Analysis of High Dimensionality Yeast Gene Expression Data Using Data Mining". Applied Mechanics and Materials. 197: 515–522. Bibcode:2012AMM...197..515A. doi:10.4028/www.scientific.net/amm.197.515. S2CID 109965976.
↑ Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128 . PMID 3628002.
↑ Mihali TK, Carmichael WW, Neilan BA (February 2011). "A putative gene cluster from a Lyngbya wollei bloom that encodes paralytic shellfish toxin biosynthesis". PLOS ONE. 6 (2) e14657. Bibcode:2011PLoSO...614657M. doi: 10.1371/journal.pone.0014657 . PMC 3037375 . PMID 21347365.
↑ Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 Suppl 2 (2) S19. doi: 10.1186/1471-2105-7-S2-S19 . PMC 1683565 . PMID 17118140.
↑ Denison M, Perlman S (April 1987). "Identification of putative polymerase gene product in cells infected with murine coronavirus A59". Virology. 157 (2): 565–8. doi:10.1016/0042-6822(87)90303-5. PMC 7131660 . PMID 3029990.

External links

NCBI ORF finder for analyzing ORFs from nucleotide sequences

This genetics article is a stub. You can help Wikipedia by expanding it.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Alexandre S, Guyaux M, Murphy NB, Coquelet H, Pays A, Steinert M, Pays E (June 1988). "Putative genes of a variant-specific antigen gene transcription unit in Trypanosoma brucei". Molecular and Cellular Biology. 8 (6): 2367–78. doi:10.1128/mcb.8.6.2367. PMC 363435 . PMID 3405209.

[2] Mishima K, Hirao T, Tsubomura M, Tamura M, Kurita M, Nose M, et al. (April 2018). "Identification of novel putative causative genes and genetic marker for male sterility in Japanese cedar (Cryptomeria japonica D.Don)". BMC Genomics. 19 (1) 277. doi: 10.1186/s12864-018-4581-5 . PMC 5914023 . PMID 29685102.

[3] Dulac C, Axel R (October 1995). "A novel family of genes encoding putative pheromone receptors in mammals". Cell. 83 (2): 195–206. doi: 10.1016/0092-8674(95)90161-2 . PMID 7585937. S2CID 18784638.

[:0-4] Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128 . PMID 3628002.

[5] Wawrzyn GT, Bloch SE, Schmidt-Dannert C (2012-01-01). "Discovery and characterization of terpenoid biosynthetic pathways of fungi". In Hopwood (ed.). Natural Product Biosynthesis by Microorganisms and Plants, Part A. Methods in Enzymology. Vol. 515. pp. 83–105. doi:10.1016/b978-0-12-394290-6.00005-7. ISBN 978-0-12-394290-6. PMID 22999171.

[:1-6] Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 (2) S19. doi: 10.1186/1471-2105-7-S2-S19 . PMC 1683565 . PMID 17118140.

[7] Emery AE (2013). "Personal Memories of David Rimoin". Emery and Rimoin's Principles and Practice of Medical Genetics. Elsevier. p. i. doi:10.1016/b978-0-12-383834-6.11001-8. ISBN 978-0-12-383834-6.

[8] Aouf M, Liyanage L (2012-09-26). "Analysis of High Dimensionality Yeast Gene Expression Data Using Data Mining". Applied Mechanics and Materials. 197: 515–522. Bibcode:2012AMM...197..515A. doi:10.4028/www.scientific.net/amm.197.515. S2CID 109965976.

[:02-9] Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128 . PMID 3628002.

[10] Mihali TK, Carmichael WW, Neilan BA (February 2011). "A putative gene cluster from a Lyngbya wollei bloom that encodes paralytic shellfish toxin biosynthesis". PLOS ONE. 6 (2) e14657. Bibcode:2011PLoSO...614657M. doi: 10.1371/journal.pone.0014657 . PMC 3037375 . PMID 21347365.

[:12-11] Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 Suppl 2 (2) S19. doi: 10.1186/1471-2105-7-S2-S19 . PMC 1683565 . PMID 17118140.

[12] Denison M, Perlman S (April 1987). "Identification of putative polymerase gene product in cells infected with murine coronavirus A59". Virology. 157 (2): 565–8. doi:10.1016/0042-6822(87)90303-5. PMC 7131660 . PMID 3029990.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

v t e Genetics
Introduction Outline History Timeline Index Glossary
Key components	Chromosome DNA RNA Genome Heredity Nucleotide Mutation Genetic variation Allele Amino acid
Fields	Classical Conservation Cytogenetics Ecological Immunogenetics Microbial Molecular Population Quantitative
Archaeogenetics of	Africa the Americas the British Isles Europe Italy the Middle East South Asia
Related topics	Behavioural genetics Epigenetics Geneticist Genome editing Genomics Genetic code Genetic engineering Genetic diversity Genetic monitoring Genetic genealogy Heredity He Jiankui genome editing incident Medical genetics Missing heritability problem Molecular evolution Plant genetics Population genomics Reverse genetics
Lists	List of genetic codes List of genetics research organizations
Category