Family with sequence similarity 104, member A is a protein that in humans is encoded by the FAM104A gene. [1] The orthologous gene in mice is also known as D11Wsu99e. [1]
The X chromosome is one of the two sex chromosomes in many organisms, including mammals, and is found in both males and females. It is a part of the XY sex-determination system and XO sex-determination system. The X chromosome was named for its unique properties by early researchers, which resulted in the naming of its counterpart Y chromosome, for the next letter in the alphabet, following its subsequent discovery.
A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be confused with family as it is used in taxonomy.
A gene family is a set of several similar genes, formed by duplication of a single original gene, and generally with similar biochemical functions. One such family are the genes for human hemoglobin subunits; the ten genes are in two clusters on different chromosomes, called the α-globin and β-globin loci. These two gene clusters are thought to have arisen as a result of a precursor gene being duplicated approximately 500 million years ago.
In computational biology, gene prediction or gene finding refers to the process of identifying the regions of genomic DNA that encode genes. This includes protein-coding genes as well as RNA genes, but may also include prediction of other functional elements such as regulatory regions. Gene finding is one of the first and most important steps in understanding the genome of a species once it has been sequenced.
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a speciation event (orthologs), or a duplication event (paralogs), or else a horizontal gene transfer event (xenologs).
An evolutionary lineage is a temporal series of populations, organisms, cells, or genes connected by a continuous line of descent from ancestor to descendant. Lineages are subsets of the evolutionary tree of life. Lineages are often determined by the techniques of molecular systematics.
Chromosome 1 is the designation for the largest human chromosome. Humans have two copies of chromosome 1, as they do with all of the autosomes, which are the non-sex chromosomes. Chromosome 1 spans about 249 million nucleotide base pairs, which are the basic units of information for DNA. It represents about 8% of the total DNA in human cells.
Chromosome 2 is one of the twenty-three pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 2 is the second-largest human chromosome, spanning more than 242 million base pairs and representing almost eight percent of the total DNA in human cells.
Chromosome 3 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 3 spans more than 198 million base pairs and represents about 6.5 percent of the total DNA in cells.
Chromosome 4 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 4 spans more than 190 million base pairs and represents between 6 and 6.5 percent of the total DNA in cells.
Chromosome 8 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 8 spans about 146 million base pairs and represents between 4.5 and 5.0% of the total DNA in cells.
Chromosome 10 is one of the 23 pairs of chromosomes in humans. People normally have two copies of this chromosome. Chromosome 10 spans about 134 million base pairs and represents between 4 and 4.5 percent of the total DNA in cells.
SOX genes encode a family of transcription factors that bind to the minor groove in DNA, and belong to a super-family of genes characterized by a homologous sequence called the HMG-box. This HMG box is a DNA binding domain that is highly conserved throughout eukaryotic species. Homologues have been identified in insects, nematodes, amphibians, reptiles, birds and a range of mammals. However, HMG boxes can be very diverse in nature, with only a few amino acids being conserved between species.
HomoloGene, a tool of the United States National Center for Biotechnology Information (NCBI), is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
INT-2 proto-oncogene protein also known as FGF-3 is a protein that in humans is encoded by the FGF3 gene.
Relaxin/insulin-like family peptide receptor 3, also known as RXFP3, is a human G-protein coupled receptor.
Platelet-derived growth factor D is a protein that in humans is encoded by the PDGFD gene.
Homeobox protein DLX-2 is a protein that in humans is encoded by the DLX2 gene.
A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred. Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similarity is evident. Sequence homology can then be deduced even if not apparent. Superfamilies typically contain several protein families which show sequence similarity within each family. The term protein clan is commonly used for protease and glycosyl hydrolases superfamilies based on the MEROPS and CAZy classification systems.
Family with sequence similarity 104 member B is a protein that in humans is encoded by the FAM104B gene.