AlphaGenome is a large-scale deep-learning system developed by DeepMind to predict how segments of DNA regulate gene expression and how single-nucleotide variants may disrupt that regulation. [1] [2] [3] It was launched on 25 June 2025. [4] Subject to following the terms of use, the API can be used free of charge, for non-commercial use. [5]
Ever since the first draft of the human genome appeared in 2001, most of its over 3 billion letters have remained functionally opaque. [6] [2] AlphaGenome extends DeepMind's "Alpha" line of models beyond protein folding and chip design into the longstanding puzzle of the genome's non-coding dark matter. [2]
AlphaGenome accepts stretches of up to one million base pairs, orders of magnitude longer than typical sequence-to-function models, and produces thousands of quantitative predictions, including gene-expression levels, chromatin accessibility, three-dimensional genome contacts and RNA-splicing junctions. [2] The model was trained using genomic data from humans and mice, and its effectiveness on other organisms has not been tested. [2] The model is able to predict what impact changing a single letter/base in the million base sequence has on the outcome. [7]
The model can predict across 11 modalities concurrently: RNA-seq, CAGE and PRO-cap (gene expression), splice sites, splice site usage and splice junctions (splicing patterns), DNase, ATAC-seq, histone modifications and transcription factor binding (chromatin state) and chromatin contact maps. An evaluation of 24 genome tracks, using all 11 modalities found that AlphaGenome outperformed the best alternative models in 22 of these evaluations. [8] Current limitations include difficulty in predicting the effects of mutations on genes located more than 100,000 base pairs away. [2]