International Nucleotide Sequence Database Collaboration

Last updated July 20, 2025

The International Nucleotide Sequence Database Collaboration (INSDC) consists of a joint effort to collect and disseminate databases containing DNA and RNA sequences.^[1] It involves the following computerized databases: NIG's DNA Data Bank of Japan (Japan), NCBI's GenBank (USA) and the EMBL-EBI's European Nucleotide Archive (EMBL). New and updated data on nucleotide sequences contributed by research teams to each of the three databases are synchronized on a daily basis through continuous interaction between the staff at each the collaborating organizations.

All of the data in INSDC is available for free and unrestricted access, for any purpose, with no restrictions on analysis, redistribution, or re-publication of the data. This policy has been a foundational principle of the INSDC since its inception.^[2] Since the 1990s, most of the world's major scientific journals have required that sequence data be deposited in an INSDC database as a pre-condition for publication.

The DDBJ/EMBL-EBI/GenBank synchronization is maintained according to a number of guidelines which are produced and published by an International Advisory Board.^[3] The guidelines consist of a common definition of the feature tables ^[4] for the databases, which regulate the content and syntax of the database entries,^[5] in the form of a common DTD (Document Type Definition).

The syntax is called INSDSeq and its core consists of the letter sequence of the gene expression (amino acid sequence) and the letter sequence for nucleotide bases in the gene or decoded segment. In a DBFetch operation shows a typical INSD entry at the EMBL-EBI database;^[6] the same entry at NCBI.^[7]

References

↑ Karsch-Mizrachi, I.; Nakamura, Y.; Cochrane, G.; International Nucleotide Sequence Database Collaboration (2011). "The International Nucleotide Sequence Database Collaboration". Nucleic Acids Research. 40 (Database issue): D33 –D37. doi:10.1093/nar/gkr1006. PMC 3244996 . PMID 22080546.
↑ Brunak, Soren; Danchin, Antoine; Hattori, Masahira; Nakamura, Haruki; Shinozaki, Kazuo; Matise, Tara; Preuss, Daphne (15 November 2002). "Nucleotide sequence database policies". Science. 298 (5597): 1333. doi:10.1126/science.298.5597.1333b. ISSN 1095-9203. PMID 12436968. S2CID 42740562.
↑ "INSDC :: Advisors". Archived from the original on 2007-12-09. Retrieved 2019-06-29.
↑ "The DDBJ/ENA/GenBank Feature Table Definition". Ebi.ac.uk. Archived from the original on 2005-03-24. Retrieved 2019-06-29.
↑ "European Nucleotide Archive < EMBL-EBI". www.ebi.ac.uk.
↑ "Database Browsing". Archived from the original on 2005-02-12. Retrieved 2005-03-02.
↑ USA (2019-05-06). "Trifolium repens mRNA for non-cyanogenic beta-glucosidase - Nucleotide - NCBI". Ncbi.nlm.nih.gov. Retrieved 2019-06-29.

External links

Official site

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Karsch-Mizrachi, I.; Nakamura, Y.; Cochrane, G.; International Nucleotide Sequence Database Collaboration (2011). "The International Nucleotide Sequence Database Collaboration". Nucleic Acids Research. 40 (Database issue): D33 –D37. doi:10.1093/nar/gkr1006. PMC 3244996 . PMID 22080546.

[2] Brunak, Soren; Danchin, Antoine; Hattori, Masahira; Nakamura, Haruki; Shinozaki, Kazuo; Matise, Tara; Preuss, Daphne (15 November 2002). "Nucleotide sequence database policies". Science. 298 (5597): 1333. doi:10.1126/science.298.5597.1333b. ISSN 1095-9203. PMID 12436968. S2CID 42740562.

[3] "INSDC :: Advisors". Archived from the original on 2007-12-09. Retrieved 2019-06-29.

[4] "The DDBJ/ENA/GenBank Feature Table Definition". Ebi.ac.uk. Archived from the original on 2005-03-24. Retrieved 2019-06-29.

[5] "European Nucleotide Archive < EMBL-EBI". www.ebi.ac.uk.

[6] "Database Browsing". Archived from the original on 2005-02-12. Retrieved 2005-03-02.

[7] USA (2019-05-06). "Trifolium repens mRNA for non-cyanogenic beta-glucosidase - Nucleotide - NCBI". Ncbi.nlm.nih.gov. Retrieved 2019-06-29.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

v t e Bioinformatics
Databases	Sequence databases: GenBank, European Nucleotide Archive, DNA Data Bank of Japan and China National GeneBank Secondary databases: UniProt, database of protein sequences grouping together Swiss-Prot, TrEMBL and Protein Information Resource Other databases: BioNumbers, Protein Data Bank, Ensembl, InterPro, KEGG, and Gene Ontology Specialised genomic databases: BOLD, Saccharomyces Genome Database, FlyBase, VectorBase, WormBase, Rat Genome Database, PHI-base, Arabidopsis Information Resource, GISAID and Zebrafish Information Network
Software	BLAST Bowtie Clustal EMBOSS HMMER MUSCLE PANGOLIN SAMtools SOAP suite TopHat
Other	Server: ExPASy Rosalind (education platform)
Institutions	Broad Institute Computational Biology Department (CBD) Microsoft Research - University of Trento Centre for Computational and Systems Biology (COSBI) Database Center for Life Science (DBCLS) DNA Data Bank of Japan (DDBJ) European Bioinformatics Institute (EMBL-EBI) European Molecular Biology Laboratory (EMBL) Flatiron Institute J. Craig Venter Institute (JCVI) Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG) US National Center for Biotechnology Information (NCBI) Japanese Institute of Genetics Netherlands Bioinformatics Centre (NBIC) Philippine Genome Center (PGC) Scripps Research Swiss Institute of Bioinformatics (SIB) Wellcome Sanger Institute Whitehead Institute
Organizations	African Society for Bioinformatics and Computational Biology (ASBCB) Australia Bioinformatics Resource (EMBL-AR) European Molecular Biology network (EMBnet) International Nucleotide Sequence Database Collaboration (INSDC) International Society for Biocuration (ISB) International Society for Computational Biology (ISCB) Student Council (ISCB-SC) Institute of Genomics and Integrative Biology (CSIR-IGIB) Japanese Society for Bioinformatics (JSBi)
Meetings	Basel Computational Biology Conference‎ ([BC²]) European Conference on Computational Biology (ECCB) Intelligent Systems for Molecular Biology (ISMB) International Conference on Bioinformatics (InCoB) International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB) ISCB Africa ASBCB Conference on Bioinformatics Pacific Symposium on Biocomputing (PSB) Research in Computational Molecular Biology (RECOMB)
File formats	CRAM format FASTA format FASTQ format NeXML format Nexus format Pileup format SAM format Stockholm format VCF format GFF format GTF format
Related topics	Computational biology List of biobanks List of biological databases Molecular phylogenetics Sequencing Sequence database Sequence alignment
Category Commons

International Nucleotide Sequence Database Collaboration

Contents

See also

References

External links

External links