Julio Collado-Vides

Last updated
Julio Collado-Vides
Born
Pedro Julio Collado-Vides

(1957-02-14) February 14, 1957 (age 64)
Alma mater National Autonomous University of Mexico
Awards Guggenheim Fellowship (1992)
Scientific career
Fields
Thesis  (1989)

Julio Collado-Vides is a Guatemalan scientist and Professor of Computational Genomics at the National Autonomous University of Mexico. His research focuses on genomics and bioinformatics. [1] [2]

Contents

Education

Collado-Vides studied at the National Autonomous University of Mexico, gaining a BSc in biomedical science (1983), an MSc in physical chemistry (1985) and a PhD in biomathematics (1989). Following his PhD, he carried out postdoctoral studies at the Massachusetts Institute of Technology (MIT). [1]

Work and research

Collado-Vides' research focuses on the regulation of gene expression in bacteria, [1] in particular, creating a model of the regulatory network of E. coli K-12. The RegulonDB database (released in 2010) makes this data available. [3] He has also been involved in the construction of the EcoCyc E. coli database [4] [5] and the completion of the sequenced E. coli genome. [6]

He was the founding President of the Mexican Society of Genomics. [7]

Awards and honours

Collado-Vides is a member of the Mexican Academy of Sciences [8] and was elected a Fellow of the International Society for Computational Biology in 2015. [9]

Books

Related Research Articles

Biological database

Biological databases are libraries of life sciences information, collected from scientific experiments, published literature, high-throughput experiment technology, and computational analysis. They contain information from research areas including genomics, proteomics, metabolomics, microarray gene expression, and phylogenetics. Information contained in biological databases includes gene function, structure, localization, clinical effects of mutations as well as similarities of biological sequences and structures.

Metabolome

The metabolome refers to the complete set of small-molecule chemicals found within a biological sample. The biological sample can be a cell, a cellular organelle, an organ, a tissue, a tissue extract, a biofluid or an entire organism. The small molecule chemicals found in a given metabolome may include both endogenous metabolites that are naturally produced by an organism as well as exogenous chemicals that are not naturally produced by an organism.

Start codon First codon of a messenger RNA transcript translated by a ribosome

The start codon is the first codon of a messenger RNA (mRNA) transcript translated by a ribosome. The start codon always codes for methionine in eukaryotes and Archaea and a N-formylmethionine (fMet) in bacteria, mitochondria and plastids. The most common start codon is AUG.

David J. Lipman

David J. Lipman is an American biologist who from 1989 to 2017 was the Director of the National Center for Biotechnology Information (NCBI) at the National Institutes of Health. NCBI is the home of GenBank, the U.S. node of the International Sequence Database Consortium, and PubMed, one of the most heavily used sites in the world for the search and retrieval of biomedical information. Lipman is one of the original authors of the BLAST sequence alignment program, and a respected figure in bioinformatics. In 2017, he left NCBI and became Chief Science Officer at Impossible Foods.

The hisB gene, found in the enterobacteria, in Campylobacter jejuni and in Xylella/Xanthomonas encodes a protein involved in catalysis of two step in histidine biosynthesis, namely the bifunctional Imidazoleglycerol-phosphate dehydratase/histidinol-phosphatase.

In bioinformatics EcoCyc is a biological database for the bacterium Escherichia coli K-12. The EcoCyc project performs literature-based curation of the E. coli genome, and of E. coli transcriptional regulation, transporters, and metabolic pathways. EcoCyc contains written summaries of E. coli genes, distilled from over 36,000 scientific articles. EcoCyc is also a description of the genome and cellular networks of E. coli that supports scientists to carry out computational analyses.

The BioCyc database collection is an assortment of organism specific Pathway/ Genome Databases (PGDBs). They provide reference to genome and metabolic pathway information for thousands of organisms. As of December 2016, there are 9300 databases within BioCyc. SRI International, based in Menlo Park, California, maintains the BioCyc database family.

FnrS RNA

FnrS RNA is a family of Hfq-binding small RNA whose expression is upregulated in response to anaerobic conditions. It is named FnrS because its expression is strongly dependent on fumarate and nitrate reductase regulator (FNR), a direct oxygen availability sensor.

RegulonDB is a database of the regulatory network of gene expression in Escherichia coli K-12. RegulonDB also models the organization of the genes in transcription units, operons and regulons. A total of 120 sRNAs with 231 total interactions which all together regulate 192 genes are also included. RegulonDB was founded in 1998 and also contributes data to the EcoCyc database.

αr7 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacterial species from the order Rhizobiales. The first member of this family was found in a Sinorhizobium meliloti 1021 locus located in the chromosome (C). Further homology and structure conservation analysis identified full-length homologs in several nitrogen-fixing symbiotic rhizobia, in the plant pathogens belonging to Agrobacterium species as well as in a broad spectrum of Brucella species. αr7 RNA species are 134-159 nucleotides (nt) long and share a well defined common secondary structure. αr7 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions (IGRs) of the α-proteobacterial genomes.

αr9 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacteria from the order Rhizobiales. The first member of this family (Smr9C) was found in a Sinorhizobium meliloti 1021 locus located in the chromosome (C). Further homology and structure conservation analysis have identified full-length Smr9C homologs in several nitrogen-fixing symbiotic rhizobia, in the plant pathogens belonging to Agrobacterium species as well as in a broad spectrum of Brucella species. αr9C RNA species are 144-158 nt long and share a well defined common secondary structure consisting of seven conserved regions. Most of the αr9 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions (IGRs) of the α-proteobacterial genomes.

αr14 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacteria. The first member of this family (Smr14C2) was found in a Sinorhizobium meliloti 1021 locus located in the chromosome (C). It was later renamed NfeR1 and shown to be highly expressed in salt stress and during the symbiotic interaction on legume roots. Further homology and structure conservation analysis identified 2 other chromosomal copies and 3 plasmidic ones. Moreover, full-length Smr14C homologs have been identified in several nitrogen-fixing symbiotic rhizobia, in the plant pathogens belonging to Agrobacterium species as well as in a broad spectrum of Brucella species. αr14C RNA species are 115-125 nt long and share a well defined common secondary structure. Most of the αr14 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions (IGRs) of the α-proteobacterial genomes.

αr15 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacteria from the order Rhizobiales. The first members of this family were found tandemly arranged in the same intergenic region (IGR) of the Sinorhizobium meliloti 1021 chromosome (C). Further homology and structure conservation analysis have identified full-length Smr15C1 and Smr15C2 homologs in several nitrogen-fixing symbiotic rhizobia, in the plant pathogens belonging to Agrobacterium species as well as in a broad spectrum of Brucella species. The Smr15C1 and Smr15C2 homologs are also encoded in tandem within the same IGR region of Rhizobium and Agrobacterium species, whereas in Brucella species the αr15C loci are spread in the IGRs of Chromosome I. Moreover, this analysis also identified a third αr15 loci in extrachromosomal replicons of the mentioned nitrogen-fixing α-proteobacteria and in the Chromosome II of Brucella species. αr15 RNA species are 99-121 nt long and share a well defined common secondary structure consisting of three stem loops. The transcripts of the αr15 family can be catalogued as trans-acting sRNAs encoded by independent transcription units with recognizable promoter and transcription termination signatures within intergenic regions (IGRs) of the α-proteobacterial genomes.

αr35 is a family of bacterial small non-coding RNAs with representatives in a reduced group of α-proteobacteria from the order Rhizobiales. The first member of this family (Smr35B) was found in a Sinorhizobium meliloti 1021 locus located in the symbiotic plasmid B (pSymB). Further homology and structure conservation analysis have identified full-length SmrB35 homologs in other legume symbionts, as well as in the human and plant pathogens Ochrobactrum anthropi and Agrobacterium tumefaciens, respectively. αr35 RNA species are 139-142 nt long and share a common secondary structure consisting of two stem loops and a well conserved rho independent terminator. Most of the αr35 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions of the α-proteobacterial genomes.

αr45 is a family of bacterial small non-coding RNAs with representatives in a broad group of α-proteobacteria from the order Rhizobiales. The first member of this family (Smr45C) was found in a Sinorhizobium meliloti 1021 locus located in the chromosome (C). Further homology and structure conservation analysis identified homologs in several nitrogen-fixing symbiotic rhizobia, in the plant pathogens belonging to Agrobacterium species as well as in a broad spectrum of Brucella species, in Bartonella species, in several members of the Xanthobactereacea family, and in some representatives of the Beijerinckiaceae family. αr45C RNA species are 147-153 nt long and share a well defined common secondary structure. All of the αr45 transcripts can be catalogued as trans-acting sRNAs expressed from well-defined promoter regions of independent transcription units within intergenic regions (IGRs) of the α-proteobacterial genomes.

Gary Stormo

Gary Stormo is an American geneticist and currently Joseph Erlanger Professor in the Department of Genetics and the Center for Genome Sciences and Systems Biology at Washington University School of Medicine in St Louis. He is considered as one of the pioneers of bioinformatics and genomics. His research combines experimental and computational approaches in order to identify and predict regulatory sequences in DNA and RNA, and their contributions to the regulatory networks that control gene expression.

The CyberCell Database (CCDB) is a freely available, web-accessible database that provides quantitative genomic, proteomic as well metabolomic data on Escherichia coli. Escherichia coli is perhaps the best-studied bacterium on the planet and has been the organism of choice for several international efforts in cell simulation. These cell simulation efforts require up-to-date web-accessible resources that provide comprehensive, non-redundant, and quantitative data on this bacterium. The intent of CCDB is to facilitate the collection, revision, coordination and storage of the key information required for in silico E. coli simulation.

Monica Riley was an American scientist who contributed to the discovery of messenger RNA in her Ph.D work with Arthur Pardee, and was later a pioneer in the exploration and computer representation of the Escherichia coli genome.

Model organism databases (MODs) are biological databases, or knowledgebases, dedicated to the provision of in-depth biological data for intensively studied model organisms. MODs allow researchers to easily find background information on large sets of genes, plan experiments efficiently, combine their data with existing knowledge, and construct novel hypotheses. They allow users to analyse results and interpret datasets, and the data they generate are increasingly used to describe less well studied species. Where possible, MODs share common approaches to collect and represent biological information. For example, all MODs use the Gene Ontology (GO) to describe functions, processes and cellular locations of specific gene products. Projects also exist to enable software sharing for curation, visualization and querying between different MODs. Organismal diversity and varying user requirements however mean that MODs are often required to customize capture, display, and provision of data.

Peter D. Karp is director of the Bioinformatics Research Group at SRI International in Menlo Park, California. Karp leads the development of the BioCyc database collection. BioCyc databases combine genome, metabolic pathway, and regulatory information for thousands of organisms.

References

  1. 1 2 3 "Personal information | Center for Genomic Sciences". www.ccg.unam.mx. Retrieved 12 September 2016.
  2. Julio Collado-Vides publications indexed by Google Scholar
  3. Gama-Castro S, Salgado H, Peralta-Gil M, Santos-Zavaleta A, Muñiz-Rascado L, Solano-Lira H, Jimenez-Jacinto V, Weiss V, García-Sotelo JS, López-Fuentes A, Porrón-Sotelo L, Alquicira-Hernández S, Medina-Rivera A, Martínez-Flores I, Alquicira-Hernández K, Martínez-Adame R, Bonavides-Martínez C, Miranda-Ríos J, Huerta AM, Mendoza-Vargas A, Collado-Torres L, Taboada B, Vega-Alvarado L, Olvera M, Olvera L, Grande R, Morett E, Collado-Vides J (Jan 2011). "RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor Units)". Nucleic Acids Res. England. 39 (Database issue): D98-105. doi:10.1093/nar/gkq1110. PMC   3013702 . PMID   21051347.
  4. Karp, P. D. (1 January 2002). "The EcoCyc Database". Nucleic Acids Research. 30 (1): 56–58. doi:10.1093/nar/30.1.56. PMC   99147 . PMID   11752253.
  5. Keseler, I. M. (17 December 2004). "EcoCyc: a comprehensive database resource for Escherichia coli". Nucleic Acids Research. 33 (Database issue): D334–D337. doi:10.1093/nar/gki108. PMC   540062 . PMID   15608210.
  6. Blattner, F. R. (5 September 1997). "The Complete Genome Sequence of Escherichia coli K-12". Science. 277 (5331): 1453–1462. doi: 10.1126/science.277.5331.1453 . PMID   9278503.
  7. "Julio Collado‐Vides". drclas.harvard.edu. Retrieved 12 September 2016.
  8. "miembros_alfabetico2014.pdf" (PDF). Retrieved 12 September 2016.
  9. "Feb 20, 2015: Meet the ISCB Fellows Class of 2015". www.iscb.org. Retrieved 12 September 2016.