Metabolomics is the scientific study of chemical processes involving metabolites, the small molecule substrates, intermediates, and products of cell metabolism. Specifically, metabolomics is the "systematic study of the unique chemical fingerprints that specific cellular processes leave behind", the study of their small-molecule metabolite profiles. [1] The metabolome represents the complete set of metabolites in a biological cell, tissue, organ, or organism, which are the end products of cellular processes. [2] Messenger RNA (mRNA), gene expression data, and proteomic analyses reveal the set of gene products being produced in the cell, data that represents one aspect of cellular function. In contrast, metabolic profiling can give an instantaneous snapshot of the physiology of that cell, [3] and thus metabolomics provides a direct "functional readout of the physiological state" of an organism. [4] There are quantifiable correlations between the metabolome and the other cellular ensembles (genome, transcriptome, proteome, and lipidome), which can be used to predict metabolite abundances in biological samples from, for example, mRNA abundances. [5] One of the ultimate challenges of systems biology is to integrate metabolomics with all other -omics information to provide a better understanding of cellular biology.
The concept that individuals might have a "metabolic profile" that could be reflected in the makeup of their biological fluids was introduced by Roger Williams in the late 1940s, [6] who used paper chromatography to suggest characteristic metabolic patterns in urine and saliva were associated with diseases such as schizophrenia. However, it was only through technological advancements in the 1960s and 1970s that it became feasible to quantitatively (as opposed to qualitatively) measure metabolic profiles. [7] The term "metabolic profile" was introduced by Horning, et al. in 1971 after they demonstrated that gas chromatography-mass spectrometry (GC-MS) could be used to measure compounds present in human urine and tissue extracts. [8] [9] The Horning group, along with that of Linus Pauling and Arthur B. Robinson led the development of GC-MS methods to monitor the metabolites present in urine through the 1970s. [10]
Concurrently, NMR spectroscopy, which was discovered in the 1940s, was also undergoing rapid advances. In 1974, Seeley et al. demonstrated the utility of using NMR to detect metabolites in unmodified biological samples. [11] This first study on muscle highlighted the value of NMR in that it was determined that 90% of cellular ATP is complexed with magnesium. As sensitivity has improved with the evolution of higher magnetic field strengths and magic angle spinning, NMR continues to be a leading analytical tool to investigate metabolism. [8] [12] Recent efforts to utilize NMR for metabolomics have been largely driven by the laboratory of Jeremy K. Nicholson at Birkbeck College, University of London and later at Imperial College London. In 1984, Nicholson showed 1H NMR spectroscopy could potentially be used to diagnose diabetes mellitus, and later pioneered the application of pattern recognition methods to NMR spectroscopic data. [13] [14]
In 1994 and 1996, liquid chromatography mass spectrometry metabolomics experiments [15] [16] were performed by Gary Siuzdak while working with Richard Lerner (then president of the Scripps Research Institute) and Benjamin Cravatt, to analyze the cerebral spinal fluid from sleep deprived animals. One molecule of particular interest, oleamide, was observed and later shown to have sleep inducing properties. This work is one of the earliest such experiments combining liquid chromatography and mass spectrometry in metabolomics.
In 2005, the first metabolomics tandem mass spectrometry database, METLIN, [17] [18] for characterizing human metabolites was developed in the Siuzdak laboratory at the Scripps Research Institute. METLIN has since grown; as of December 2023, it contains MS/MS experimental data on over 930,000 molecular standards and other chemical entities, [19] each compound having experimental tandem mass spectrometry data generated from molecular standards at multiple collision energies and in positive and negative ionization modes. METLIN is the largest repository of tandem mass spectrometry data of its kind. The dedicated academic journal Metabolomics first appeared in 2005, founded by its current editor-in-chief Roy Goodacre.
In 2005, the Siuzdak lab was engaged in identifying metabolites associated with sepsis. To address the problem of statistically identifying the most relevant dysregulated metabolites across hundreds of LC/MS datasets, the lab developed the first algorithm to allow for the nonlinear alignment of mass spectrometry metabolomics data. Called XCMS, [20] it has since (2012) [21] been developed as an online tool and as of 2019 (with METLIN) has over 30,000 registered users.
On 23 January 2007, the Human Metabolome Project, led by David S. Wishart, completed the first draft of the human metabolome, consisting of a database of approximately 2,500 metabolites, 1,200 drugs and 3,500 food components. [22] [23] Similar projects have been underway in several plant species, most notably Medicago truncatula [24] and Arabidopsis thaliana [25] for several years.
As late as mid-2010, metabolomics was still considered an "emerging field". [26] It was also noted that further progress in the field depended in large part on the technical evolution of mass spectrometry instrumentation to address otherwise "irresolvable technical challenges". [26]
In 2015, real-time metabolome profiling was demonstrated for the first time. [27]
The metabolome refers to the complete set of small-molecule (<1.5 kDa) [22] metabolites (such as metabolic intermediates, hormones and other signaling molecules, and secondary metabolites) to be found within a biological sample, such as a single organism. [28] [29] The word was coined in analogy with transcriptomics and proteomics; like the transcriptome and the proteome, the metabolome is dynamic, changing from second to second. Although the metabolome can be defined readily enough, it is not currently possible to analyse the entire range of metabolites by a single analytical method.
In January 2007, scientists at the University of Alberta and the University of Calgary completed the first draft of the human metabolome. The Human Metabolome Database (HMDB) is perhaps the most extensive public metabolomic spectral database to date [30] and is a freely available electronic database (www.hmdb.ca) containing detailed information about small molecule metabolites found in the human body. It is intended to be used for applications in metabolomics, clinical chemistry, biomarker discovery and general education. The database is designed to contain or link three kinds of data: chemical data, clinical data, and molecular biology/biochemistry data.
The database contains 220,945 metabolite entries including both water-soluble and lipid soluble metabolites. Additionally, 8,610 protein sequences (enzymes and transporters) are linked to these metabolite entries. Each MetaboCard entry contains 130 data fields with 2/3 of the information being devoted to chemical/clinical data and the other 1/3 devoted to enzymatic or biochemical data. [31] The version 3.5 of the HMDB contains >16,000 endogenous metabolites, >1,500 drugs and >22,000 food constituents or food metabolites. [32] This information, available at the Human Metabolome Database and based on analysis of information available in the current scientific literature, is far from complete. [33] In contrast, much more is known about the metabolomes of other organisms. For example, over 50,000 metabolites have been characterized from the plant kingdom, and many thousands of metabolites have been identified and/or characterized from single plants. [34] [35]
Each type of cell and tissue has a unique metabolic ‘fingerprint’ that can elucidate organ or tissue-specific information. Bio-specimens used for metabolomics analysis include, but are not limited to, plasma, serum, urine, saliva, feces, muscle, sweat, exhaled breath and gastrointestinal fluid. [36] The ease of collecting such specimens facilitates high temporal resolution, and because they are always at dynamic equilibrium with the body, they can describe the host as a whole. [37] The genome can tell what could happen, the transcriptome what appears to be happening, the proteome what makes it happen, and the metabolome what has happened and what is happening. [38]
Metabolites are the substrates, intermediates and products of metabolism. Within the context of metabolomics, a metabolite is usually defined as any molecule less than 1.5 kDa in size. [22] However, there are exceptions to this depending on the sample and detection method. For example, macromolecules such as lipoproteins and albumin are reliably detected in NMR-based metabolomics studies of blood plasma. [39] In plant-based metabolomics, it is common to refer to "primary" and "secondary" metabolites. [3] A primary metabolite is directly involved in the normal growth, development, and reproduction. A secondary metabolite is not directly involved in those processes, but usually has important ecological function. Examples include antibiotics and pigments. [40] By contrast, in human-based metabolomics, it is more common to describe metabolites as being either endogenous (produced by the host organism) or exogenous. [41] [42] Metabolites of foreign substances such as drugs are termed xenometabolites. [43]
The metabolome forms a large network of metabolic reactions, where outputs from one enzymatic chemical reaction are inputs to other chemical reactions. Such systems have been described as hypercycles.[ citation needed ]
Metabonomics is defined as "the quantitative measurement of the dynamic multiparametric metabolic response of living systems to pathophysiological stimuli or genetic modification". The word origin is from the Greek μεταβολή meaning change and nomos meaning a rule set or set of laws. [44] This approach was pioneered by Jeremy Nicholson at Murdoch University and has been used in toxicology, disease diagnosis and a number of other fields. Historically, the metabonomics approach was one of the first methods to apply the scope of systems biology to studies of metabolism. [45] [46] [47]
There has been some disagreement over the exact differences between 'metabolomics' and 'metabonomics'. The difference between the two terms is not related to choice of analytical platform: although metabonomics is more associated with NMR spectroscopy and metabolomics with mass spectrometry-based techniques, this is simply because of usages amongst different groups that have popularized the different terms. While there is still no absolute agreement, there is a growing consensus that 'metabolomics' places a greater emphasis on metabolic profiling at a cellular or organ level and is primarily concerned with normal endogenous metabolism. 'Metabonomics' extends metabolic profiling to include information about perturbations of metabolism caused by environmental factors (including diet and toxins), disease processes, and the involvement of extragenomic influences, such as gut microflora. This is not a trivial difference; metabolomic studies should, by definition, exclude metabolic contributions from extragenomic sources, because these are external to the system being studied. However, in practice, within the field of human disease research there is still a large degree of overlap in the way both terms are used, and they are often in effect synonymous. [48]
Exometabolomics, or "metabolic footprinting", is the study of extracellular metabolites. It uses many techniques from other subfields of metabolomics, and has applications in biofuel development, bioprocessing, determining drugs' mechanism of action, and studying intercellular interactions. [49]
The typical workflow of metabolomics studies is shown in the figure. First, samples are collected from tissue, plasma, urine, saliva, cells, etc. Next, metabolites are extracted, often with the addition of internal standards and derivatization. [38] During sample analysis, metabolites are quantified (by liquid chromatography or gas chromatography coupled with MS, and/or by NMR spectroscopy). [50] The raw output data can be used for metabolite feature extraction and further processed before statistical analysis (such as principal component analysis, PCA). Many bioinformatic tools and software packages are available to identify associations with disease states and outcomes, determine significant correlations, and characterize metabolic signatures with existing biological knowledge. [51]
Initially, analytes in a metabolomic sample comprise a highly complex mixture. This complex mixture can be simplified prior to detection by separating some analytes from others. Separation achieves various goals: analytes which cannot be resolved by the detector may be separated in this step; in MS analysis, ion suppression is reduced; the retention time of the analyte serves as information regarding its identity. This separation step is not mandatory and is often omitted in NMR and "shotgun" based approaches such as shotgun lipidomics.
Gas chromatography (GC), especially when interfaced with mass spectrometry (GC-MS), is a widely used separation technique for metabolomic analysis. GC offers very high chromatographic resolution, and can be used in conjunction with a flame ionization detector (GC/FID) or a mass spectrometer (GC-MS). The method is especially useful for identification and quantification of small and volatile molecules. [52] However, a practical limitation of GC is the requirement of chemical derivatization for many biomolecules as only volatile chemicals can be analysed without derivatization. In cases where greater resolving power is required, two-dimensional chromatography (GCxGC) can be applied.
High performance liquid chromatography (HPLC) has emerged as the most common separation technique for metabolomic analysis. With the advent of electrospray ionization, HPLC was coupled to MS. In contrast with GC, HPLC has lower chromatographic resolution, but requires no derivatization for polar molecules, and separates molecules in the liquid phase. Additionally HPLC has the advantage that a much wider range of analytes can be measured with a higher sensitivity than GC methods. [53]
Capillary electrophoresis (CE) has a higher theoretical separation efficiency than HPLC (although requiring much more time per separation), and is suitable for use with a wider range of metabolite classes than is GC. As for all electrophoretic techniques, it is most appropriate for charged analytes. [54]
Mass spectrometry (MS) is used to identify and quantify metabolites after optional separation by GC, HPLC, or CE. GC-MS was the first hyphenated technique to be developed. Identification leverages the distinct patterns in which analytes fragment. These patterns can be thought of as a mass spectral fingerprint. Libraries exist that allow identification of a metabolite according to this fragmentation pattern [ example needed ]. MS is sensitive and can be very specific. There are also a number of techniques which use MS as a stand-alone technology: the sample is infused directly into the mass spectrometer with no prior separation, and the MS provides sufficient selectivity to both separate and detect metabolites.
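Matching a measured fragmentation pattern against a library can be illustrated with a cosine similarity score between binned spectra. The following is a minimal sketch rather than the scoring used by any particular library or software package; the compound names, peak lists and the 1 Da bin width are hypothetical values chosen for illustration.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz // bin_width)] += intensity
    return binned


def cosine_score(peaks_a, peaks_b, bin_width=1.0):
    """Cosine similarity between two binned spectra (0 = no overlap, 1 = identical)."""
    a = bin_spectrum(peaks_a, bin_width)
    b = bin_spectrum(peaks_b, bin_width)
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(query, library):
    """Return the library entry with the highest score, plus all scores."""
    scores = {name: cosine_score(query, ref) for name, ref in library.items()}
    return max(scores, key=scores.get), scores


# Hypothetical reference spectra as (m/z, intensity) pairs.
library = {
    "compound_A": [(41.0, 20.0), (69.1, 100.0), (87.0, 45.0)],
    "compound_B": [(55.0, 100.0), (73.0, 60.0), (129.1, 30.0)],
}
# Hypothetical measured spectrum.
query = [(41.0, 18.0), (69.1, 95.0), (87.0, 50.0)]

name, scores = best_match(query, library)
print(name, round(scores[name], 3))
```

Fixed-width binning is the simplest way to make two peak lists comparable; practical tools typically use tolerance-based peak matching and intensity weighting instead.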
For analysis by mass spectrometry, the analytes must be imparted with a charge and transferred to the gas phase. Electron ionization (EI) is the most common ionization technique applied to GC separations as it is amenable to low pressures. EI also produces fragmentation of the analyte, which provides structural information but also increases the complexity of the data and may obscure the molecular ion. Atmospheric-pressure chemical ionization (APCI) is an atmospheric pressure technique that can be applied to all the above separation techniques. APCI is a gas phase ionization method that provides slightly more aggressive ionization than ESI and is suitable for less polar compounds. Electrospray ionization (ESI) is the most common ionization technique applied in LC/MS. This soft ionization is most successful for polar molecules with ionizable functional groups. Another commonly used soft ionization technique is secondary electrospray ionization (SESI).
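Because the instrument measures ions rather than neutral molecules, the observed mass-to-charge ratio depends on how the analyte acquired its charge. The sketch below computes m/z for a few common soft-ionization adducts from a neutral monoisotopic mass. The adduct table is an illustrative subset, the function and constant names are hypothetical, and the glucose mass is used only as a familiar example.

```python
PROTON_MASS = 1.007276        # Da, mass of a proton
SODIUM_ION_SHIFT = 22.989218  # Da, sodium atom mass minus one electron

# Adduct name -> (mass shift in Da, charge). Illustrative subset only.
ADDUCTS = {
    "[M+H]+": (PROTON_MASS, 1),
    "[M+Na]+": (SODIUM_ION_SHIFT, 1),
    "[M+2H]2+": (2 * PROTON_MASS, 2),
    "[M-H]-": (-PROTON_MASS, -1),
}


def adduct_mz(neutral_mass, adduct):
    """Return m/z for a neutral monoisotopic mass and a named adduct."""
    shift, charge = ADDUCTS[adduct]
    return (neutral_mass + shift) / abs(charge)


glucose = 180.06339  # Da, monoisotopic mass of C6H12O6
for adduct in ADDUCTS:
    print(adduct, round(adduct_mz(glucose, adduct), 4))
```

The same neutral molecule therefore appears at different m/z values in positive and negative ionization modes, which is one reason reference data are often recorded in both.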
In the 2000s, surface-based mass analysis saw a resurgence, with new MS technologies focused on increasing sensitivity, minimizing background, and reducing sample preparation. The ability to analyze metabolites directly from biofluids and tissues continues to challenge current MS technology, largely because of the limits imposed by the complexity of these samples, which contain thousands to tens of thousands of metabolites. Among the technologies being developed to address this challenge is Nanostructure-Initiator MS (NIMS), [55] [56] a desorption/ionization approach that does not require the application of matrix and thereby facilitates small-molecule (i.e., metabolite) identification. MALDI is also used; however, the application of a MALDI matrix can add significant background at < 1000 Da that complicates analysis of the low-mass range (i.e., metabolites). In addition, the size of the resulting matrix crystals limits the spatial resolution that can be achieved in tissue imaging. Because of these limitations, several other matrix-free desorption/ionization approaches have been applied to the analysis of biofluids and tissues.
Secondary ion mass spectrometry (SIMS) was one of the first matrix-free desorption/ionization approaches used to analyze metabolites from biological samples.[ citation needed ] SIMS uses a high-energy primary ion beam to desorb and generate secondary ions from a surface. The primary advantage of SIMS is its high spatial resolution (as small as 50 nm), a powerful characteristic for tissue imaging with MS. However, SIMS has yet to be readily applied to the analysis of biofluids and tissues because of its limited sensitivity at >500 Da and analyte fragmentation generated by the high-energy primary ion beam. Desorption electrospray ionization (DESI) is a matrix-free technique for analyzing biological samples that uses a charged solvent spray to desorb ions from a surface. Advantages of DESI are that no special surface is required and the analysis is performed at ambient pressure with full access to the sample during acquisition. A limitation of DESI is spatial resolution because "focusing" the charged solvent spray is difficult. However, a recent development termed laser ablation ESI (LAESI) is a promising approach to circumvent this limitation.[ citation needed ] Most recently, ion trap techniques such as orbitrap mass spectrometry are also applied to metabolomics research. [57]
Nuclear magnetic resonance (NMR) spectroscopy is the only detection technique which does not rely on separation of the analytes, and the sample can thus be recovered for further analyses. All kinds of small molecule metabolites can be measured simultaneously - in this sense, NMR is close to being a universal detector. The main advantages of NMR are high analytical reproducibility and simplicity of sample preparation. Practically, however, it is relatively insensitive compared to mass spectrometry-based techniques. [58] [59]
Although NMR and MS are the most widely used modern-day techniques for detection, there are other methods in use. These include Fourier-transform ion cyclotron resonance, [60] ion-mobility spectrometry, [61] electrochemical detection (coupled to HPLC), Raman spectroscopy and radiolabel (when combined with thin-layer chromatography).[ citation needed ]
Technology | Sensitivity (LOD) | Sample volume | Compatible with gases | Compatible with liquids | Compatible with solids | Start-up cost | Can be used in metabolite imaging (MALDI or DESI) | Advantages | Disadvantages |
---|---|---|---|---|---|---|---|---|---|
GC-MS | 0.5 μM | 0.1–0.2 mL | Yes | Yes | No | <US$300,000 | No | | |
LC-MS | 0.5 nM | 10–100 μL | No | Yes | Yes | >US$300,000 | Yes | | |
NMR spectroscopy | 5 μM | 10–100 μL | No | Yes | Yes | >US$1 million | Yes | | |
The data generated in metabolomics usually consist of measurements performed on subjects under various conditions. These measurements may be digitized spectra or a list of metabolite features. In its simplest form, this generates a matrix with rows corresponding to subjects and columns corresponding to metabolite features (or vice versa). [8] Several statistical programs are currently available for analysis of both NMR and mass spectrometry data. A large number of free software packages for the analysis of metabolomics data are already available, as shown in the table. Some statistical tools listed in the table that were designed for NMR data analysis are also useful for MS data. [62] For mass spectrometry data, software is available that identifies molecules that vary between subject groups on the basis of mass-over-charge value and, depending on the experimental design, sometimes retention time. [63]
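The subjects-by-features layout can be illustrated with a short sketch that assembles per-sample feature lists into a single matrix. The sample names, the feature identifiers (which here encode an m/z value and a retention time) and the intensities are all hypothetical, and filling undetected features with 0.0 is a simplifying assumption; real pipelines usually treat missing values more carefully.

```python
def build_feature_matrix(samples, missing=0.0):
    """Arrange per-sample feature intensities into a subjects-by-features matrix.

    samples: dict mapping sample name -> dict mapping feature id -> intensity.
    Returns (row names, column names, matrix as a list of rows).
    """
    features = sorted({f for feats in samples.values() for f in feats})
    names = sorted(samples)
    matrix = [[samples[n].get(f, missing) for f in features] for n in names]
    return names, features, matrix


# Hypothetical feature lists; ids combine m/z (M) and retention time in seconds (T).
samples = {
    "subject_1": {"M181T312": 1200.0, "M90T95": 300.0},
    "subject_2": {"M181T312": 900.0, "M147T210": 50.0},
}

names, features, matrix = build_feature_matrix(samples)
print(features)
for name, row in zip(names, matrix):
    print(name, row)
```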
Once the metabolite data matrix is determined, unsupervised data reduction techniques (e.g. PCA) can be used to elucidate patterns and connections. In many studies, including those evaluating drug toxicity and some disease models, the metabolites of interest are not known a priori. This makes unsupervised methods, which make no prior assumptions about class membership, a popular first choice. The most common of these methods is principal component analysis (PCA), which can efficiently reduce the dimensions of a dataset to the few that explain the greatest variation. [37] PCA algorithms aim to replace all correlated variables with a much smaller number of uncorrelated variables, referred to as principal components (PCs), while retaining most of the information in the original dataset. [64] When samples are analyzed in the lower-dimensional PCA space, clustering of samples with similar metabolic fingerprints can be detected. This clustering can elucidate patterns and assist in the determination of disease biomarkers, the metabolites that correlate most with class membership.
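The dimension reduction performed by PCA can be sketched with a singular value decomposition of the mean-centered data matrix. This is a minimal illustration using NumPy, not the procedure of any specific metabolomics package; the intensity values and the two sample groups are invented, and only mean-centering is applied, whereas practical analyses often also scale each feature.

```python
import numpy as np


def pca(X, n_components=2):
    """PCA via SVD of the mean-centered matrix (rows = samples, columns = features)."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                  # center each feature
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    scores = U[:, :n_components] * S[:n_components]  # sample coordinates on the PCs
    loadings = Vt[:n_components]             # feature weights of each PC
    explained = (S ** 2) / np.sum(S ** 2)    # fraction of variance per PC
    return scores, loadings, explained[:n_components]


# Hypothetical intensities: 3 control samples followed by 3 treated samples.
X = [
    [10.0, 5.0, 1.0, 3.0],
    [11.0, 5.2, 0.9, 3.1],
    [9.5, 4.9, 1.1, 2.9],
    [20.0, 5.1, 4.0, 3.0],
    [21.0, 5.0, 4.2, 3.2],
    [19.0, 4.8, 3.9, 2.8],
]

scores, loadings, explained = pca(X, n_components=2)
print("PC1 scores:", np.round(scores[:, 0], 2))
print("explained variance ratio:", np.round(explained, 3))
```

In this toy dataset the two groups fall on opposite sides of the first principal component, and the features with the largest absolute loadings on that component are the candidates that distinguish the groups.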
Linear models are commonly used for metabolomics data, but are affected by multicollinearity. Multivariate statistical methods, by contrast, are well suited to high-dimensional, correlated metabolomics data; the most popular are projection to latent structures (PLS) regression and its classification version, PLS-DA. Other data mining methods, such as random forests and support-vector machines, are receiving increasing attention for untargeted metabolomics data analysis. [65] In the case of univariate methods, variables are analyzed one by one using classical statistical tools (such as Student's t-test, ANOVA or mixed models) and only those with sufficiently small p-values are considered relevant. [36] However, correction strategies should be used to reduce false discoveries when multiple comparisons are conducted, since there is no standard method for measuring the total amount of metabolites directly in untargeted metabolomics. [66] For multivariate analysis, models should always be validated to ensure that the results can be generalized.
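One widely used correction strategy for multiple comparisons is the Benjamini-Hochberg procedure, which controls the false discovery rate. The text above does not name a specific method, so the sketch below is offered only as one possible choice; the p-values are invented and would in practice come from per-feature tests such as the t-test.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonic adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values, one per metabolite feature.
raw = [0.001, 0.008, 0.039, 0.041, 0.6]
adjusted = benjamini_hochberg(raw)
significant = [i for i, q in enumerate(adjusted) if q < 0.05]
print([round(q, 5) for q in adjusted])
print("features passing FDR < 0.05:", significant)
```

Here four features have raw p-values below 0.05, but only two remain below that threshold after adjustment, which illustrates how uncorrected testing across many features inflates the number of apparent discoveries.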
Machine learning is a powerful tool that can be used in metabolomics analysis. Recently, scientists have developed retention time prediction software. These tools allow researchers to apply artificial intelligence to the retention time prediction of small molecules in complex mixtures, such as human plasma, plant extracts, foods, or microbial cultures. Retention time prediction increases the identification rate in liquid chromatography and can lead to an improved biological interpretation of metabolomics data. [67]
Toxicity assessment/toxicology by metabolic profiling (especially of urine or blood plasma samples) detects the physiological changes caused by toxic insult of a chemical (or mixture of chemicals). In many cases, the observed changes can be related to specific syndromes, e.g. a specific lesion in liver or kidney. This is of particular relevance to pharmaceutical companies wanting to test the toxicity of potential drug candidates: if a compound can be eliminated before it reaches clinical trials on the grounds of adverse toxicity, it saves the enormous expense of the trials. [48]
For functional genomics, metabolomics can be an excellent tool for determining the phenotype caused by a genetic manipulation, such as gene deletion or insertion. Sometimes this can be a sufficient goal in itself—for instance, to detect any phenotypic changes in a genetically modified plant intended for human or animal consumption. More exciting is the prospect of predicting the function of unknown genes by comparison with the metabolic perturbations caused by deletion/insertion of known genes. Such advances are most likely to come from model organisms such as Saccharomyces cerevisiae and Arabidopsis thaliana . The Cravatt laboratory at the Scripps Research Institute has recently applied this technology to mammalian systems, identifying the N-acyltaurines as previously uncharacterized endogenous substrates for the enzyme fatty acid amide hydrolase (FAAH) and the monoalkylglycerol ethers (MAGEs) as endogenous substrates for the uncharacterized hydrolase KIAA1363. [68] [69]
Metabologenomics is a novel approach to integrate metabolomics and genomics data by correlating microbial-exported metabolites with predicted biosynthetic genes. [70] This bioinformatics-based pairing method enables natural product discovery at a larger scale by refining non-targeted metabolomic analyses to identify small molecules with related biosynthesis and to focus on those whose structures may not previously have been well known.
Fluxomics is a further development of metabolomics. The disadvantage of metabolomics is that it only provides the user with abundances or concentrations of metabolites, while fluxomics determines the reaction rates of metabolic reactions and can trace metabolites in a biological system over time.
Nutrigenomics is a generalised term which links genomics, transcriptomics, proteomics and metabolomics to human nutrition. In general, in a given body fluid, a metabolome is influenced by endogenous factors such as age, sex, body composition and genetics as well as underlying pathologies. The large bowel microflora are also a very significant potential confounder of metabolic profiles and could be classified as either an endogenous or exogenous factor. The main exogenous factors are diet and drugs. Diet can then be broken down to nutrients and non-nutrients. Metabolomics is one means to determine a biological endpoint, or metabolic fingerprint, which reflects the balance of all these forces on an individual's metabolism. [71] [72] Thanks to recent cost reductions, metabolomics has now become accessible for companion animals, such as pregnant dogs. [73] [74]
Plant metabolomics is designed to study the overall changes in metabolites of plant samples and then conduct deep data mining and chemometric analysis. Specialized metabolites are considered components of plant defense systems biosynthesized in response to biotic and abiotic stresses. [75] Metabolomics approaches have recently been used to assess the natural variance in metabolite content between individual plants, an approach with great potential for the improvement of the compositional quality of crops. [76]
Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a mass spectrum, a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is used in many different fields and is applied to pure samples as well as complex mixtures.
Electron ionization is an ionization method in which energetic electrons interact with solid or gas phase atoms or molecules to produce ions. EI was one of the first ionization techniques developed for mass spectrometry. However, this method is still a popular ionization technique. This technique is considered a hard ionization method, since it uses highly energetic electrons to produce ions. This leads to extensive fragmentation, which can be helpful for structure determination of unknown compounds. EI is the most useful for organic compounds which have a molecular weight below 600 amu. Also, several other thermally stable and volatile compounds in solid, liquid and gas states can be detected with the use of this technique when coupled with various separation methods.
Gas chromatography–mass spectrometry (GC–MS) is an analytical method that combines the features of gas-chromatography and mass spectrometry to identify different substances within a test sample. Applications of GC–MS include drug detection, fire investigation, environmental analysis, explosives investigation, food and flavor analysis, and identification of unknown samples, including that of material samples obtained from planet Mars during probe missions as early as the 1970s. GC–MS can also be used in airport security to detect substances in luggage or on human beings. Additionally, it can identify trace elements in materials that were previously thought to have disintegrated beyond identification. Like liquid chromatography–mass spectrometry, it allows analysis and detection even of tiny amounts of a substance.
Lipidomics is the large-scale study of pathways and networks of cellular lipids in biological systems. The word "lipidome" is used to describe the complete lipid profile within a cell, tissue, organism, or ecosystem and is a subset of the "metabolome" which also includes other major classes of biological molecules. Lipidomics is a relatively recent research field that has been driven by rapid advances in technologies such as mass spectrometry (MS), nuclear magnetic resonance (NMR) spectroscopy, fluorescence spectroscopy, dual polarisation interferometry and computational methods, coupled with the recognition of the role of lipids in many metabolic diseases such as obesity, atherosclerosis, stroke, hypertension and diabetes. This rapidly expanding field complements the huge progress made in genomics and proteomics, all of which constitute the family of systems biology.
The metabolome refers to the complete set of small-molecule chemicals found within a biological sample. The biological sample can be a cell, a cellular organelle, an organ, a tissue, a tissue extract, a biofluid or an entire organism. The small molecule chemicals found in a given metabolome may include both endogenous metabolites that are naturally produced by an organism as well as exogenous chemicals that are not naturally produced by an organism.
Liquid chromatography–mass spectrometry (LC–MS) is an analytical chemistry technique that combines the physical separation capabilities of liquid chromatography with the mass analysis capabilities of mass spectrometry (MS). Coupled chromatography – MS systems are popular in chemical analysis because the individual capabilities of each technique are enhanced synergistically. While liquid chromatography separates mixtures with multiple components, mass spectrometry provides spectral information that may help to identify each separated component. MS is not only sensitive, but provides selective detection, relieving the need for complete chromatographic separation. LC–MS is also appropriate for metabolomics because of its good coverage of a wide range of chemicals. This tandem technique can be used to analyze biochemical, organic, and inorganic compounds commonly found in complex samples of environmental and biological origin. Therefore, LC–MS may be applied in a wide range of sectors including biotechnology, environment monitoring, food processing, and pharmaceutical, agrochemical, and cosmetic industries. Since the early 2000s, LC–MS has also begun to be used in clinical applications.
Atmospheric pressure chemical ionization (APCI) is an ionization method used in mass spectrometry which utilizes gas-phase ion-molecule reactions at atmospheric pressure (105 Pa), commonly coupled with high-performance liquid chromatography (HPLC). APCI is a soft ionization method similar to chemical ionization where primary ions are produced on a solvent spray. The main usage of APCI is for polar and relatively less polar thermally stable compounds with molecular weight less than 1500 Da. The application of APCI with HPLC has gained a large popularity in trace analysis detection such as steroids, pesticides and also in pharmacology for drug metabolites.
Sample preparation for mass spectrometry is used for the optimization of a sample for analysis in a mass spectrometer (MS). Each ionization method has certain factors that must be considered for that method to be successful, such as volume, concentration, sample phase, and composition of the analyte solution. Quite possibly the most important consideration in sample preparation is knowing what phase the sample must be in for analysis to be successful. In some cases the analyte itself must be purified before entering the ion source. In other situations, the matrix, or everything in the solution surrounding the analyte, is the most important factor to consider and adjust. Often, sample preparation itself for mass spectrometry can be avoided by coupling mass spectrometry to a chromatography method, or some other form of separation before entering the mass spectrometer. In some cases, the analyte itself must be adjusted so that analysis is possible, such as in protein mass spectrometry, where usually the protein of interest is cleaved into peptides before analysis, either by in-gel digestion or by proteolysis in solution.
Two-dimensional chromatography is a type of chromatographic technique in which the injected sample is separated by passing through two different separation stages. Two different chromatographic columns are connected in sequence, and the effluent from the first system is transferred onto the second column. Typically the second column has a different separation mechanism, so that bands that are poorly resolved from the first column may be completely separated in the second column. Alternatively, the two columns might run at different temperatures. The second stage of separation must run faster than the first, since there is still only a single detector. In planar chromatography, the flat surface is amenable to sequential development in two directions using two different solvents.
Capillary electrophoresis–mass spectrometry (CE–MS) is an analytical chemistry technique formed by the combination of the liquid separation process of capillary electrophoresis with mass spectrometry. CE–MS combines advantages of both CE and MS to provide high separation efficiency and molecular mass information in a single analysis. It has high resolving power and sensitivity, requires minimal sample volume and can analyze at high speed. Ions are typically formed by electrospray ionization, but they can also be formed by matrix-assisted laser desorption/ionization or other ionization techniques. It has applications in basic research in proteomics and quantitative analysis of biomolecules as well as in clinical medicine. Since its introduction in 1987, new developments and applications have made CE–MS a powerful separation and identification technique. Use of CE–MS has increased for the analysis of proteins, peptides, and other biomolecules. However, the development of online CE–MS is not without challenges. An understanding of CE, the interface setup, the ionization technique and the mass detection system is important for tackling problems that arise when coupling capillary electrophoresis to mass spectrometry.
Instrumental analysis is a field of analytical chemistry that investigates analytes using scientific instruments.
The METLIN Metabolite and Chemical Entity Database is the largest repository of experimental tandem mass spectrometry and neutral loss data acquired from standards. The tandem mass spectrometry data on over 930,000 molecular standards is provided to facilitate the identification of chemical entities from tandem mass spectrometry experiments. In addition to the identification of known molecules, it is also useful for identifying unknowns using its similarity searching technology. All tandem mass spectrometry data comes from the experimental analysis of standards at multiple collision energies and in both positive and negative ionization modes.
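The idea of matching an experimental tandem mass spectrum against library spectra by similarity can be illustrated with a generic cosine score. The following is a minimal sketch under stated assumptions, not METLIN's actual search algorithm, which is not described here: spectra are represented as lists of (m/z, intensity) pairs, peaks are grouped into fixed-width m/z bins, and the function names, the bin width, and the example spectra are all hypothetical.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=1.0):
    """Sum the intensities of (m/z, intensity) peaks that fall in the same m/z bin."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz // bin_width)] += intensity
    return dict(binned)


def cosine_similarity(peaks_a, peaks_b, bin_width=1.0):
    """Cosine of the angle between two binned spectra (0 = no shared bins, 1 = same shape)."""
    a = bin_spectrum(peaks_a, bin_width)
    b = bin_spectrum(peaks_b, bin_width)
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty spectrum matches nothing
    return dot / (norm_a * norm_b)


def rank_candidates(query, library, bin_width=1.0):
    """Return (name, score) pairs for all library spectra, best match first."""
    scores = [(name, cosine_similarity(query, peaks, bin_width))
              for name, peaks in library.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)


# Hypothetical spectra, invented for illustration only.
query = [(85.03, 40.0), (127.04, 100.0), (145.05, 20.0)]
library = {
    "candidate_a": [(85.02, 35.0), (127.05, 100.0), (145.06, 25.0)],
    "candidate_b": [(91.05, 100.0), (119.08, 60.0)],
}

for name, score in rank_candidates(query, library):
    print(f"{name}: {score:.3f}")
```

A fixed bin width is the simplest way to tolerate small m/z differences between instruments; real search tools typically use a mass tolerance and may weight peaks by m/z or intensity, which this sketch leaves out.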
Surface-assisted laser desorption/ionization (SALDI) is a soft laser desorption technique used for mass spectrometry analysis of biomolecules, polymers, and small organic molecules. In its first embodiment Koichi Tanaka used a cobalt/glycerol liquid matrix, and subsequent applications included a graphite/glycerol liquid matrix as well as a solid surface of porous silicon. The porous silicon represents the first matrix-free SALDI surface analysis, allowing for facile detection of intact molecular ions. These porous silicon surfaces also facilitated the analysis of small molecules at the yoctomole level. At present, laser desorption/ionization methods using other inorganic matrices such as nanomaterials are often regarded as SALDI variants. As an example, silicon nanowires as well as titania nanotube arrays (NTA) have been used as substrates to detect small molecules. SALDI is used to detect proteins and protein-protein complexes. A related method named "ambient SALDI", which combines conventional SALDI with ambient mass spectrometry incorporating the direct analysis in real time (DART) ion source, has also been demonstrated. SALDI is considered one of the most important techniques in MS and has many applications.
Pharmacometabolomics, also known as pharmacometabonomics, is a field which stems from metabolomics, the quantification and analysis of metabolites produced by the body. It refers to the direct measurement of metabolites in an individual's bodily fluids, in order to predict or evaluate the metabolism of pharmaceutical compounds, and to better understand the pharmacokinetic profile of a drug. Alternatively, pharmacometabolomics can be applied to measure metabolite levels following the administration of a pharmaceutical compound, in order to monitor the effects of the compound on certain metabolic pathways (pharmacodynamics). This provides detailed mapping of drug effects on metabolism and the pathways that are implicated in the mechanism of variation of response to treatment. In addition, the metabolic profile of an individual at baseline (metabotype) provides information about how individuals respond to treatment and highlights heterogeneity within a disease state. All three approaches require the quantification of metabolites found in bodily fluids and tissue, such as blood or urine, and can be used in the assessment of pharmaceutical treatment options for numerous disease states.
The Golm Metabolome Database (GMD) is a gas chromatography (GC) – mass spectrometry (MS) reference library dedicated to metabolite profiling experiments. It comprises mass spectral and retention index (RI) information for non-annotated mass spectral tags, together with data on a multitude of already identified metabolites and reference substances. The GMD is hosted at the Max Planck Institute of Molecular Plant Physiology in the Golm district of Potsdam, Germany.
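A retention index expresses where a compound elutes relative to a series of reference compounds rather than as a raw retention time. As a minimal sketch, the linear (van den Dool and Kratz) retention index used for temperature-programmed GC interpolates between the two n-alkanes that bracket the analyte. The procedure the GMD itself uses is not described here, so the function name and the example retention times below are assumptions for illustration.

```python
def linear_retention_index(t_analyte, alkane_times):
    """Linear retention index of an analyte from bracketing n-alkane retention times.

    alkane_times maps an n-alkane carbon number to its retention time,
    in the same units as t_analyte.
    """
    carbons = sorted(alkane_times)
    for n, n_next in zip(carbons, carbons[1:]):
        t_n, t_next = alkane_times[n], alkane_times[n_next]
        if t_n <= t_analyte <= t_next:
            # Interpolate linearly between the bracketing alkanes, 100 index units per carbon.
            fraction = (t_analyte - t_n) / (t_next - t_n)
            return 100 * (n + (n_next - n) * fraction)
    raise ValueError("analyte retention time is outside the alkane ladder")


# Hypothetical alkane ladder: decane, undecane, dodecane (times in minutes).
alkanes = {10: 5.0, 11: 6.0, 12: 7.5}
print(linear_retention_index(5.5, alkanes))
```

Because the index is anchored to reference compounds run under the same conditions, it is more comparable between instruments and laboratories than the retention time alone, which is why it is useful alongside the mass spectrum in a reference library.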
In the field of cellular biology, single-cell analysis and subcellular analysis is the study of genomics, transcriptomics, proteomics, metabolomics and cell–cell interactions at the single-cell level. The concept of single-cell analysis originated in the 1970s. Before the discovery of heterogeneity, single-cell analysis mainly referred to the analysis or manipulation of an individual cell in a bulk population of cells under a particular condition using an optical or electron microscope. Because of the heterogeneity seen in both eukaryotic and prokaryotic cell populations, analyzing a single cell makes it possible to discover mechanisms not seen when studying a bulk population of cells. Technologies such as fluorescence-activated cell sorting (FACS) allow the precise isolation of selected single cells from complex samples, while high-throughput single-cell partitioning technologies enable the simultaneous molecular analysis of hundreds or thousands of single unsorted cells; this is particularly useful for the analysis of transcriptome variation in genotypically identical cells, allowing the definition of otherwise undetectable cell subtypes. The development of new technologies is increasing our ability to analyze the genome and transcriptome of single cells, as well as to quantify their proteome and metabolome. Mass spectrometry techniques have become important analytical tools for proteomic and metabolomic analysis of single cells. Recent advances have enabled the quantification of thousands of proteins across hundreds of single cells, and thus make possible new types of analysis. In situ sequencing and fluorescence in situ hybridization (FISH) do not require that cells be isolated and are increasingly being used for analysis of tissues.
The Yeast Metabolome Database (YMDB) is a comprehensive, high-quality, freely accessible, online database of small-molecule metabolites found in or produced by Saccharomyces cerevisiae. The YMDB was designed to facilitate yeast metabolomics research, specifically in the areas of general fermentation as well as wine, beer and fermented food analysis. YMDB supports the identification and characterization of yeast metabolites using NMR spectroscopy, GC-MS and liquid chromatography–mass spectrometry (LC–MS). The YMDB contains two kinds of data: 1) chemical data and 2) molecular biology/biochemistry data. The chemical data includes 2027 metabolite structures with detailed metabolite descriptions along with nearly 4000 NMR, GC-MS and LC–MS spectra.
Exometabolomics, also known as 'metabolic footprinting', is the study of extracellular metabolites and is a sub-field of metabolomics.
Gary Siuzdak is an American chemist best known for his work in the fields of metabolomics, activity metabolomics, and mass spectrometry. His lab discovered indole-3-propionic acid as a gut-bacteria-derived metabolite in 2009. He is currently Professor and Director of The Center for Metabolomics and Mass Spectrometry at Scripps Research in La Jolla, California. Siuzdak has also made contributions to virus analysis and viral structural dynamics, and has developed mass spectrometry imaging technology using nanostructured surfaces. The Siuzdak lab is also responsible for creating the research tools eXtensible Computational Mass Spectrometry (XCMS), METLIN, METLIN Neutral Loss and Q-MRM. As of January 2021, the XCMS/METLIN platform has over 50,000 registered users.
XCMS Online is a cloud version of the original eXtensible Computational Mass Spectrometry (XCMS) technology, created by the Siuzdak Lab at Scripps Research. XCMS introduced the concept of nonlinear retention time alignment, which allowed for the statistical assessment of the detected peaks across LC-MS and GC-MS datasets. XCMS Online was designed to facilitate XCMS analyses through a cloud portal and as a more straightforward way to analyze, visualize and share untargeted metabolomic data. In addition, the combination of XCMS and METLIN allows for the identification of known molecules using METLIN's tandem mass spectrometry data, and enables the identification of unknowns via similarity searching of tandem mass spectrometry data. XCMS Online has also become a systems biology tool for integrating different omic data sets. As of January 2021, the XCMS Online–METLIN platform has over 44,000 registered users. XCMS–METLIN was recognized in 2023 as the year's top analytical innovation.
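To give a sense of what nonlinear retention time alignment means, the sketch below maps retention times from one run onto a reference run using a piecewise-linear warp through anchor peaks found in both runs. This is an illustrative simplification and not the XCMS algorithm, whose details are not given here; the function name, the choice of a piecewise-linear warp, and the anchor values are all assumptions.

```python
from bisect import bisect_right


def warp_retention_time(rt, anchors):
    """Map a retention time from a sample run onto the reference run's time axis.

    anchors is a list of (rt_in_sample, rt_in_reference) pairs for peaks matched
    in both runs, sorted by strictly increasing rt_in_sample.
    """
    xs = [sample_rt for sample_rt, _ in anchors]
    ys = [reference_rt for _, reference_rt in anchors]
    # Outside the anchored range, apply the shift of the nearest anchor.
    if rt <= xs[0]:
        return rt + (ys[0] - xs[0])
    if rt >= xs[-1]:
        return rt + (ys[-1] - xs[-1])
    # Inside the range, interpolate between the two neighbouring anchors.
    i = bisect_right(xs, rt) - 1
    fraction = (rt - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + fraction * (ys[i + 1] - ys[i])


# Hypothetical anchor peaks (seconds): the drift differs along the run,
# so no single constant shift would align both ends.
anchors = [(60.0, 62.0), (120.0, 121.0), (180.0, 185.0)]
for rt in (30.0, 90.0, 150.0, 200.0):
    print(rt, "->", warp_retention_time(rt, anchors))
```

The point of a nonlinear correction is that the retention time drift between runs varies over the chromatogram; once peaks share a common time axis, their intensities can be compared statistically across samples.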