Zooarchaeology by mass spectrometry, commonly referred to by the abbreviation ZooMS, is a scientific method that identifies animal species by means of characteristic peptide sequences in the protein collagen. ZooMS is the most common archaeological application of peptide mass fingerprinting (PMF) and can be used for species identification of bones, teeth, skin and antler. It is commonly used to identify objects that cannot be identified morphologically. In an archaeological context this usually means that the object is too fragmented or that it has been shaped into an artefact. Archaeologists use these species identification to study among others past environments, diet and raw material selection for the production of tools.
ZooMS was first published in 2009 [1] by a team of researchers from the University of York, but the term was coined later in a publication in 2010. [2] The original aim of ZooMS was to distinguish between sheep and goat. The bones of these two closely related species are difficult to distinguish, especially when fragmented, yet the difference between these two common domesticates is very important for our understanding of past husbandry practices.
Most of the method development following the initial publication of ZooMS has focused on the extraction of collagen from the archaeological material. In the original protocol acid was used to dissolve the bone’s mineral matrix and free up the collagen. In 2011 an alternative extraction method was published that used an ammonium bicarbonate buffer to solubilise the collagen without dissolving the mineral matrix. [3] In contrast to the acid protocol, the ammonium bicarbonate protocol does not affect the size and mass of the sample, making it a much less destructive method compared to the original protocol. In fact, the ammonium bicarbonate protocol was proposed as a non-destructive protocol for ZooMS, but in practice destructive samples are still taken for this protocol (see [4] ). Submerging a sample in ammonium bicarbonate does chemically alter the sample, which is why current practices continue to take a destructive sample.
Although the ammonium bicarbonate protocol should not be considered a non-destructive method, it was followed by more ‘true’ non-destructive methods. The first of these was the eraser protocol, first tested on parchment, [5] but later also applied to bone. [6] The eraser protocol is performed by rubbing a PVC eraser on a piece of parchment or bone. The friction generates triboelectric forces, which causes small particles of the sample to cling to the eraser waste. From the eraser waste collagen can then be extracted and analysed. The eraser protocol was found to work relatively well for parchment, but it is less effective on bone. Additionally, it leaves microscopic traces on the bone surface, which appear very similar to use wear traces and could be an issue for use wear analysis. [6]
A second non-destructive protocol is the plastic bag protocol, first published in 2019. [7] It is based on the idea that the normal friction between an object and the plastic bags, commonly used for storing archaeological objects, might be sufficient to extract enough material for ZooMS analysis.
A third protocol uses the same triboelectric principle. However, instead of using an eraser, this microgrid protocol employs a fine polishing film to remove very small amounts of material from a sample. [8]
The last non-destructive protocol that has been published for ZooMS is the membrane box protocol. [9] The membrane box protocol is based on contact electrification, which is the generation of electrostatic forces due to small localised differences in charge between two objects. These electrostatic forces can be large enough for material transfer between two surfaces. [10]
Most of these protocols have only been published recently and their respective advantages and disadvantages have not yet been tested against each other. It is therefore not yet clear how reliable these methods are and what level of preservation of the samples is required for them to work.
Apart from non-destructive sampling, a second area of method development has been the expansion of reference biomarkers. To identify a species using ZooMS, a set of diagnostic biomarkers is used. These biomarkers correspond to particular fragments of the species’ collagen protein. The set of known biomarkers at the time of ZooMS’ original publication was relatively limited, but recent publications have been expanding this list. A regularly updated list of published biomarkers is maintained by the University of York and can be found here.
ZooMS identifies species based on differences in the amino acid composition of the collagen protein. The amino acid sequence of a species’ collagen protein is determined by its DNA and as a result like DNA, the amino acid sequence reflects a species’ evolutionary history. The greater the evolutionary distance between two species, the more different their collagen proteins will be. ZooMS typically can identify a sample up to genus level, though in some cases the identification can be more or less specific. A good understanding of the archaeological context of the sample can be used to further refine the resolution of the species identification.
A ZooMS protocol (Fig. 1) typically consists of an extraction, denaturation, digestion and filtration step, followed by mass spectrometric analysis. Various destructive and non-destructive extraction protocols have already been discussed in some detail above. The key is to extract the protein preserved in the sample and then bring it into solution, usually an ammonium bicarbonate buffer. Denaturation is done to unfold the proteins and make them more accessible for the enzymatic digestion. It is done by heating the solubilised sample at around 65 °C. [3] Then an enzyme, trypsin, is added to the solution. Trypsin cleaves the protein after every arginine or lysine amino acid in its sequence, resulting in peptide fragments of predictable masses. After digestion the sample is filtered with C18 filters to get rid of non-proteinaceous material and the sample is now ready for mass spectrometric analysis, which for ZooMS generally means MALDI-TOF MS.
Zooarchaeology or archaeozoology merges the disciplines of zoology and archaeology, focusing on the analysis of animal remains within archaeological sites. This field, managed by specialists known as zooarchaeologists or faunal analysts, examines remnants such as bones, shells, hair, chitin, scales, hides, and proteins, such as DNA, to derive insights into historical human-animal interactions and environmental conditions. While bones and shells tend to be relatively more preserved in archaeological contexts, the survival of faunal remains is generally infrequent. The degradation or fragmentation of faunal remains presents challenges in the accurate analysis and interpretation of data.
Proteomics is the large-scale study of proteins. Proteins are vital macromolecules of all living organisms, with many functions such as the formation of structural fibers of muscle tissue, enzymatic digestion of food, or synthesis and replication of DNA. In addition, other kinds of proteins include antibodies that protect an organism from infection, and hormones that send important signals throughout the body.
Tandem mass spectrometry, also known as MS/MS or MS2, is a technique in instrumental analysis where two or more stages of analysis using one or more mass analyzer are performed with an additional reaction step in between these analyses to increase their abilities to analyse chemical samples. A common use of tandem MS is the analysis of biomolecules, such as proteins and peptides.
Protein sequencing is the practical process of determining the amino acid sequence of all or part of a protein or peptide. This may serve to identify the protein or characterize its post-translational modifications. Typically, partial sequencing of a protein provides sufficient information to identify it with reference to databases of protein sequences derived from the conceptual translation of genes.
Peptide mass fingerprinting (PMF), also known as protein fingerprinting, is an analytical technique for protein identification in which the unknown protein of interest is first cleaved into smaller peptides, whose absolute masses can be accurately measured with a mass spectrometer such as MALDI-TOF or ESI-TOF. The method was developed in 1993 by several groups independently. The peptide masses are compared to either a database containing known protein sequences or even the genome. This is achieved by using computer programs that translate the known genome of the organism into proteins, then theoretically cut the proteins into peptides, and calculate the absolute masses of the peptides from each protein. They then compare the masses of the peptides of the unknown protein to the theoretical peptide masses of each protein encoded in the genome. The results are statistically analyzed to find the best match.
In mass spectrometry, matrix-assisted laser desorption/ionization (MALDI) is an ionization technique that uses a laser energy-absorbing matrix to create ions from large molecules with minimal fragmentation. It has been applied to the analysis of biomolecules and various organic molecules, which tend to be fragile and fragment when ionized by more conventional ionization methods. It is similar in character to electrospray ionization (ESI) in that both techniques are relatively soft ways of obtaining ions of large molecules in the gas phase, though MALDI typically produces far fewer multi-charged ions.
Protein methods are the techniques used to study proteins. There are experimental methods for studying proteins. Computational methods typically use computer programs to analyze proteins. However, many experimental methods require computational analysis of the raw data.
Deamidation is a chemical reaction in which an amide functional group in the side chain of the amino acids asparagine or glutamine is removed or converted to another functional group. Typically, asparagine is converted to aspartic acid or isoaspartic acid. Glutamine is converted to glutamic acid or pyroglutamic acid (5-oxoproline). In a protein or peptide, these reactions are important because they may alter its structure, stability or function and may lead to protein degradation. The net chemical change is the addition of a water group and removal of an ammonia group, which corresponds to a +1 (0.98402) Da mass increase. Although deamidation occurs on glutamine, glycosylated asparagine and other amides, these are negligible under typical proteolysis conditions.
Surface-enhanced laser desorption/ionization (SELDI) is a soft ionization method in mass spectrometry (MS) used for the analysis of protein mixtures. It is a variation of matrix-assisted laser desorption/ionization (MALDI). In MALDI, the sample is mixed with a matrix material and applied to a metal plate before irradiation by a laser, whereas in SELDI, proteins of interest in a sample become bound to a surface before MS analysis. The sample surface is a key component in the purification, desorption, and ionization of the sample. SELDI is typically used with time-of-flight (TOF) mass spectrometers and is used to detect proteins in tissue samples, blood, urine, or other clinical samples, however, SELDI technology can potentially be used in any application by simply modifying the sample surface.
Phosphoproteomics is a branch of proteomics that identifies, catalogs, and characterizes proteins containing a phosphate group as a posttranslational modification. Phosphorylation is a key reversible modification that regulates protein function, subcellular localization, complex formation, degradation of proteins and therefore cell signaling networks. With all of these modification results, it is estimated that between 30–65% of all proteins may be phosphorylated, some multiple times. Based on statistical estimates from many datasets, 230,000, 156,000 and 40,000 phosphorylation sites should exist in human, mouse, and yeast, respectively.
Thermospray is a soft ionization source by which a solvent flow of liquid sample passes through a very thin heated column to become a spray of fine liquid droplets. As a form of atmospheric pressure ionization in mass spectrometry these droplets are then ionized via a low-current discharge electrode to create a solvent ion plasma. A repeller then directs these charged particles through the skimmer and acceleration region to introduce the aerosolized sample to a mass spectrometer. It is particularly useful in liquid chromatography-mass spectrometry (LC-MS).
Mascot is a software search engine that uses mass spectrometry data to identify proteins from peptide sequence databases. Mascot is widely used by research facilities around the world. Mascot uses a probabilistic scoring algorithm for protein identification that was adapted from the MOWSE algorithm. Mascot is freely available to use on the website of Matrix Science. A license is required for in-house use where more features can be incorporated.
Protein mass spectrometry refers to the application of mass spectrometry to the study of proteins. Mass spectrometry is an important method for the accurate mass determination and characterization of proteins, and a variety of methods and instrumentations have been developed for its many uses. Its applications include the identification of proteins and their post-translational modifications, the elucidation of protein complexes, their subunits and functional interactions, as well as the global measurement of proteins in proteomics. It can also be used to localize proteins to the various organelles, and determine the interactions between different proteins as well as with membrane lipids.
Top-down proteomics is a method of protein identification that either uses an ion trapping mass spectrometer to store an isolated protein ion for mass measurement and tandem mass spectrometry (MS/MS) analysis or other protein purification methods such as two-dimensional gel electrophoresis in conjunction with MS/MS. Top-down proteomics is capable of identifying and quantitating unique proteoforms through the analysis of intact proteins. The name is derived from the similar approach to DNA sequencing. During mass spectrometry intact proteins are typically ionized by electrospray ionization and trapped in a Fourier transform ion cyclotron resonance, quadrupole ion trap or Orbitrap mass spectrometer. Fragmentation for tandem mass spectrometry is accomplished by electron-capture dissociation or electron-transfer dissociation. Effective fractionation is critical for sample handling before mass-spectrometry-based proteomics. Proteome analysis routinely involves digesting intact proteins followed by inferred protein identification using mass spectrometry (MS). Top-down MS (non-gel) proteomics interrogates protein structure through measurement of an intact mass followed by direct ion dissociation in the gas phase.
Quantitative proteomics is an analytical chemistry technique for determining the amount of proteins in a sample. The methods for protein identification are identical to those used in general proteomics, but include quantification as an additional dimension. Rather than just providing lists of proteins identified in a certain sample, quantitative proteomics yields information about the physiological differences between two biological samples. For example, this approach can be used to compare samples from healthy and diseased patients. Quantitative proteomics is mainly performed by two-dimensional gel electrophoresis (2-DE), preparative native PAGE, or mass spectrometry (MS). However, a recent developed method of quantitative dot blot (QDB) analysis is able to measure both the absolute and relative quantity of an individual proteins in the sample in high throughput format, thus open a new direction for proteomic research. In contrast to 2-DE, which requires MS for the downstream protein identification, MS technology can identify and quantify the changes.
The in-gel digestion step is a part of the sample preparation for the mass spectrometric identification of proteins in course of proteomic analysis. The method was introduced in 1992 by Rosenfeld. Innumerable modifications and improvements in the basic elements of the procedure remain.
Label-free quantification is a method in mass spectrometry that aims to determine the relative amount of proteins in two or more biological samples. Unlike other methods for protein quantification, label-free quantification does not use a stable isotope containing compound to chemically bind to and thus label the protein.
Matthew Collins, is a professor at the University of Copenhagen, formerly as a Niels Bohr professor, and also holds a McDonald Chair in Palaeoproteomics at the University of Cambridge.
Stable isotope standards and capture by anti-peptide antibodies (SISCAPA) is a mass spectrometry method for measuring the amount of a protein in a biological sample.
Ancient proteins are complex mixtures and the term palaeoproteomics is used to characterise the study of proteomes in the past. Ancients proteins have been recovered from a wide range of archaeological materials, including bones, teeth, eggshells, leathers, parchments, ceramics, painting binders and well-preserved soft tissues like gut intestines. These preserved proteins have provided valuable information about taxonomic identification, evolution history (phylogeny), diet, health, disease, technology and social dynamics in the past.