
THOMAS was the first online database of United States Congress legislative information. A project of the Library of Congress, it was launched in January 1995 at the inception of the 104th Congress and retired on July 5, 2016; it has been superseded by Congress.gov. [1]



The resource was a comprehensive, Internet-accessible source of information on the activities of Congress.

The database was named after Thomas Jefferson, the third President of the United States. [2] "THOMAS" was an acronym for "The House [of Representatives] Open Multimedia Access System". [3]

The website allowed users to share legislative information via several social networking sites, [4] and there were proposals for an application programming interface. [5]

Library of Congress Legislative Data Challenge

In July 2013, the Library of Congress created the Markup of US Legislation in Akoma Ntoso challenge, [6] which offered a $5,000 prize for representations of selected US bills in the most recent Akoma Ntoso standard, to be produced within a couple of months. [7] In September 2013, it launched the Legislative XML Data Mapping challenge, [8] which offered a $10,000 prize for a data map from US bill XML and UK bill XML to the most recent Akoma Ntoso schema, also to be produced within a couple of months. [9]
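As a rough illustration of what a data map between bill XML and an Akoma Ntoso-style vocabulary involves, the sketch below renames elements of a tiny bill fragment using a lookup table. The tag pairs in `TAG_MAP`, the `SAMPLE` fragment, and the helper names `map_element` and `convert` are all simplified assumptions made for this example. They are not the mappings submitted to the challenge, and a real conversion would also need to handle attributes, namespaces, and document metadata.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified element map (source tag -> target tag).
# Illustrative only; not the mapping produced by the challenge entrants.
TAG_MAP = {
    "legis-body": "body",
    "section": "section",
    "enum": "num",
    "header": "heading",
    "text": "content",
}


def map_element(src):
    """Recursively copy src, renaming tags per TAG_MAP.

    Unmapped tags keep their original name; attributes are dropped
    in this sketch to keep it short.
    """
    dst = ET.Element(TAG_MAP.get(src.tag, src.tag))
    dst.text = src.text
    dst.tail = src.tail
    for child in src:
        dst.append(map_element(child))
    return dst


# Made-up bill fragment used only to exercise the mapping.
SAMPLE = (
    "<legis-body><section><enum>1.</enum><header>Short title</header>"
    "<text>This Act may be cited as the Example Act.</text></section></legis-body>"
)


def convert(xml_string):
    """Parse a source fragment and return the renamed tree as a string."""
    root = ET.fromstring(xml_string)
    return ET.tostring(map_element(root), encoding="unicode")


if __name__ == "__main__":
    print(convert(SAMPLE))
```

A table-driven rename like this only covers one-to-one correspondences. Where the source and target schemas differ in structure, a data map would also have to describe how elements are split, merged, or nested differently.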


  1. "Frequently Asked Questions (FAQ): THOMAS Retirement". Library of Congress. Retrieved October 18, 2014.
  2. " to Retire July 5". News from the Library of Congress. The Library of Congress. April 28, 2016. Retrieved August 30, 2018.
  3. Vlietstra, J. (2001). Dictionary of Acronyms and Technical Abbreviations: For Information and Communication Technologies and Related Areas. Springer Science & Business Media. p. 624. ISBN 9781852333973.
  4. "Sharing THOMAS Content with the Share Tool". THOMAS. Library of Congress. Archived from the original on 2010-12-10.
  5. Zetter, Kim (March 5, 2009). "the database of United States Congress legislative information". Wired.
  6. "Markup of US Legislation in Akoma Ntoso". Archived from the original on 2013-08-25. Retrieved 2013-09-23.
  7. Gheen, Tina (July 16, 2013). "Library of Congress Announces First Legislative Data Challenge". Library of Congress.
  8. "Legislative XML Data Mapping". Legislative XML Data Mapping.
  9. Gheen, Tina (September 10, 2013). "Second Library of Congress Legislative Data Challenge Launched". Library of Congress.
  10. Gheen, Tina (December 19, 2013). "First Legislative Data Challenge Winner Announced". Library of Congress. Retrieved December 20, 2013.
  11. Gheen, Tina (February 25, 2014). "Jim Mangiafico and Garrett Schure Announced as Winners of the Second Library of Congress Legislative Data Challenge". Library of Congress. Retrieved February 25, 2014.