MICRO Relational Database Management System

The MICRO Relational Database Management System was the first large-scale set-theoretic database management system to be used in production. [1] Though MICRO was initially considered to be an "Information Management System", it was eventually recognized to provide all the capabilities of an RDBMS. MICRO's major underpinnings and algorithms were based on the Set-Theoretic Data Structure (STDS) model developed by D. L. Childs of the University of Michigan's CONCOMP (Conversational Use of Computers) Project. [2] [3] [4] MICRO featured a natural language interface which allowed non-programmers to use the system. [5] [6]

Implementation of MICRO began in 1970 as part of the Labor Market Information System (LMIS) project at the University of Michigan's Institute of Labor and Industrial Relations (ILIR). Dr. Malcolm S. Cohen was Director of the LMIS Project and was the principal innovator and designer of the original MICRO Retrieval System. [7] Carol Easthope and Jack Guskin were the principal programmers. D. L. Childs, Vice President of Set Theoretic Information Systems (STIS) Corporation, provided continuing guidance in the use of Set-Theoretic Data Structure (STDS) data access software for MICRO. Funding came from the Office of Manpower Administration within the U.S. Department of Labor. [5] MICRO was first used for the study of large social science databases referred to as micro data; hence the name. Organizations such as the U.S. Department of Labor, the U.S. Environmental Protection Agency, and researchers from the University of Alberta, the University of Michigan, Wayne State University, the University of Newcastle upon Tyne, and Durham University used MICRO to manage very large-scale databases until 1998.

MICRO runs under the Michigan Terminal System (MTS), the interactive time-sharing system developed at the University of Michigan that runs on IBM System/360 Model 67, System/370, and compatible mainframe computers. [8] MICRO provides a query language, a database directory, and a data dictionary to create an interface between the user and the very efficient proprietary Set-Theoretic Data Structure (STDS) software developed by the Set-Theoretic Information Systems Corporation (STIS) of Ann Arbor, Michigan. The lower-level routines from STIS treat the databases as sets and perform set operations on them, such as union, intersection, and restriction. Although the underlying STDS model is based on set theory, the MICRO user interface is similar to those subsequently used in relational database management systems. [2] [9] MICRO's data representation can be thought of as a matrix or table in which the rows represent different records or "cases", and the columns contain individual data items for each record; however, the actual data representation is in set-theoretic form. In labor market applications the rows typically represent job applicants or employees, and the columns represent fields such as age, sex, and income, or type of industry, number of employees, and payroll. [1]
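
The idea of treating a table as a set of records can be illustrated with a minimal Python sketch. This is not MICRO's query language or the STDS software, neither of which is reproduced here; the `Applicant` record, the sample data, and the `restrict` helper are all hypothetical names chosen for illustration. It only shows how union, intersection, and restriction behave when each row is an element of a set.

```python
from typing import Callable, FrozenSet, NamedTuple


class Applicant(NamedTuple):
    """One row ("case") of a hypothetical labor market table."""
    name: str
    age: int
    industry: str


Relation = FrozenSet[Applicant]


def restrict(relation: Relation, predicate: Callable[[Applicant], bool]) -> Relation:
    """Return the subset of rows for which the predicate holds."""
    return frozenset(row for row in relation if predicate(row))


# Two illustrative data sets that share one row.
january: Relation = frozenset({
    Applicant("Avery", 34, "auto"),
    Applicant("Blake", 51, "steel"),
})
february: Relation = frozenset({
    Applicant("Blake", 51, "steel"),
    Applicant("Casey", 28, "auto"),
})

everyone = january | february        # union: rows present in either set
returning = january & february       # intersection: rows present in both sets
auto_only = restrict(everyone, lambda row: row.industry == "auto")
```

Because the rows are set elements, the duplicate record shared by the two data sets appears only once in the union, which is the behavior a tabular view would otherwise have to enforce separately.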

MICRO permits users with little programming experience to define, enter, interrogate, manipulate, and update collections of data in a relatively unstructured and unconstrained environment. [5] An interactive system, MICRO is powerful in terms of the complexity of requests which can be made by users without prior programming language experience. [10] MICRO includes basic statistical computations such as mean, variance, frequency, and median. If more rigorous statistical analysis is desired, the data from a MICRO database can be exported to the Michigan Interactive Data Analysis System (MIDAS), [11] a statistical analysis package available under the Michigan Terminal System. [12]
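
As a rough illustration of the kind of summary statistics listed above, the following Python sketch computes mean, variance, median, and frequency over columns of a small table. The records and field names are invented for the example, and the use of the sample variance is an assumption; the source does not say which variance definition MICRO used or how its commands were written.

```python
from collections import Counter
from statistics import mean, median, variance

# Hypothetical records; each dict is one row, each key one column.
records = [
    {"age": 30, "sex": "F", "income": 42000},
    {"age": 40, "sex": "M", "income": 51000},
    {"age": 50, "sex": "F", "income": 63000},
]

# Pull one column out of the table, then summarize it.
ages = [row["age"] for row in records]
age_summary = {
    "mean": mean(ages),
    "variance": variance(ages),  # sample variance (n - 1 denominator), an assumption
    "median": median(ages),
}

# Frequency: how many rows take each value of a categorical column.
sex_frequency = Counter(row["sex"] for row in records)
```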

References

  1. "A set theoretic data structure and retrieval language", William R. Hershey and Carol H. Easthope, Paper from the Session on Data Structures, Spring Joint Computer Conference, May 1972, in ACM SIGIR Forum, Volume 7, Issue 4 (December 1972), pp. 45-55, doi:10.1145/1095495.1095500
  2. "Sets, Data Models and Data Independence", Ken North, a Dr. Dobb's blogger, March 10, 2010
  3. Description of a set-theoretic data structure, D. L. Childs, 1968, Technical Report 3 of the CONCOMP (Research in Conversational Use of Computers) Project, University of Michigan, Ann Arbor, Michigan, USA
  4. Feasibility of a Set-Theoretic Data Structure : A General Structure Based on a Reconstituted Definition of Relation, D. L. Childs, 1968, Technical Report 6 of the CONCOMP (Research in Conversational Use of Computers) Project, University of Michigan, Ann Arbor, Michigan, USA
  5. MICRO Information Management System (Version 5.0) Reference Manual, M.A. Kahn, D.L. Rumelhart, and B.L. Bronson, October 1977, Institute of Labor and Industrial Relations (ILIR), University of Michigan and Wayne State University
  6. MICRO: A Relational Database Management System, Harry F. Clark, David E. Hetrick, Robert C. Bressan, July 1992, Institute of Labor and Industrial Relations (ILIR), University of Michigan, 451 pages, ISBN 9780877363507
  7. Feasibility of a Labor Market Information System, Volume 3, Final Report for Period July 1, 1970-June 30, 1974, Malcolm S. Cohen, Labor Market Information System (LMIS) Project, Institute of Labor and Industrial Relations, University of Michigan, June 1974, 76 pages
  8. "Chapter 6: MICRO" in Introduction to database management systems on MTS, Rick Rilio, User Guide Series, Computing Center, University of Michigan, March 1986, pages 147-189
  9. "A Relational Model of Data for Large Shared Data Banks", E.F. Codd, Communications of the ACM, volume 13, issue 6 (June 1970), pp. 377–387, doi:10.1145/362384.362685
  10. "Use of a Relational Database to Support Clinical Research: Application in a Diabetes Program", Diane Lomatch, M.P.H., Terry Truax, M.S., Peter Savage, M.D., Diabetes Center Unit, MDRTC, University of Michigan, 1981
  11. Documentation for MIDAS, Daniel J. Fox and Kenneth E. Guire, Third Edition (September 1976), Statistical Research Laboratory, University of Michigan, 203 pp.
  12. "Converting from Traditional File Structures to Database Management Systems: A Powerful Tool for Nursing Management", Yvonne Marie Abdoo, Ph.D., R.N, Wayne State University College of Nursing, 1987