Document engineering

Last updated

Document engineering is a computer science discipline. It focuses on the tools, processes and systems that improve the ability to create, manage, and maintain documents digitally. [1]

In the context of document engineering, document may refer to any kind of document, but most often refers to documents somehow encoded in digital forms, and to ordered pieces of information extracted or derived from documents, typically to be used by computer applications or web services rather than directly by people. The structure of information both within and as extracted from documents, has particular relevance in the areas of XML and SQL schema design.

From 2003 to 2007, the University of California, Berkeley operated a research center for document engineering, which has been subsumed by its program in Information and Service Design. [2]

The conventional discipline that most resembles document engineering is probably business informatics. However, document engineering emphasizes the need for conceptual modeling of documents and processes at an implementable granularity, and so involves linguistics, ontology, database theory, and a wide range of other disciplines.

The ACM Symposium on Document Engineering [3] is a yearly conference of computer scientists interested in document engineering and related domains of text or document processing, and has been held since 2001.

Related Research Articles

<span class="mw-page-title-main">Computing</span> Activity involving calculations or computing machinery

Computing is any goal-oriented activity requiring, benefiting from, or creating computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both hardware and software. Computing has scientific, engineering, mathematical, technological, and social aspects. Major computing disciplines include computer engineering, computer science, cybersecurity, data science, information systems, information technology, and software engineering.

<span class="mw-page-title-main">Computer science</span> Study of computation

Computer science is the study of computation, information, and automation. Computer science spans theoretical disciplines to applied disciplines.

Information retrieval (IR) in computing and information science is the task of identifying and retrieving information system resources that are relevant to an information need. The information need can be specified in the form of a search query. In the case of document retrieval, queries can be based on full-text or other content-based indexing. Information retrieval is the science of searching for information in a document, searching for documents themselves, and also searching for the metadata that describes data, and for databases of texts, images or sounds.

Software engineering is an engineering approach to software development. A practitioner, called a software engineer, applies the engineering design process to develop software.

An information system (IS) is a formal, sociotechnical, organizational system designed to collect, process, store, and distribute information. From a sociotechnical perspective, information systems are composed by four components: task, people, structure, and technology. Information systems can be defined as an integration of components for collection, storage and processing of data of which the data is used to provide information, contribute to knowledge as well as digital products that facilitate decision making.

Theoretical computer science is a subfield of computer science and mathematics that focuses on the abstract and mathematical foundations of computation.

The following outline is provided as an overview of and topical guide to human–computer interaction:

The World Wide Web has become a major delivery platform for a variety of complex and sophisticated enterprise applications in several domains. In addition to their inherent multifaceted functionality, these Web applications exhibit complex behaviour and place some unique demands on their usability, performance, security, and ability to grow and evolve. However, a vast majority of these applications continue to be developed in an ad hoc way, contributing to problems of usability, maintainability, quality and reliability. While Web development can benefit from established practices from other related disciplines, it has certain distinguishing characteristics that demand special considerations. In recent years, there have been developments towards addressing these considerations.

Enterprise architecture (EA) is a business function concerned with the structures and behaviours of a business, especially business roles and processes that create and use business data. The international definition according to the Federation of Enterprise Architecture Professional Organizations is "a well-defined practice for conducting enterprise analysis, design, planning, and implementation, using a comprehensive approach at all times, for the successful development and execution of strategy. Enterprise architecture applies architecture principles and practices to guide organizations through the business, information, process, and technology changes necessary to execute their strategies. These practices utilize the various aspects of an enterprise to identify, motivate, and achieve these changes."

Human-centered computing (HCC) studies the design, development, and deployment of mixed-initiative human-computer systems. It is emerged from the convergence of multiple disciplines that are concerned both with understanding human beings and with the design of computational artifacts. Human-centered computing is closely related to human-computer interaction and information science. Human-centered computing is usually concerned with systems and practices of technology use while human-computer interaction is more focused on ergonomics and the usability of computing artifacts and information science is focused on practices surrounding the collection, manipulation, and use of information.

<span class="mw-page-title-main">Homer R. Warner</span> American cardiologist

Homer Richards Warner was an American cardiologist who was an early proponent of medical informatics who pioneered many aspects of computer applications to medicine. Author of the book, Computer-Assisted Medical Decision-Making, published in 1979, he served as CIO for the University of Utah Health Sciences Center, as president of the American College of Medical Informatics, and was actively involved with the National Institutes of Health. He was first chair of the Department of Medical Informatics at the University of Utah School of Medicine, the first American medical program to formally offer a degree in medical informatics.

<span class="mw-page-title-main">Jan Dietz</span> Dutch computer scientist

Jean Leonardus Gerardus (Jan) Dietz is a Dutch Information Systems researcher, Professor Emeritus of Information Systems Design at the Delft University of Technology, known for the development of the Design & Engineering Methodology for Organisations. and his work on Enterprise Engineering.

<span class="mw-page-title-main">Human–computer interaction</span> Academic discipline studying the relationship between computer systems and their users

Human–computer interaction (HCI) is research in the design and the use of computer technology, which focuses on the interfaces between people (users) and computers. HCI researchers observe the ways humans interact with computers and design technologies that allow humans to interact with computers in novel ways. A device that allows interaction between human being and a computer is known as a "Human-computer Interface (HCI)".

Informatics is the study of computational systems. According to the ACM Europe Council and Informatics Europe, informatics is synonymous with computer science and computing as a profession, in which the central notion is transformation of information. In some cases, the term "informatics" may also be used with different meanings, e.g. in the context of social computing, or in context of library science.

Data-intensive computing is a class of parallel computing applications which use a data parallel approach to process large volumes of data typically terabytes or petabytes in size and typically referred to as big data. Computing applications that devote most of their execution time to computational requirements are deemed compute-intensive, whereas applications are deemed data-intensive require large volumes of data and devote most of their processing time to I/O and manipulation of data.

<span class="mw-page-title-main">Data science</span> Field of study to extract insights from data

Data science is an interdisciplinary academic field that uses statistics, scientific computing, scientific methods, processes, scientific visualization, algorithms and systems to extract or extrapolate knowledge and insights from potentially noisy, structured, or unstructured data.

This glossary of computer science is a list of definitions of terms and concepts used in computer science, its sub-disciplines, and related fields, including terms relevant to software, data science, and computer programming.

<span class="mw-page-title-main">Information engineering</span> Engineering discipline

Information engineering is the engineering discipline that deals with the generation, distribution, analysis, and use of information, data, and knowledge in systems. The field first became identifiable in the early 21st century.

References

  1. "Start | The ACM Symposium on Document Engineering". doceng.org. Retrieved 2024-07-19.
  2. "Welcome to the Center for Document Engineering". Archived from the original on January 1, 2008. Retrieved January 4, 2008.{{cite web}}: CS1 maint: bot: original URL status unknown (link)
  3. ACM Symposium on Document Engineering