Public health informatics

Last updated July 29, 2025

Public health informatics has been defined as the systematic application of information and computer science and technology to public health practice, research, and learning.^[1] It is one of the subdomains of health informatics, which applies data management to medical systems.

In the United States, responsibility for collecting and managing public health informatics data is divided between the federal and state levels. The Centers for Disease Control and Prevention (CDC) is the responsible agency at the federal level, while the state departments of health are responsible at the state and local level.^[2] These agencies have standardized the reporting of digital health data by hospitals and clinics. The government departments can then gather this data, analyze it, and use it for a variety of purposes. Such purposes typically fall under the three major domains of public health informatics: understanding more about complex processes, storing a record of public health data, and analyzing and publicizing a general version of the gathered data for public consumption. Data collected from social media can also be included in these processes to refine the accuracy of the results.^[3]

Job opportunities in this field include positions with the CDC and the American Medical Informatics Association, which provides more information about informatics for professionals in medical fields.

Health Informatics in the United States

In developed countries like the United States, public health informatics is practiced by individuals in public health agencies at the federal and state levels and in the larger local health jurisdictions. Additionally, research and training in public health informatics take place at a variety of academic institutions.

At the federal Centers for Disease Control and Prevention in Atlanta, Georgia, the Public Health Surveillance and Informatics Program Office (PHSIPO) focuses on advancing the state of information science and applies digital information technologies to aid in the detection and management of diseases and syndromes in individuals and populations.^[1]

The bulk of the work of public health informatics in the United States, as with public health generally, takes place at the state and local level, in the state departments of health and the county or parish departments of health.^[2] At a state health department the activities may include: collection and storage of vital statistics (birth and death records); collection of reports of communicable disease cases from doctors, hospitals, and laboratories, used for infectious disease surveillance; display of infectious disease statistics and trends; collection of child immunization and lead screening information; daily collection and analysis of emergency room data to detect early evidence of biological threats; collection of hospital capacity information to allow for planning of responses in case of emergencies. Each of these activities presents its own information processing challenge.^[4]
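As an illustration of the kind of information processing involved in the daily analysis of emergency room data, the following is a minimal sketch of a simple threshold rule: a day is flagged when its visit count exceeds the mean of the preceding days by more than a chosen number of standard deviations. The function name `flag_unusual_days`, the seven-day baseline, the two-standard-deviation threshold, and the counts are all assumptions made for this example; it does not represent the method used by any particular health department.

```python
from statistics import mean, stdev


def flag_unusual_days(daily_counts, baseline_days=7, threshold_sd=2.0):
    """Return indexes of days whose count exceeds the recent baseline.

    A day is flagged when its count is greater than the mean of the
    previous `baseline_days` counts plus `threshold_sd` sample standard
    deviations of those counts.
    """
    flagged = []
    for day in range(baseline_days, len(daily_counts)):
        baseline = daily_counts[day - baseline_days:day]
        limit = mean(baseline) + threshold_sd * stdev(baseline)
        if daily_counts[day] > limit:
            flagged.append(day)
    return flagged


# Made-up daily visit counts for one syndrome category.
example_counts = [10, 12, 11, 9, 10, 11, 12, 10, 30]
print(flag_unusual_days(example_counts))
```

A rule this simple ignores day-of-week and seasonal effects, which is one reason operational surveillance systems tend to use more elaborate statistical methods.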

Collection of public health data

Since the beginning of the World Wide Web, public health agencies with sufficient information technology resources have been transitioning to web-based collection of public health data, and, more recently, to automated messaging of the same information. From roughly 2000 to 2005, the Centers for Disease Control and Prevention, under its National Electronic Disease Surveillance System (NEDSS),^[5] built and provided free to states a comprehensive web and message-based reporting system called the NEDSS Base System (NBS).^[6] Because funding is limited, and because building isolated, jurisdiction-specific ("fiefdom-based") systems is considered unwise, only a few states and larger counties have built their own versions of electronic disease surveillance systems, such as Pennsylvania's PA-NEDSS.^[7] These locally built systems do not provide timely, full interstate notification services, which is said to result in higher disease rates than with the federal NEDSS product.

To promote interoperability, the CDC has encouraged the adoption in public health data exchange of several standard vocabularies and messaging formats from the health care world. The most prominent of these are: the Health Level 7 (HL7) standards for health care messaging; the LOINC system for encoding laboratory test and result information; and the Systematized Nomenclature of Medicine (SNOMED) vocabulary of health care concepts.^[8]
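To make these messaging standards more concrete, the following is a minimal sketch of reading a coded laboratory result from an HL7 version 2 style message, in which segments are separated by carriage returns, fields by `|`, and components by `^`. The sample message is hand-written for illustration, the observation code `12345-6` is a placeholder rather than a verified LOINC code, and the function name `parse_obx_segments` is hypothetical. The sketch ignores escaping, repetition, and validation, which a full HL7 library would normally handle.

```python
# Toy HL7 v2-style message: hand-written sample, not real data.
SAMPLE_MESSAGE = "\r".join([
    "MSH|^~\\&|LAB_APP|LAB_FACILITY|HEALTH_DEPT|STATE|202401151200||ORU^R01|MSG0001|P|2.5.1",
    "PID|1||PATIENT123||DOE^JANE",
    # "LN" names LOINC as the coding system; the code itself is a placeholder.
    "OBX|1|CWE|12345-6^Example test name^LN||POS^Positive^L||||||F",
])


def parse_obx_segments(message: str) -> list[dict]:
    """Return the coded identifier and value of each OBX (observation) segment."""
    results = []
    for segment in message.split("\r"):
        fields = segment.split("|")
        if fields[0] != "OBX":
            continue
        # OBX-3 is the observation identifier: code^text^coding system.
        code, text, system = (fields[3].split("^") + ["", "", ""])[:3]
        # OBX-5 is the observation value; keep only its first component.
        value = fields[5].split("^")[0]
        results.append({"code": code, "text": text, "system": system, "value": value})
    return results


print(parse_obx_segments(SAMPLE_MESSAGE))
```

Because the test is identified by a code from a shared vocabulary rather than by a local free-text name, a receiving agency can group results from different laboratories without knowing each laboratory's internal naming.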

Since about 2005, the CDC has promoted the idea of the Public Health Information Network to facilitate the transmission of data from various partners in the health care industry and elsewhere (hospitals, clinical and environmental laboratories, doctors' practices, pharmacies) to local health agencies, then to state health agencies, and then to the CDC.^[9] At each stage the entity must be capable of receiving the data, storing it, aggregating it appropriately, and transmitting it to the next level. A typical example is infectious disease data: hospitals, labs, and doctors are legally required to report it to local health agencies, local health agencies must report it to their state public health department, and the states must report it in aggregate form to the CDC. Among other uses, the CDC publishes the Morbidity and Mortality Weekly Report (MMWR) based on these data acquired systematically from across the United States.^[10]
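The aggregation step in this reporting chain can be sketched as follows. This is an illustrative example only: the record fields (`county`, `disease`, `week`), the sample records, and the function name `aggregate_for_next_level` are assumptions, and real reporting follows the formats and case definitions set by the receiving agency.

```python
from collections import Counter

# Hypothetical case-level reports held by a state health department.
case_reports = [
    {"county": "A", "disease": "measles", "week": "2024-W03"},
    {"county": "A", "disease": "measles", "week": "2024-W03"},
    {"county": "B", "disease": "measles", "week": "2024-W03"},
    {"county": "B", "disease": "pertussis", "week": "2024-W04"},
]


def aggregate_for_next_level(reports):
    """Count cases per (disease, week), dropping county and case-level detail."""
    return Counter((report["disease"], report["week"]) for report in reports)


for (disease, week), count in sorted(aggregate_for_next_level(case_reports).items()):
    print(disease, week, count)
```

Only the counts leave the jurisdiction in this sketch, which mirrors the idea of states reporting to the CDC in aggregate form.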

Major issues in the collection of public health data are: lack of awareness of the need to report data; lack of resources on the part of either the reporter or the collector; lack of interoperability of data interchange formats, which can be at the purely syntactic or at the semantic level; and variation in reporting requirements across the states, territories, and localities.^[11]
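The difference between syntactic and semantic interoperability can be illustrated with a short sketch. Here a payload fails syntactically when it cannot be parsed in the expected format, and fails semantically when it parses correctly but uses a term the receiver cannot interpret. The JSON payload format, the `result` field, the vocabulary table, and the function name `normalize_result` are all assumptions for this example; a real system would map to a maintained terminology rather than a hand-made table.

```python
import json

# Hypothetical map from sender-specific terms to a shared vocabulary.
RESULT_VOCABULARY = {
    "pos": "positive", "positive": "positive", "detected": "positive",
    "neg": "negative", "negative": "negative", "not detected": "negative",
}


def normalize_result(raw_payload: str) -> str:
    """Parse a payload and translate its result into the shared vocabulary."""
    # Syntactic level: the payload must be a JSON object with a "result" field.
    try:
        raw_value = json.loads(raw_payload)["result"]
    except (json.JSONDecodeError, KeyError, TypeError) as exc:
        raise ValueError("syntactic error: payload is not in the expected format") from exc
    # Semantic level: the term must have an agreed meaning.
    key = str(raw_value).strip().lower()
    if key not in RESULT_VOCABULARY:
        raise ValueError(f"semantic error: unknown result term {raw_value!r}")
    return RESULT_VOCABULARY[key]


print(normalize_result('{"result": "Detected"}'))
```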

Public health informatics can be divided into three categories.

Studying health data models

The first category is to discover and study models of complex systems, such as disease transmission. This can be done through different types of data collection, such as hospital surveys or electronic surveys submitted to an organization such as the CDC.^[12] Transmission rates, disease incidence rates, and surveillance data can be obtained from government organizations, such as the CDC, or global organizations, such as the World Health Organization (WHO). Disease transmission and incidence are not the only subjects of study: public health informatics can also examine, for example, how many people have or lack health insurance and how often they visit a doctor.^[13] Before the advent of the internet, public health data in the United States, like other healthcare and business data, were collected on paper forms and stored centrally at the relevant public health agency. If the data were to be computerized, they required a distinct data entry process, were stored in the various file formats of the day, and were analyzed by mainframe computers using standard batch processing.^[14]
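As one illustration of a disease transmission model, the following is a minimal sketch of a discrete-time SIR (susceptible, infected, recovered) model. The SIR model is a standard textbook model chosen here for illustration and is not named in the sources cited above; the parameter values and the function name `simulate_sir` are assumptions, and the sketch is not fitted to any real data.

```python
def simulate_sir(population, initial_infected, beta, gamma, days):
    """Simulate a discrete-time SIR model with one step per day.

    beta is the transmission rate per day and gamma the recovery rate per
    day. Returns a list of (susceptible, infected, recovered) tuples.
    """
    susceptible = float(population - initial_infected)
    infected = float(initial_infected)
    recovered = 0.0
    history = [(susceptible, infected, recovered)]
    for _ in range(days):
        new_infections = beta * susceptible * infected / population
        new_recoveries = gamma * infected
        susceptible -= new_infections
        infected += new_infections - new_recoveries
        recovered += new_recoveries
        history.append((susceptible, infected, recovered))
    return history


# Illustrative parameters only.
history = simulate_sir(population=1000, initial_infected=1, beta=0.3, gamma=0.1, days=100)
peak_day = max(range(len(history)), key=lambda day: history[day][1])
print("day of peak infections:", peak_day)
```

In practice, parameters such as the transmission rate would be estimated from surveillance data of the kind described above rather than chosen by hand.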

Storing public health data

The second category is to find ways to improve the efficiency of different public health systems. This is done through the methods used to collect and store data and through the way the data is used to address current health problems. To keep everything standardized, vocabulary and word usage need to be consistent across all systems. Finding new ways to link and share new data with current systems is important for keeping everything up to date.^[15]

Storage of public health data shares the same data management issues as other industries. As in other industries, the details of how these issues play out are affected by the nature of the data being managed.^[16]

Due to the complexity and variability of public health data, like health care data generally, the issue of data modeling presents a particular challenge. While a generation ago flat data sets for statistical analysis were the norm, today's requirements of interoperability and integrated sets of data across the public health enterprise require more sophistication.^[17] The relational database is increasingly the norm in public health informatics. Designers and implementers of the many sets of data required for various public health purposes must find a workable balance between very complex and abstract data models such as HL7's Reference Information Model (RIM) or CDC's Public Health Logical Data Model, and simplistic, ad hoc models that untrained public health practitioners come up with and feel capable of working with.^[18]
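The contrast between a flat data set and a relational model can be sketched with a small example using Python's built-in `sqlite3` module. The two-table schema, the column names, and the sample rows are assumptions chosen for illustration; they are far simpler than the HL7 RIM or the CDC's Public Health Logical Data Model and are not derived from either.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patient (
    patient_id INTEGER PRIMARY KEY,
    birth_year INTEGER,
    county     TEXT
);
CREATE TABLE case_report (
    report_id   INTEGER PRIMARY KEY,
    patient_id  INTEGER NOT NULL REFERENCES patient(patient_id),
    condition   TEXT NOT NULL,
    report_date TEXT NOT NULL
);
""")

# Made-up rows: one patient can have several reports without repeating
# the patient's attributes, unlike in a flat file.
conn.executemany("INSERT INTO patient VALUES (?, ?, ?)",
                 [(1, 1980, "A"), (2, 2015, "B")])
conn.executemany("INSERT INTO case_report VALUES (?, ?, ?, ?)",
                 [(1, 1, "pertussis", "2024-01-05"),
                  (2, 1, "influenza", "2024-02-01"),
                  (3, 2, "pertussis", "2024-01-09")])

rows = conn.execute("""
    SELECT p.county, c.condition, COUNT(*)
    FROM case_report AS c
    JOIN patient AS p ON p.patient_id = c.patient_id
    GROUP BY p.county, c.condition
    ORDER BY p.county, c.condition
""").fetchall()
print(rows)
```

A model of this size is easy for practitioners to work with but cannot represent much of what the more abstract models cover, which is the balance the paragraph above describes.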

Due to the variability of the incoming data to public health jurisdictions, data quality assurance is also a major issue.^[19]
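A minimal sketch of record-level quality checks is shown below. The required fields, the ISO date rule, and the function name `find_quality_issues` are assumptions made for this example; actual quality assurance rules depend on the reporting requirements of the jurisdiction concerned.

```python
from datetime import date

# Hypothetical set of fields every incoming report must carry.
REQUIRED_FIELDS = ("patient_id", "condition", "report_date")


def find_quality_issues(record: dict) -> list[str]:
    """Return a list of human-readable problems found in one incoming record."""
    issues = [f"missing {field}" for field in REQUIRED_FIELDS if not record.get(field)]
    raw_date = record.get("report_date")
    if raw_date:
        try:
            parsed = date.fromisoformat(raw_date)
        except (TypeError, ValueError):
            issues.append("report_date is not an ISO date")
        else:
            if parsed > date.today():
                issues.append("report_date is in the future")
    return issues


print(find_quality_issues({"patient_id": "P1", "condition": "", "report_date": "01/05/2024"}))
```

Records with a non-empty issue list could be held for manual review instead of being loaded, so that one sender's formatting problems do not contaminate the aggregated data.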

Maintaining current public health data

Finally, the last category can be thought of as maintaining and enriching current systems and models so that they can adapt to the overflow of data and to the storing and sorting of this new data. This can be as simple as connecting directly to an electronic data collection source, such as health records from a hospital, or drawing on public information (for example, from the CDC) about disease rates and transmission. Finding new algorithms that can sort through large quantities of data quickly and effectively is necessary as well.^[20]

The need to extract usable public health information from the mass of data available requires the public health informaticist to become familiar with a range of analysis tools, from business intelligence tools that produce routine or ad hoc reports, to sophisticated statistical analysis tools such as DAP/SAS and PSPP/SPSS, to Geographical Information Systems (GIS) that expose the geographical dimension of public health trends. Such analyses usually require methods that appropriately secure the privacy of the health data. One approach is to separate the individually identifiable variables of the data from the rest.^[21] A broader approach to analysis is to use social media to detect health trends. Since the late 2000s, data from social media websites such as Twitter and Facebook, as well as search engines such as Google and Bing, have been used extensively in detecting trends in public health.^[3]
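The privacy approach mentioned above, separating individually identifiable variables from the rest of the data, can be sketched as follows. The choice of identifying fields, the random surrogate key, and the function name `split_identifiers` are assumptions for this example. Separation of this kind does not by itself guarantee anonymity, since combinations of the remaining variables may still identify a person.

```python
import secrets

# Hypothetical list of directly identifying variables.
IDENTIFYING_FIELDS = {"name", "address", "date_of_birth"}


def split_identifiers(records):
    """Split records into an identity lookup and an analysis data set.

    The two parts are linked only by a random surrogate key, so the
    analysis data set can be shared without the identifying variables.
    """
    identities = {}
    analytic_rows = []
    for record in records:
        study_id = secrets.token_hex(8)  # random key with no meaning of its own
        identities[study_id] = {
            key: value for key, value in record.items() if key in IDENTIFYING_FIELDS
        }
        analytic_row = {
            key: value for key, value in record.items() if key not in IDENTIFYING_FIELDS
        }
        analytic_row["study_id"] = study_id
        analytic_rows.append(analytic_row)
    return identities, analytic_rows


# Made-up record for illustration.
identities, analytic_rows = split_identifiers([
    {"name": "Jane Doe", "address": "1 Main St", "date_of_birth": "1980-01-01",
     "condition": "pertussis", "county": "A"},
])
print(analytic_rows)
```

The identity lookup would be stored separately under stricter access controls, and analysts would work only with the second data set.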

The health informatics industry

A few organizations provide useful information for professionals who want to be more involved in public health informatics, such as the American Medical Informatics Association (AMIA). AMIA is for professionals involved in health care, informatics research, and biomedical research, including physicians, scientists, researchers, and students. The main goals of AMIA are to move from 'bench to bedside', help improve the impact of health innovations, and advance the public health informatics field. It holds annual conferences, online classes, and webinars, which are free to its members. It also has a career center specifically for the biomedical and health informatics community.^[22]

Many jobs and fellowships in public health informatics are offered. The CDC (Centers for Disease Control and Prevention) has various fellowship programs, while multiple colleges and companies offer degree programs or training in this field.^[23]

For more information on these topics, follow the links below:

Programs | Johns Hopkins | Bloomberg School of Public Health
"What We Do". www.phii.org. Retrieved 12 September 2023.
SAPPHIRE (Health care) or Situational Awareness and Preparedness for Public Health Incidences and Reasoning Engines is a semantics-based health information system capable of tracking and evaluating situations and occurrences that may affect public health.

References

[1] Buehler, James W.; Hopkins, Richard S.; Overhage, J. Marc; Sosin, Daniel M.; Tong, Van. "Framework for Evaluating Public Health Surveillance Systems for Early Detection of Outbreaks: Recommendations from the CDC Working Group". www.cdc.gov. Retrieved 17 November 2024.
[2] Massoudi, B. L.; Chester, K. G. (2017). "Public Health, Population Health, and Epidemiology Informatics: Recent Research and Trends in the United States". Yearbook of Medical Informatics. 26 (1): 241–247. doi:10.15265/IY-2017-035.
[3] Ayers, John W.; Althouse, Benjamin M.; Dredze, Mark (9 April 2014). "Could Behavioral Medicine Lead the Web Data Revolution?". JAMA. 311 (14): 1399–1400. doi:10.1001/jama.2014.1505. ISSN 0098-7484. PMC 4670613. PMID 24577162.
[4] Institute of Medicine (US) Committee for the Study of the Future of Public Health (1988), "Summary of the Public Health System in the United States", The Future of Public Health, National Academies Press (US), retrieved 17 November 2024.
[5] The National Electronic Disease Surveillance System Working Group (2001). "National Electronic Disease Surveillance System (NEDSS): A Standards-Based Approach To Connect Public Health and Clinical Medicine". Journal of Public Health Management and Practice. 7 (6): 43–50. doi:10.1097/00124784-200107060-00005. ISSN 1078-4659. JSTOR 44971583. PMID 11713753.
[6] CDC (21 February 2024). "About National Electronic Disease Surveillance System Base System (NBS)". National Electronic Disease Surveillance System Base System (NBS). Retrieved 17 November 2024.
[7] "PA-NEDSS | Department of Health | Commonwealth of Pennsylvania". www.pa.gov. Retrieved 17 November 2024.
[8] CDC (30 September 2024). "Data Interchange Standards". PHIN Tools and Resources. Retrieved 17 November 2024.
[9] CDC (10 June 2024). "PHIN Tools & Resources for Public Health". PHIN Tools and Resources. Retrieved 17 November 2024.
[10] "National Center for Public Health Informatics (CPE)". stacks.cdc.gov. 23 October 2008. Retrieved 17 November 2024.
[11] CDC (24 October 2024). "About Public Health Data Interoperability". Public Health Data Interoperability. Retrieved 17 November 2024.
[12] CDC (11 September 2024). "Surveys and Data Collection Systems". National Center for Health Statistics. Retrieved 17 November 2024.
[13] Felix, Suad El Burai (2024). "A Standard Framework for Evaluating Large Health Care Data and Related Resources". MMWR Supplements. 73 (3): 1–13. doi:10.15585/mmwr.su7303a1. ISSN 2380-8950. PMC 11078514. PMID 38713639.
[14] "Portfolio:Public health informatics - Write Edit Teach". www.writediteach.com. Retrieved 17 November 2024.
[15] "Using Technologies for Data Collection and Management | Epidemic Intelligence Service | CDC". www.cdc.gov. 25 September 2019. Retrieved 17 November 2024.
[16] "What is Health Data Management? Benefits, Challenges and Storage". Cloudian. Retrieved 17 November 2024.
[17] Walker, Daniel M.; et al. (24 February 2023). "Perspectives on Challenges and Opportunities for Interoperability: Findings From Key Informant Interviews With Stakeholders in Ohio". JMIR Medical Informatics. 11: e43848. doi:10.2196/43848.
[18] Priyatna, Freddy; et al. (5 October 2017). "Querying clinical data in HL7 RIM based relational model with morph-RDB". Journal of Biomedical Semantics. 8 (1): 49. doi:10.1186/s13326-017-0155-8.
[19] Chen, Hong; et al. (14 May 2014). "A review of data quality assessment methods for public health information systems". International Journal of Environmental Research and Public Health. 11 (5): 5170–5207. doi:10.3390/ijerph110505170.
[20] "Data Modernization Initiative | CDC". www.cdc.gov. 18 October 2024. Retrieved 17 November 2024.
[21] CDC (13 September 2024). "Data and Analysis Tools". National Center for Health Statistics. Retrieved 17 November 2024.
[22] "Programs | Johns Hopkins | Bloomberg School of Public Health".
[23] "What We Do". www.phii.org. Retrieved 12 September 2023.

Public Health Informatics and Information Systems by Patrick W. O'Carroll, William A. Yasnoff, M. Elizabeth Ward, Laura H. Ripp, Ernest L. Martin, D.A. Ross, A.R. Hinman, K. Saarlas, William H. Foege (Hardcover - Oct 16, 2002) ISBN 0-387-95474-0
A Vision for More Effective Public Health Information Technology on SSRN
Olmeda, Christopher J. (2000). Information Technology in Systems of Care. Delfin Press. ISBN 978-0-9821442-0-6
on FDA
Health Data Tools and Statistics

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.


