Infoveillance is a type of syndromic surveillance that specifically utilizes information found online. [1] The term, along with the term infodemiology , was coined by Gunther Eysenbach to describe research that uses online information to gather information about human behavior. [2] [3] [4]
Eysenbach's work using Google Search queries led to the birth of Google Flu Trends, and other search engines have also been used. [5] [6] Other researchers have utilized social media sites such as Twitter to observe disease outbreak patterns. [7] [8] Infoveillance can detect disease outbreaks faster than traditional public health surveillance systems with minimal costs involved. [9]
Infoveillance methods may be either passive or active. [4] Traditional infoveillance data like search engine queries and website navigation behavior are considered passive, as they attempt to recognize trends automatically, without action (or often even awareness) on the part of the internet users who are generating the data for analysis. Active infoveillance occurs when users choose to respond to a survey, enter symptoms into a website or app, or otherwise participate directly in surveillance efforts by contributing additional information. [4]
Beginning in 2008, Google used aggregated search query data to detect influenza trends and compared the results to countries' official surveillance data with the goal of predicting the spread of the flu. [10] In light of evidence that emerged in 2013 showing that Google Flu Trends sometimes substantially overestimated actual flu rates, researchers proposed a series of more advanced and better-performing approaches to flu modeling from Google search queries. [11] Google Flu Trends stopped publishing reports in 2015. [12]
Google also used aggregated search query data to detect dengue fever trends. [13] Research has also cast doubt on the accuracy of some of these predictions. [14] Google has continued this work to track and predict the COVID-19 pandemic, creating an open dataset on COVID-related search queries for use by researchers. [15]
Other flu prediction projects, including Flu Detector, have come and gone since the advent and removal of Google Flu Trends. Flu Detector was developed by Vasileios Lampos and other researchers at the University of Bristol. [7] It was an application of machine learning that first used feature selection to automatically extract flu-related terms from Twitter content and then used those terms to compute a flu-score for several UK regions based on geolocated tweets. It also formed the basis for a proposed generalized scheme able to track other events. [16]
Mood of the Nation was also developed by Lampos' team. It performed mood analysis on tweets geo-located in various regions of the United Kingdom by computing on a daily basis scores for four types of emotion: anger, fear, joy and sadness.[ citation needed ]
The rise of infoveillance brings up questions about privacy. Privacy concerns are partially dependent on the level of analysis and how data are collected and managed. [4] For instance, individuals may be re-identifiable from search query datasets that have not been properly de-identified. [17] Privacy concerns are increased if data analysis is not done automatically and if search trajectories of individual users are examined. [4]
Epidemiology is the study and analysis of the distribution, patterns and determinants of health and disease conditions in a defined population.
Text mining, text data mining (TDM) or text analytics is the process of deriving high-quality information from text. It involves "the discovery by computer of new, previously unknown information, by automatically extracting information from different written resources." Written resources may include websites, books, emails, reviews, and articles. High-quality information is typically obtained by devising patterns and trends by means such as statistical pattern learning. According to Hotho et al. (2005) we can distinguish between three different perspectives of text mining: information extraction, data mining, and a knowledge discovery in databases (KDD) process. Text mining usually involves the process of structuring the input text, deriving patterns within the structured data, and finally evaluation and interpretation of the output. 'High quality' in text mining usually refers to some combination of relevance, novelty, and interest. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entity relation modeling.
Avian influenza, also known as avian flu or bird flu, is a disease caused by the influenza A virus (IAV) which primarily affects birds but can sometimes affect mammals including humans. Wild aquatic birds are the primary host of Influenza A virus (IAV), which is endemic in many bird populations.
Influenza vaccines, colloquially known as flu shots, are vaccines that protect against infection by influenza viruses. New versions of the vaccines are developed twice a year, as the influenza virus rapidly changes. While their effectiveness varies from year to year, most provide modest to high protection against influenza. Vaccination against influenza began in the 1930s, with large-scale availability in the United States beginning in 1945.
Swine influenza is an infection caused by any of several types of swine influenza viruses. Swine influenza virus (SIV) or swine-origin influenza virus (S-OIV) refers to any strain of the influenza family of viruses that is endemic in pigs. As of 2009, identified SIV strains include influenza C and the subtypes of influenza A known as H1N1, H1N2, H2N1, H3N1, H3N2, and H2N3.
Public health surveillance is, according to the World Health Organization (WHO), "the continuous, systematic collection, analysis and interpretation of health-related data needed for the planning, implementation, and evaluation of public health practice." Public health surveillance may be used to track emerging health-related issues at an early stage and find active solutions in a timely manner. Surveillance systems are generally called upon to provide information regarding when and where health problems are occurring and who is affected.
Google Trends is a website by Google that analyzes the popularity of top search queries in Google Search across various regions and languages. The website uses graphs to compare the search volume of different queries over time.
An e-patient is a health consumer who participates fully in their own medical care, primarily by gathering information about medical conditions that impact them and their families, using the Internet and other digital tools. The term encompasses those who seek guidance for their own ailments, and the friends and family members who research on their behalf. E-patients report two effects of their health research: "better health information and services, and different, but not always better, relationships with their doctors."
Gunther Eysenbach is a German-Canadian researcher on healthcare, especially health policy, eHealth, and consumer health informatics.
The Journal of Medical Internet Research is a peer-reviewed open-access medical journal established in 1999 covering eHealth and "healthcare in the Internet age". The editors-in-chief are Gunther Eysenbach and Rita Kukafka. The publisher is JMIR Publications.
GISAID, the Global Initiative on Sharing All Influenza Data, previously the Global Initiative on Sharing Avian Influenza Data, is a global science initiative established in 2008 to provide access to genomic data of influenza viruses. The database was expanded to include the coronavirus responsible for the COVID-19 pandemic, as well as other pathogens. The database has been described as "the world's largest repository of COVID-19 sequences". GISAID facilitates genomic epidemiology and real-time surveillance to monitor the emergence of new COVID-19 viral strains across the planet.
"Health 2.0" is a term introduced in the mid-2000s, as the subset of health care technologies mirroring the wider Web 2.0 movement. It has been defined variously as including social media, user-generated content, and cloud-based and mobile technologies. Some Health 2.0 proponents see these technologies as empowering patients to have greater control over their own health care and diminishing medical paternalism. Critics of the technologies have expressed concerns about possible misinformation and violations of patient privacy.
Influenza-like illness (ILI), also known as flu-like syndrome or flu-like symptoms, is a medical diagnosis of possible influenza or other illness causing a set of common symptoms. These include fever, shivering, chills, malaise, dry cough, loss of appetite, body aches, nausea, and sneezing typically in connection with a sudden onset of illness. In most cases, the symptoms are caused by cytokines released by immune system activation, and are thus relatively non-specific.
Culturomics is a form of computational lexicology that studies human behavior and cultural trends through the quantitative analysis of digitized texts. Researchers data mine large digital archives to investigate cultural phenomena reflected in language and word usage. The term is an American neologism first described in a 2010 Science article called Quantitative Analysis of Culture Using Millions of Digitized Books, co-authored by Harvard researchers Jean-Baptiste Michel and Erez Lieberman Aiden.
Google Flu Trends (GFT) was a web service operated by Google. It provided estimates of influenza activity for more than 25 countries. By aggregating Google Search queries, it attempted to make accurate predictions about flu activity. This project was first launched in 2008 by Google.org to help predict outbreaks of flu.
Infodemiology was defined by Gunther Eysenbach in the early 2000s as information epidemiology. It is an area of science research focused on scanning the internet for user-contributed health-related content, with the ultimate goal of improving public health. Later, it is also defined as the science of mitigating public health problems resulting from an infodemic.
John Brownstein is a Canadian epidemiologist and Professor of Medicine at the Harvard Medical School as well as the Chief Innovation Officer at Boston Children’s Hospital. His research focuses on development of computational methods in epidemiology for applications to public health also known as computational epidemiology or e-epidemiology He is also the founder of several global public health surveillance systems including HealthMap. He is most known for his work on global tracking of disease outbreaks.
Participatory surveillance is community-based monitoring of other individuals. This term can be applied to both digital media studies and ecological field studies. In the realm of media studies, it refers to how users surveil each other using the internet. Either through the use of social media, search engines, and other web-based methods of tracking, an individual has the power to find information both freely or non freely given about the individual being searched. Issues of privacy emerge within this sphere of participatory surveillance, predominantly focused on how much information is available on the web that an individual does not consent to. More so, disease outbreak researchers can study social-media based patterns to decrease the time it takes to detect an outbreak, an emerging field of study called infodemiology. Within the realm of ecological fieldwork, participatory surveillance is used as an overarching term for the method in which indigenous and rural communities are used to gain greater accessibility to causes of disease outbreak. By using these communities, disease outbreak can be spotted earlier than through traditional means or healthcare institutions.
Neil Morris Ferguson is a British epidemiologist and professor of mathematical biology, who specialises in the patterns of spread of infectious disease in humans and animals. He is the director of the Jameel Institute, and of the MRC Centre for Global Infectious Disease Analysis, and head of the Department of Infectious Disease Epidemiology in the School of Public Health and Vice-Dean for Academic Development in the Faculty of Medicine, all at Imperial College London.
An infodemic is a rapid and far-reaching spread of both accurate and inaccurate information about certain issues. The word is a portmanteau of information and epidemic and is used as a metaphor to describe how misinformation and disinformation can spread like a virus from person to person and affect people like a disease. This term, originally coined in 2003 by David Rothkopf, rose to prominence in 2020 during the COVID-19 pandemic.
{{cite journal}}
: CS1 maint: multiple names: authors list (link)