Data philanthropy

Data philanthropy describes a form of collaboration in which private sector companies share data for public benefit. [1] Uses of data philanthropy are being explored in humanitarian, corporate, human rights, and academic contexts. Since introducing the term in 2011, United Nations Global Pulse has advocated for a global "data philanthropy movement". [2]

Definition

A large amount of the data collected from the Internet comes from user-generated content, including blogs, posts on social networks, and information submitted in forms. Besides user-generated data, corporations also mine data from consumers in order to understand customers, identify new markets, and make investment decisions. Robert Kirkpatrick, Director of United Nations Global Pulse, labelled this data "massive passive data" or "data exhaust". [3] Data philanthropy is the idea that something positive can come from this overload of data. It is defined as the private sector sharing this data in ways that benefit the public. [1] The term philanthropy helps to emphasize that data sharing is a positive act and that the shared data is a public good. [3]

Challenges

One challenge of sharing data is protecting the Internet privacy of the users whose data is being used. Mathematical techniques (differential privacy and space-time boxes) have been introduced to make personal data accessible while keeping the users who generated it anonymous. But even if these algorithms work, there is always the possibility and fear of re-identification. [1]
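As a rough illustration of one of the techniques named above, the following sketch applies the Laplace mechanism, a standard way of achieving differential privacy, to a simple count. The function names, the example count, and the choice of epsilon are assumptions made for illustration; they do not describe any particular company's or organisation's implementation.

```python
import random


def laplace_noise(scale: float, rng: random.Random) -> float:
    """Draw one sample from a zero-mean Laplace distribution.

    The difference of two independent exponential draws, each with mean
    `scale`, follows a Laplace distribution with that scale.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_count(true_count: int, epsilon: float, rng: random.Random) -> float:
    """Return a noisy count that satisfies epsilon-differential privacy.

    Assumes a counting query: adding or removing one person changes the
    result by at most 1, so the sensitivity is 1 and the noise scale is
    1 / epsilon. Smaller epsilon means more noise and stronger privacy.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return true_count + laplace_noise(1.0 / epsilon, rng)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed only so the demo is repeatable
    # Hypothetical example: number of users who visited a given area.
    print(private_count(1200, epsilon=0.5, rng=rng))
```

The released value stays close to the true count on average, while any single user's presence or absence has only a bounded effect on the output. This does not by itself remove the re-identification concern when many queries are answered about the same data.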

A second challenge is convincing corporations to share their data. The big data that corporations collect gives them a competitive advantage, because it allows them to infer meaning about consumer behaviour. The fear is that by sharing all their information, they may lose their competitive edge. [1]

Data philanthropy also raises numerous moral challenges. In 2016, Mariarosaria Taddeo proposed an ethical framework that aims to address them. [4]

Sharing strategies

The goal of data philanthropy is to create a global data commons where companies, governments, and individuals can contribute anonymous, aggregated datasets. [2] United Nations Global Pulse offers four tactics that companies can use to share their data while preserving consumer anonymity: [1]

  1. Share aggregated and derived data sets for analysis under nondisclosure agreements (NDA)
  2. Allow researchers to analyse data within the private company's own network, under NDA
  3. Real-Time Data Commons: data pooled and aggregated between multiple companies of the same industry to protect competitiveness
  4. Public/Private Alerting Network: companies mine data behind their own firewalls and share indicators

By providing these four tactics, United Nations Global Pulse hopes to give companies both the impetus and practical options to share their data with the public.
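The first tactic, sharing aggregated rather than individual-level data, can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the record layout (subscriber identifier, region), the function name, and the minimum group size of 10 are all hypothetical, and suppressing small groups is only one common safeguard, not a guarantee of anonymity.

```python
from collections import defaultdict


def aggregate_by_region(records, min_group_size=10):
    """Count distinct subscribers per region and drop small groups.

    `records` is an iterable of (subscriber_id, region) pairs. Regions
    with fewer than `min_group_size` distinct subscribers are suppressed,
    because very small groups are easier to re-identify.
    """
    subscribers = defaultdict(set)
    for subscriber_id, region in records:
        subscribers[region].add(subscriber_id)
    return {
        region: len(ids)
        for region, ids in subscribers.items()
        if len(ids) >= min_group_size
    }


if __name__ == "__main__":
    # Made-up records: 12 subscribers in "north", 3 in "south".
    demo = [(f"u{i}", "north") for i in range(12)]
    demo += [(f"v{i}", "south") for i in range(3)]
    print(aggregate_by_region(demo))
```

Only the per-region totals leave the company in this sketch; the subscriber identifiers themselves are never shared.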

Digital disease detection

By using data gathered from social media, cell phones, and other communication modes, health researchers have been able to track the spread of diseases. [5]

In the United States, HealthMap uses data philanthropy-related tactics to track disease outbreaks. HealthMap analyses data from publicly available media sources such as news websites, government alerts, and social media sites like X (formerly known as Twitter) for outbreaks of various illnesses around the world. [5] [6] Another website, Flu Near You, allows users to report their own health status on a weekly basis. Traditional flu surveillance can take up to two weeks to confirm outbreaks. [5] Doctors must wait for a virological test to confirm the outbreak before reporting it to the Centers for Disease Control. This form of data philanthropy provides up-to-date information on various health concerns by using publicly available information gathered from news outlets, government alerts, and social media sites. It is the data gathered on social media sites, where users are not aware that their data is being mined, that leads to HealthMap and Flu Near You being considered data philanthropy. [5]

The Centers for Disease Control and Prevention collaborated with Google and launched Google Flu Trends in 2008, a website that uses flu-related searches and user location to track the spread of the flu. Users can visit the website to compare the amount of flu-related search activity with the reported numbers of flu outbreaks on a graphic map. The difficulty with this method of tracking is that Google searches are sometimes performed out of curiosity rather than because an individual is suffering from the flu. According to Ashley Fowlkes, an epidemiologist in the CDC Influenza division, "the Google Flu Trends system tries to account for that type of media bias by modelling search terms over time to see which ones remain stable". [5] Google Flu Trends no longer publishes current flu estimates on the public website. Visitors to the site can still view and download previous estimates. Current data can be shared with verified researchers. [7]

A study by Harvard School of Public Health (HSPH) released in the October 12, 2012 issue of the journal Science discussed how phone data helped curb the spread of malaria in Kenya. The researchers mapped phone calls and texts made by 14,816,521 Kenyan mobile phone subscribers. [8] When individuals left their primary living location, the destination and length of the journey were calculated. This data was then compared to a 2009 malaria prevalence map to estimate the disease's commonness in each location. Combining all this information, the researchers could estimate the probability of an individual carrying malaria and map the movement of the disease. This research can be used to track the spread of similar diseases. [8]
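The idea of combining travel data with a prevalence map can be illustrated with a deliberately simplified sketch. It is not the model used in the study: it assumes that each traveller carries the parasite with a probability equal to the prevalence in their home region, and it ignores journey length, seasonality, and transmission at the destination. The region names and numbers are invented for the example.

```python
def expected_infected_travellers(trip_counts, prevalence):
    """Estimate infected travellers on each origin-destination route.

    `trip_counts` maps (origin, destination) to a number of journeys.
    `prevalence` maps each region to the fraction of residents infected.
    Simplifying assumption: a traveller is infected with probability
    equal to the prevalence in the origin region.
    """
    return {
        (origin, destination): trips * prevalence[origin]
        for (origin, destination), trips in trip_counts.items()
    }


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the study.
    trips = {("lake", "capital"): 1000, ("capital", "lake"): 200}
    rates = {"lake": 0.25, "capital": 0.0}
    for route, infected in expected_infected_travellers(trips, rates).items():
        print(route, infected)
```

Routes with many travellers leaving high-prevalence regions stand out in such an estimate, which is the kind of pattern that can point to where imported infections are likely to arrive.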

Application in various fields

Through data philanthropy, 'big data' corporations such as social networking sites, telecommunication companies, and search engines collect user-generated information and make it available to a data sharing system. This also permits institutions to give back to a beneficial cause. With technological advancements, sharing data on a global scale and analysing it in depth could alter the response to events such as natural disasters, epidemics, and worldwide economic problems. Some analysts have argued that this aggregated information is beneficial for the common good and can lead to developments in research and data production in a range of fields. [9]

Humanitarian aid

Calling patterns of mobile phone users can indicate the socioeconomic standing of the populace, which can be used to deduce "its access to housing, education, healthcare, and basic services such as water and electricity". [9] Researchers from Columbia University and Karolinska Institute use information from mobile phone providers to assist in the distribution of resources by deducing the movement of people displaced by natural disasters. Big data can also provide information on looming disasters and can assist relief organizations in responding rapidly and locating displaced individuals. Analysing certain patterns within this 'big data' could transform the response to destructive events such as natural disasters, disease outbreaks, and global economic distress, by using real-time information to understand the welfare of individuals. Corporations use digital services, such as human sensor systems, to detect and solve impending problems within communities. This is a strategy implemented by the private sector to protect citizens by sharing anonymized customer information with the public sector, whilst also protecting customers' privacy. [9]

Impoverished areas

Poverty remains a worldwide issue, with over 2.5 billion people [10] currently impoverished. Gathering accurate data has been a complex problem, but developments in technology and the use of 'big data' [10] offer one way to improve the situation. Statistics indicate the widespread use of mobile phones, even within impoverished communities. This availability could prove vital in gathering data on populations living in poverty. Additional data can be collected through Internet access, social media, utility payments, and governmental statistics. Data-driven activities can lead to the accumulation of 'big data', which in turn can assist international non-governmental organizations in documenting and evaluating the needs of underprivileged populations. Through data philanthropy, NGOs can distribute information whilst cooperating with governments and private companies. [10]

Corporate

Data philanthropy incorporates aspects of social philanthropy by permitting corporations to create profound impacts through the act of giving back by sharing proprietary datasets. [11] The public sector faces unequal and limited access to frequently updated data, although it also produces, collects, and preserves information, which has proven to be an essential asset. Companies track and analyse users' online activities to gain more insight into their needs in relation to new products and services. [12] These companies view the welfare of the population as key to the expansion and progression of business, and use their data to place a spotlight on the plight of global citizens. [9] Experts in the private sector stress the importance of merging data streams such as retail, mobile phone, and social media data to create solutions to global issues. Despite the inevitable risk of sharing private information, doing so is beneficial and serves the interest of the public. [13] The digital revolution produces extensive 'big data' that is user-generated and available on the web. Corporations accumulate information on customer preferences through the digital services customers use and the products they purchase, in order to gain a clear insight into their clientele and future market opportunities. [9] However, the rights of individuals concerning privacy and ownership of data are a controversial issue, as governments and other institutions can use this collective data for unethical purposes. Companies monitor and probe consumers' online activities in order to better understand their clientele, develop tailored offerings, and in turn increase their profits. [14]

Academia

Data philanthropy plays an important role in academia. Researchers encounter many obstacles when attempting to access data. Data such as social media streams is available only to a limited number of researchers who are authorized to use these restricted resources, enabling them to produce more knowledge and develop new studies. For example, Twitter markets access to its real-time APIs at exorbitant prices, which often surpass the budgets of most researchers. 'Data grants' [14] is a trial program created by Twitter that provides a select number of academics and researchers with access to real-time databases in order to further their research. Researchers apply for access to large data downloads on specific topics. [14]

Human rights

Data philanthropy aids the human rights movement by assisting in the distribution of evidence to truth commissions and war crimes tribunals. Proponents of human rights accumulate data on abuse occurring within states, which is then used for scientific analysis and to drive awareness and action. For example, non-profit organizations compile data from human rights monitors in war zones in order to assist the UN High Commissioner for Human Rights. This analysis uncovers inconsistencies in the reported number of war casualties, which in turn draws international attention and influences discussions of global policy. [14]

References

  1. Pawelke, A. and Tatevossian, A. (2013, May 8). Data philanthropy: where are we now? United Nations Global Pulse.
  2. Coren, M. (2011, December 9). Data Philanthropy: Open data for world-changing solutions. Fast Company.
  3. Kirkpatrick, R. (2011, September 20). Data philanthropy is good for business. Forbes.
  4. Taddeo, M. (2016). Data philanthropy and the design of the infraethics for information societies. Philosophical Transactions of the Royal Society A, 374(2083). DOI: 10.1098/rsta.2016.0113
  5. Schmidt, C. (2012). Trending Now: Using Social Media to Predict and Track Disease Outbreaks. Environmental Health Perspectives, 120(1), A30–A33.
  6. Reddy, E. (2015, July 14). Using Twitter data to study the world's health. Twitter.
  7. O'Connor, F. (2015, August 20). Google Flu Trends calls out sick, indefinitely. PC World.
  8. Datz, T. (2012, October 11). Using cell phone data to curb the spread of malaria. Harvard Chan.
  9. Kirkpatrick, R. (2011, September 20). Data Philanthropy is Good for Business. Forbes.
  10. Delgado, R. (2014, May 23). Lifting Up: How Big Data Can Help Eliminate Poverty. Smart Data Collection.
  11. iRevolution (2012, July 4). Data Philanthropy for Humanitarian Response.
  12. Stempeck, M. (2014, July 24). Data Is a Form of Corporate Philanthropy. Harvard Business Review.
  13. Kirkpatrick, R. (2013, March 21). A New Type of Philanthropy: Donating Data. Harvard Business Review.
  14. Fruchterman, J. (2013, March 19). Big Data Means More Than Big Profits. Harvard Business Review.