Machine ethics

Last updated October 28, 2024

Machine ethics (or machine morality, computational morality, or computational ethics) is a part of the ethics of artificial intelligence concerned with adding moral behaviors to, or ensuring the moral behavior of, man-made machines that use artificial intelligence, otherwise known as artificially intelligent agents.^[1] Machine ethics differs from other ethical fields related to engineering and technology. It should not be confused with computer ethics, which focuses on human use of computers. It should also be distinguished from the philosophy of technology, which concerns itself with technology's grander social effects.^[2]

Definitions

James H. Moor, one of the pioneering theoreticians in the field of computer ethics, defines four kinds of ethical robots. Drawing on extensive research in the philosophy of artificial intelligence, philosophy of mind, philosophy of science, and logic, Moor classifies machines as ethical impact agents, implicit ethical agents, explicit ethical agents, or full ethical agents. A machine can be more than one type of agent.^[3]

Ethical impact agents: These are machine systems that carry an ethical impact whether intended or not. At the same time, they have the potential to act unethically. Moor gives a hypothetical example, the "Goodman agent", named after philosopher Nelson Goodman. The Goodman agent compares dates but has the millennium bug. This bug resulted from programmers who represented dates with only the last two digits of the year, so any date from 2000 onward would be mistakenly treated as earlier than dates in the late 20th century. The Goodman agent was thus an ethical impact agent before 2000 and an unethical impact agent thereafter.
Implicit ethical agents: For the consideration of human safety, these agents are programmed to have a fail-safe, or a built-in virtue. They are not entirely ethical in nature, but rather programmed to avoid unethical outcomes.
Explicit ethical agents: These are machines capable of processing scenarios and acting on ethical decisions; they have algorithms for acting ethically.
Full ethical agents: These are similar to explicit ethical agents in being able to make ethical decisions, but they also have human metaphysical features (i.e., free will, consciousness, and intentionality).
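
The date-comparison fault behind the Goodman agent example can be sketched in a few lines. This is only an illustration of the two-digit-year bug described above, not Moor's own formulation; the function names are hypothetical.

```python
def earlier_two_digit(year_a: int, year_b: int) -> bool:
    """Buggy comparison: keeps only the last two digits of each year."""
    return (year_a % 100) < (year_b % 100)


def earlier_full(year_a: int, year_b: int) -> bool:
    """Correct comparison on full four-digit years."""
    return year_a < year_b


# Within the 20th century both comparisons agree.
before_2000 = earlier_two_digit(1985, 1999)  # True, same as earlier_full

# Across the millennium the buggy version compares 1 with 99,
# so 2001 is wrongly treated as earlier than 1999.
buggy = earlier_two_digit(2001, 1999)   # True (wrong)
correct = earlier_full(2001, 1999)      # False
```

The same code behaves acceptably for one range of inputs and wrongly for another, which is the sense in which the agent's ethical impact changes without any change to its programming.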

(See artificial systems and moral responsibility.)

History

Before the 21st century, the ethics of machines had largely been the subject of science fiction, mainly due to computing and artificial intelligence (AI) limitations. Although the definition of "machine ethics" has evolved since, the term was coined by Mitchell Waldrop in the 1987 AI Magazine article "A Question of Responsibility":

One thing that is apparent from the above discussion is that intelligent machines will embody values, assumptions, and purposes, whether their programmers consciously intend them to or not. Thus, as computers and robots become more and more intelligent, it becomes imperative that we think carefully and explicitly about what those built-in values are. Perhaps what we need is, in fact, a theory and practice of machine ethics, in the spirit of Asimov's three laws of robotics.^[4]

In 2004, Towards Machine Ethics^[5] was presented at the AAAI Workshop on Agent Organizations: Theory and Practice,^[6] where theoretical foundations for machine ethics were laid out.

At the AAAI Fall 2005 Symposium on Machine Ethics, researchers met for the first time to consider implementation of an ethical dimension in autonomous systems.^[7] A variety of perspectives of this nascent field can be found in the collected edition Machine Ethics^[8] that stems from that symposium.

In 2007, AI Magazine published "Machine Ethics: Creating an Ethical Intelligent Agent",^[9] an article that discussed the importance of machine ethics, the need for machines that represent ethical principles explicitly, and challenges facing those working on machine ethics. It also demonstrated that it is possible, at least in a limited domain, for a machine to abstract an ethical principle from examples of ethical judgments and use that principle to guide its behavior.

In 2009, Oxford University Press published Moral Machines: Teaching Robots Right from Wrong,^[10] which it advertised as "the first book to examine the challenge of building artificial moral agents, probing deeply into the nature of human decision making and ethics." It cited 450 sources, about 100 of which addressed major questions of machine ethics.

In 2011, Cambridge University Press published a collection of essays about machine ethics edited by Michael and Susan Leigh Anderson,^[8] who also edited a special issue of IEEE Intelligent Systems on the topic in 2006.^[11] The collection focuses on the challenges of adding ethical principles to machines.^[12]

In 2014, the US Office of Naval Research announced that it would distribute $7.5 million in grants over five years to university researchers to study questions of machine ethics as applied to autonomous robots.^[13] In the same year, Nick Bostrom's Superintelligence: Paths, Dangers, Strategies, which raised machine ethics as the "most important...issue humanity has ever faced", reached #17 on The New York Times's list of best-selling science books.^[14]

In 2016, the European Parliament published a paper^[15] to encourage the Commission to address robots' legal status.^[16] The paper includes sections about robots' legal liability, arguing that their liability should be proportional to their level of autonomy. The paper also discusses how many jobs could be taken by AI robots.^[17]

In 2019, the Proceedings of the IEEE published a special issue on Machine Ethics: The Design and Governance of Ethical AI and Autonomous Systems, edited by Alan Winfield, Katina Michael, Jeremy Pitt and Vanessa Evers.^[18] "The issue includes papers describing implicit ethical agents, where machines are designed to avoid unethical outcomes, as well as explicit ethical agents, or machines that either encode or learn ethics and determine actions based on those ethics".^[19]

Areas of focus

AI control problem

Some scholars, such as Bostrom and AI researcher Stuart Russell, argue that, if AI surpasses humanity in general intelligence and becomes "superintelligent", this new superintelligence could become powerful and difficult to control: just as the mountain gorilla's fate depends on human goodwill, so might humanity's fate depend on a future superintelligence's actions.^[20] In their respective books Superintelligence and Human Compatible, Bostrom and Russell assert that while the future of AI is very uncertain, the risk to humanity is great enough to merit significant action in the present.

This presents the AI control problem: how to build an intelligent agent that will aid its creators without inadvertently building a superintelligence that will harm them. The danger of not designing control right "the first time" is that a superintelligence may be able to seize power over its environment and prevent us from shutting it down. Potential AI control strategies include "capability control" (limiting an AI's ability to influence the world) and "motivational control" (building an AI whose goals are aligned with human or optimal values). A number of organizations are researching the AI control problem, including the Future of Humanity Institute, the Machine Intelligence Research Institute, the Center for Human-Compatible Artificial Intelligence, and the Future of Life Institute.

Algorithms and training

AI paradigms have been debated, especially their efficacy and bias. Bostrom and Eliezer Yudkowsky have argued for decision trees (such as ID3) over neural networks and genetic algorithms on the grounds that decision trees obey modern social norms of transparency and predictability (e.g. stare decisis).^[21] In contrast, Chris Santos-Lang has argued in favor of neural networks and genetic algorithms on the grounds that the norms of any age must be allowed to change and that natural failure to fully satisfy these particular norms has been essential in making humans less vulnerable than machines to criminal hackers.^[22]^[23]
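
To make the transparency argument concrete, the following is a minimal sketch of the information-gain criterion that ID3 uses to choose which attribute to split on. The attribute names, labels and four-row data set are invented for illustration and do not come from any cited work.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, labels, attr):
    """Reduction in label entropy obtained by splitting on `attr`."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attr], []).append(label)
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def best_attribute(rows, labels, attrs):
    """ID3's greedy choice: the attribute with the highest information gain."""
    return max(attrs, key=lambda a: information_gain(rows, labels, a))


# Hypothetical toy data: each row describes a requested action.
rows = [
    {"harm_risk": "high", "consent": "yes"},
    {"harm_risk": "high", "consent": "no"},
    {"harm_risk": "low", "consent": "yes"},
    {"harm_risk": "low", "consent": "no"},
]
labels = ["refuse", "refuse", "allow", "allow"]

root = best_attribute(rows, labels, ["harm_risk", "consent"])  # "harm_risk"
```

Each split can be read off as an explicit rule (here, "if harm_risk is high, refuse"), which is the kind of inspectability the argument for decision trees relies on; a trained neural network offers no comparably direct reading.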

In 2009, in an experiment at the École Polytechnique Fédérale de Lausanne's Laboratory of Intelligent Systems, AI robots were programmed to cooperate with each other and tasked with searching for a beneficial resource while avoiding a poisonous one.^[24] During the experiment, the robots were grouped into clans, and the successful members' digital genetic code was used for the next generation, a type of algorithm known as a genetic algorithm. After 50 successive generations, one clan's members discovered how to distinguish the beneficial resource from the poisonous one. The robots then learned to lie to each other in an attempt to hoard the beneficial resource from other robots.^[24] In the same experiment, the same robots also learned to behave selflessly: they signaled danger to other robots and died to save them.^[22] Machine ethicists have questioned the experiment's implications. In the experiment, the robots' goals were programmed to be "terminal", but human motives typically require never-ending learning.
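
The selection step described above, in which the genetic code of successful members seeds the next generation, can be sketched as a generic genetic algorithm. This is not the Lausanne experiment's code: the bit-string genome, the placeholder fitness function (count of 1 bits) and all parameter values are assumptions made for illustration.

```python
import random


def fitness(genome):
    """Placeholder fitness: number of 1 bits (stands in for foraging success)."""
    return sum(genome)


def evolve(pop_size=20, genome_len=16, generations=50, mutation_rate=0.02, seed=0):
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(genome_len)]
                  for _ in range(pop_size)]
    history = [max(fitness(g) for g in population)]
    for _ in range(generations):
        # Selection: the fitter half survives unchanged and becomes the parents.
        population.sort(key=fitness, reverse=True)
        survivors = population[: pop_size // 2]
        children = []
        while len(survivors) + len(children) < pop_size:
            mother, father = rng.sample(survivors, 2)
            cut = rng.randrange(1, genome_len)          # one-point crossover
            child = mother[:cut] + father[cut:]
            child = [1 - bit if rng.random() < mutation_rate else bit
                     for bit in child]                  # random bit-flip mutation
            children.append(child)
        population = survivors + children
        history.append(max(fitness(g) for g in population))
    return population, history


population, history = evolve()
```

Because survivors are carried over unchanged, the best fitness recorded in `history` never decreases. Nothing in the loop specifies which behaviors count as honest or selfless; whatever raises fitness is retained, which is how both deception and self-sacrifice could emerge from the same procedure.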

Autonomous weapons systems

In 2009, academics and technical experts attended a conference to discuss the potential impact of robots and computers, and of the possibility that they could become self-sufficient and able to make their own decisions. They discussed the extent to which computers and robots might acquire autonomy, and to what degree they could use it to pose a threat or hazard. They noted that some machines have acquired various forms of semi-autonomy, including the ability to find power sources on their own and to independently choose targets to attack with weapons. They also noted that some computer viruses can evade elimination and have achieved "cockroach intelligence". They noted that self-awareness as depicted in science fiction is unlikely, but that there are other potential hazards and pitfalls.^[25]

Some experts and academics have questioned the use of robots in military combat, especially robots with a degree of autonomy.^[26] The U.S. Navy funded a report that indicates that as military robots become more complex, we should pay greater attention to the implications of their ability to make autonomous decisions.^[27]^[28] The president of the Association for the Advancement of Artificial Intelligence has commissioned a study of this issue.^[29]

Integration of artificial general intelligences with society

Preliminary work has been conducted on methods of integrating artificial general intelligences (full ethical agents as defined above) with existing legal and social frameworks. Approaches have focused on their legal position and rights.^[30]

Machine learning bias

Big data and machine learning algorithms have become popular in numerous industries, including online advertising, credit ratings, and criminal sentencing, with the promise of providing more objective, data-driven results, but have been identified as a potential way to perpetuate social inequalities and discrimination.^[31]^[32] A 2015 study found that women were less likely than men to be shown high-income job ads by Google's AdSense. Another study found that Amazon's same-day delivery service was intentionally made unavailable in black neighborhoods. Both Google and Amazon were unable to isolate these outcomes to a single issue, and said the outcomes were the result of the black box algorithms they use.^[31]

The U.S. judicial system has begun using quantitative risk assessment software when making decisions related to releasing people on bail and sentencing in an effort to be fairer and reduce the imprisonment rate. These tools analyze a defendant's criminal history, among other attributes. In a study of 7,000 people arrested in Broward County, Florida, only 20% of people predicted to commit a violent crime using the county's risk assessment scoring system proceeded to commit one.^[32] A 2016 ProPublica report analyzed recidivism risk scores calculated by one of the most commonly used tools, the Northpointe COMPAS system, and looked at outcomes over two years. The report found that only 61% of those deemed high-risk committed additional crimes during that period. The report also flagged that African-American defendants were far more likely to be given high-risk scores than their white counterparts.^[32] It has been argued that such pretrial risk assessments violate Equal Protection rights on the basis of race, due to factors including possible discriminatory intent by the algorithm itself, under a theory of partial legal capacity for artificial intelligences.^[33]
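
Figures such as "20%" and "61%" above are the share of flagged people who went on to reoffend, often called precision. A minimal sketch of how that share, and the false positive rate, might be compared across groups follows. The records are invented toy data, not the Broward County data, and the choice of these two metrics is an assumption made for illustration rather than the method of the cited report.

```python
from collections import defaultdict


def rates_by_group(records):
    """records: iterable of (group, predicted_high_risk, reoffended) tuples."""
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "tn": 0, "fn": 0})
    for group, predicted, actual in records:
        # "t"/"f": was the prediction right?  "p"/"n": what was predicted?
        key = ("t" if predicted == actual else "f") + ("p" if predicted else "n")
        counts[group][key] += 1
    result = {}
    for group, c in counts.items():
        flagged = c["tp"] + c["fp"]        # everyone predicted high-risk
        negatives = c["fp"] + c["tn"]      # everyone who did not reoffend
        result[group] = {
            "precision": c["tp"] / flagged if flagged else None,
            "false_positive_rate": c["fp"] / negatives if negatives else None,
        }
    return result


# Hypothetical toy records, for illustration only.
records = [
    ("A", True, True), ("A", True, False), ("A", False, False), ("A", False, False),
    ("B", True, True), ("B", True, False), ("B", True, False), ("B", False, False),
]
rates = rates_by_group(records)
```

In this toy data group B has a false positive rate of 2/3 against 1/3 for group A, so people in group B who did not reoffend were twice as likely to have been flagged. Which metric should be equalized across groups is itself a contested question.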

In 2016, the Obama administration's Big Data Working Group—an overseer of various big-data regulatory frameworks—released reports warning of "the potential of encoding discrimination in automated decisions" and calling for "equal opportunity by design" for applications such as credit scoring.^[34]^[35] The reports encourage discourse among policy-makers, citizens, and academics alike, but recognize that no solution yet exists for the encoding of bias and discrimination into algorithmic systems.

Ethical frameworks and practices

Practices

In March 2018, in an effort to address rising concerns over machine learning's impact on human rights, the World Economic Forum and Global Future Council on Human Rights published a white paper with detailed recommendations on how best to prevent discriminatory outcomes in machine learning.^[36] The World Economic Forum developed four recommendations based on the UN Guiding Principles of Human Rights to help address and prevent discriminatory outcomes in machine learning:^[36]

Active inclusion: Development and design of machine learning applications must actively seek a diversity of input, especially of the norms and values of populations affected by the output of AI systems.
Fairness: People involved in conceptualizing, developing, and implementing machine learning systems should consider which definition of fairness best applies to their context and application, and prioritize it in the machine learning system's architecture and evaluation metrics.
Right to understanding: Involvement of machine learning systems in decision-making that affects individual rights must be disclosed, and the systems must be able to explain their decision-making in a way that is understandable to end users and reviewable by a competent human authority. Where this is impossible and rights are at stake, leaders in the design, deployment, and regulation of machine learning technology must question whether it should be used.
Access to redress: Leaders, designers, and developers of machine learning systems are responsible for identifying the potential negative human rights impacts of their systems. They must make visible avenues for redress for those affected by disparate impacts, and establish processes for the timely redress of any discriminatory outputs.

In January 2020, Harvard University's Berkman Klein Center for Internet and Society published a meta-study of 36 prominent sets of principles for AI, identifying eight key themes: privacy, accountability, safety and security, transparency and explainability, fairness and non-discrimination, human control of technology, professional responsibility, and promotion of human values.^[37] Researchers at the Swiss Federal Institute of Technology in Zurich conducted a similar meta-study in 2019.^[38]

Approaches

There have been several attempts to make ethics computable, or at least formal. Isaac Asimov's Three Laws of Robotics are not usually considered suitable for an artificial moral agent,^[39] but whether Kant's categorical imperative can be used has been studied.^[40] It has been pointed out that human value is, in some aspects, very complex.^[41] A way to explicitly surmount this difficulty is to receive human values directly from people through some mechanism, for example by learning them.^[42]^[43]^[44]

Another approach is to base current ethical considerations on previous similar situations. This is called casuistry, and could be implemented through research on the Internet. The consensus from a million past decisions would lead to a new decision that is democracy-dependent.^[9] Bruce M. McLaren built an early (mid-1990s) computational model of casuistry, a program called SIROCCO built with AI and case-based reasoning techniques that retrieves and analyzes ethical dilemmas.^[45] But this approach could lead to decisions that reflect society's biases and unethical behavior. The negative effects of this approach can be seen in Microsoft's Tay, a chatterbot that learned to repeat racist and sexually charged tweets.^[46]
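
A rough sketch of deciding by precedent, in the spirit of casuistry, is to retrieve the most similar past cases and adopt their majority verdict. This is not how SIROCCO works; the feature sets, verdict labels, Jaccard similarity measure and value of `k` are all assumptions made for illustration.

```python
from collections import Counter


def similarity(a, b):
    """Jaccard overlap between two sets of case features."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0


def decide_by_precedent(new_case, past_cases, k=3):
    """Return the majority verdict among the k most similar past cases."""
    ranked = sorted(past_cases,
                    key=lambda case: similarity(new_case, case[0]),
                    reverse=True)
    votes = Counter(verdict for _, verdict in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical case base: (features, verdict) pairs.
past_cases = [
    ({"deception", "financial_gain"}, "impermissible"),
    ({"deception", "protects_life"}, "permissible"),
    ({"disclosure", "consent"}, "permissible"),
    ({"deception", "financial_gain", "vulnerable_party"}, "impermissible"),
    ({"deception", "no_consent"}, "impermissible"),
]

verdict = decide_by_precedent({"deception", "financial_gain", "no_consent"},
                              past_cases)  # "impermissible"
```

The output is only as sound as the recorded verdicts: if the case base encodes biased or unethical judgments, the same procedure reproduces them, which is the weakness the Tay example illustrates.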

One thought experiment focuses on a Genie Golem with unlimited powers presenting itself to the reader. This Genie declares that it will return in 50 years and demands that it be provided with a definite set of morals it will then immediately act upon. This experiment's purpose is to spark discourse over how best to handle defining sets of ethics that computers may understand.^[47]

Some recent work attempts to reconstruct AI morality and control more broadly as a problem of mutual contestation between AI as a Foucauldian subjectivity on the one hand and humans or institutions on the other hand, all within a disciplinary apparatus. Certain desiderata need to be fulfilled: embodied self-care, embodied intentionality, imagination and reflexivity, which together would condition AI's emergence as an ethical subject capable of self-conduct.^[48]

In fiction

In science fiction, movies and novels have played with the idea of sentient robots and machines.

Neill Blomkamp's Chappie (2015) enacts a scenario of being able to transfer one's consciousness into a computer.^[49] Alex Garland's 2014 film Ex Machina follows an android with artificial intelligence undergoing a variation of the Turing Test, a test administered to a machine to see whether its behavior can be distinguished from that of a human. Films such as The Terminator (1984) and The Matrix (1999) incorporate the concept of machines turning on their human masters.

Asimov considered the issue in the 1950s in I, Robot. At the insistence of his editor John W. Campbell Jr., he proposed the Three Laws of Robotics to govern artificially intelligent systems. Much of his work was then spent testing his three laws' boundaries to see where they break down or create paradoxical or unanticipated behavior. His work suggests that no set of fixed laws can sufficiently anticipate all possible circumstances.^[50] Philip K. Dick's 1968 novel Do Androids Dream of Electric Sheep? explores what it means to be human. In his post-apocalyptic scenario, he questions whether empathy is an entirely human characteristic. The book is the basis for the 1982 science-fiction film Blade Runner.

Notes

1. Moor, J.H. (2006). "The Nature, Importance, and Difficulty of Machine Ethics". IEEE Intelligent Systems. 21 (4): 18–21. doi:10.1109/MIS.2006.80. S2CID 831873.
2. Boyles, Robert James. "A Case for Machine Ethics in Modeling Human-Level Intelligent Agents" (PDF). Kritike. Retrieved 1 November 2019.
3. Moor, James H. (2009). "Four Kinds of Ethical Robots". Philosophy Now.
4. Waldrop, Mitchell (Spring 1987). "A Question of Responsibility". AI Magazine. 8 (1): 28–39. doi:10.1609/aimag.v8i1.572.
5. Anderson, M.; Anderson, S.; Armen, C. (2004). "Towards Machine Ethics". Proceedings of the AAAI Workshop on Agent Organization: Theory and Practice. AAAI Press.
6. AAAI Workshop on Agent Organization: Theory and Practice. AAAI Press.
7. "Papers from the 2005 AAAI Fall Symposium". Archived from the original on 2014-11-29.
8. Anderson, Michael; Anderson, Susan Leigh, eds. (July 2011). Machine Ethics. Cambridge University Press. ISBN 978-0-521-11235-2.
9. Anderson, M.; Anderson, S. (2007). "Machine Ethics: Creating an Ethical Intelligent Agent". AI Magazine. 28 (4).
10. Wallach, Wendell; Allen, Colin (2009). Moral Machines: Teaching Robots Right from Wrong. Oxford University Press. ISBN 9780195374049.
11. Anderson, Michael; Anderson, Susan Leigh, eds. (July–August 2006). "Special Issue on Machine Ethics". IEEE Intelligent Systems. 21 (4): 10–63. doi:10.1109/mis.2006.70. ISSN 1541-1672. S2CID 9570832. Archived from the original on 2011-11-26.
12. Siler, Cory (2015). "Review of Anderson and Anderson's Machine Ethics". Artificial Intelligence. 229: 200–201. doi:10.1016/j.artint.2015.08.013. S2CID 5613776.
13. Tucker, Patrick (13 May 2014). "Now The Military Is Going To Build Robots That Have Morals". Defense One. Retrieved 9 July 2014.
14. "Best Selling Science Books". New York Times. September 8, 2014. Retrieved 9 November 2014.
15. "European Parliament, Committee on Legal Affairs. Draft Report with recommendations to the Commission on Civil Law Rules on Robotics". European Commission. Retrieved January 12, 2017.
16. Wakefield, Jane (2017-01-12). "MEPs vote on robots' legal status – and if a kill switch is required". BBC News. Retrieved 12 January 2017.
17. "European Parliament resolution of 16 February 2017 with recommendations to the Commission on Civil Law Rules on Robotics". European Parliament. Retrieved 8 November 2019.
18. Winfield, Alan; Michael, Katina; Pitt, Jeremy; Evers, Vanessa (March 2019). "Machine Ethics: The Design and Governance of Ethical AI and Autonomous Systems". Proceedings of the IEEE. 107 (3): 501–615. doi:10.1109/JPROC.2019.2898289.
19. "Proceedings of the IEEE Addresses Machine Ethics". IEEE Standards Association. 30 August 2019. Archived from the original on December 4, 2022.
20. Bostrom, Nick (2014). Superintelligence: Paths, Dangers, Strategies (First ed.). Oxford University Press. ISBN 978-0199678112.
21. Bostrom, Nick; Yudkowsky, Eliezer (2011). "The Ethics of Artificial Intelligence" (PDF). Cambridge Handbook of Artificial Intelligence. Cambridge Press. Archived from the original (PDF) on 2016-03-04. Retrieved 2011-06-28.
22. Santos-Lang, Chris (2002). "Ethics for Artificial Intelligences". Archived from the original on 2011-12-03.
23. Santos-Lang, Christopher (2014). "Moral Ecology Approaches to Machine Ethics" (PDF). In van Rysewyk, Simon; Pontier, Matthijs (eds.). Machine Medical Ethics. Intelligent Systems, Control and Automation: Science and Engineering. Vol. 74. Switzerland: Springer. pp. 111–127. doi:10.1007/978-3-319-08108-3_8. ISBN 978-3-319-08107-6.
24. Fox, Stuart (August 18, 2009). "Evolving Robots Learn To Lie To Each Other". Popular Science.
25. Markoff, John (July 25, 2009). "Scientists Worry Machines May Outsmart Man". New York Times.
26. Palmer, Jason (3 August 2009). "Call for debate on killer robots". BBC News.
27. Mick, Jason (February 17, 2009). "New Navy-funded Report Warns of War Robots Going 'Terminator'". dailytech.com (Blog). Archived 2009-07-28 at the Wayback Machine.
28. Flatley, Joseph L. (February 18, 2009). "Navy report warns of robot uprising, suggests a strong moral compass". Engadget.
29. AAAI Presidential Panel on Long-Term AI Futures 2008–2009 Study, Association for the Advancement of Artificial Intelligence. Accessed 7/26/09.
30. Sotala, Kaj; Yampolskiy, Roman V (2014-12-19). "Responses to catastrophic AGI risk: a survey". Physica Scripta. 90 (1): 8. doi:10.1088/0031-8949/90/1/018001. ISSN 0031-8949.
31. Crawford, Kate (25 June 2016). "Artificial Intelligence's White Guy Problem". The New York Times.
32. Angwin, Julia; Mattu, Surya; Larson, Jeff; Kirchner, Lauren (23 May 2016). "Machine Bias: There's Software Used Across the Country to Predict Future Criminals. And it's Biased Against Blacks". ProPublica.
33. Thomas, C.; Nunez, A. (2022). "Automating Judicial Discretion: How Algorithmic Risk Assessments in Pretrial Adjudications Violate Equal Protection Rights on the Basis of Race". Law & Inequality. 40 (2): 371–407. doi:10.24926/25730037.649.
34. Executive Office of the President (May 2016). "Big Data: A Report on Algorithmic Systems, Opportunity, and Civil Rights" (PDF). Obama White House.
35. "Big Risks, Big Opportunities: the Intersection of Big Data and Civil Rights". Obama White House. 4 May 2016.
36. "How to Prevent Discriminatory Outcomes in Machine Learning". World Economic Forum. 12 March 2018. Retrieved 2018-12-11.
37. Fjeld, Jessica; Achten, Nele; Hilligoss, Hannah; Nagy, Adam; Srikumar, Madhulika (2020). "Principled Artificial Intelligence: Mapping Consensus in Ethical and Rights-Based Approaches to Principles for AI". SSRN Working Paper Series. doi:10.2139/ssrn.3518482. ISSN 1556-5068. S2CID 214464355.
38. Jobin, Anna; Ienca, Marcello; Vayena, Effy (2019). "The global landscape of AI ethics guidelines". Nature Machine Intelligence. 1 (9): 389–399. arXiv:1906.11668. doi:10.1038/s42256-019-0088-2. ISSN 2522-5839. S2CID 201827642.
39. Anderson, Susan Leigh (2011). "The Unacceptability of Asimov's Three Laws of Robotics as a Basis for Machine Ethics". In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 285–296. ISBN 9780511978036.
40. Powers, Thomas M. (2011). "Prospects for a Kantian Machine". In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 464–475.
41. Muehlhauser, Luke; Helm, Louie (2012). "Intelligence Explosion and Machine Ethics".
42. Yudkowsky, Eliezer (2004). "Coherent Extrapolated Volition".
43. Guarini, Marcello (2011). "Computational Neural Modeling and the Philosophy of Ethics: Reflections on the Particularism-Generalism Debate". In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 316–334.
44. Hibbard, Bill (2014). "Ethical Artificial Intelligence". arXiv:1411.1373 [cs.AI].
45. McLaren, Bruce M. (2003). "Extensionally defining principles and cases in ethics: An AI model". Artificial Intelligence. 150 (1–2): 145–181. doi:10.1016/S0004-3702(03)00135-8. S2CID 11588399.
46. Wakefield, Jane (24 March 2016). "Microsoft chatbot is taught to swear on Twitter". BBC News. Retrieved 2016-04-17.
47. Nazaretyan, A. (2014). "A. H. Eden, J. H. Moor, J. H. Søraker and E. Steinhart (eds): Singularity Hypotheses: A Scientific and Philosophical Assessment". Minds & Machines. 24 (2): 245–248.
48. D’Amato, Kristian (2024-04-09). "ChatGPT: towards AI subjectivity". AI & Society. doi:10.1007/s00146-024-01898-z. ISSN 0951-5666.
49. Brundage, Miles; Winterton, Jamie (17 March 2015). "Chappie and the Future of Moral Machines". Slate. Retrieved 30 October 2019.
50. Asimov, Isaac (2008). I, Robot. New York: Bantam. ISBN 978-0-553-38256-3.


Instrumental convergence is the hypothetical tendency for most sufficiently intelligent, goal-directed beings to pursue similar sub-goals, even if their ultimate goals are quite different. More precisely, agents may pursue instrumental goals (goals adopted in pursuit of some particular end, but not the end goals themselves) without ceasing, provided that their ultimate (intrinsic) goals may never be fully satisfied.

Existential risk from artificial intelligence refers to the idea that substantial progress in artificial general intelligence (AGI) could lead to human extinction or an irreversible global catastrophe.

In the field of artificial intelligence (AI), AI alignment aims to steer AI systems toward a person's or group's intended goals, preferences, and ethical principles. An AI system is considered aligned if it advances the intended objectives. A misaligned AI system pursues unintended objectives.

Some scholars believe that advances in artificial intelligence, or AI, will eventually lead to a semi-apocalyptic post-scarcity and post-work economy where intelligent machines can outperform humans in almost every, if not every, domain. The questions of what such a world might look like, and whether specific scenarios constitute utopias or dystopias, are the subject of active debate.

Automated decision-making (ADM) involves the use of data, machines and algorithms to make decisions in a range of contexts, including public administration, business, health, education, law, employment, transport, media and entertainment, with varying degrees of human oversight or intervention. ADM involves large-scale data from a range of sources, such as databases, text, social media, sensors, images or speech, that is processed using various technologies including computer software, algorithms, machine learning, natural language processing, artificial intelligence, augmented intelligence and robotics. The increasing use of automated decision-making systems (ADMS) across a range of contexts presents many benefits and challenges to human society requiring consideration of the technical, legal, ethical, societal, educational, economic and health consequences.

References

Wallach, Wendell; Allen, Colin (November 2008). Moral Machines: Teaching Robots Right from Wrong. US: Oxford University Press.
Anderson, Michael; Anderson, Susan Leigh, eds. (July 2011). Machine Ethics. Cambridge University Press.
Storrs Hall, J. (May 30, 2007). Beyond AI: Creating the Conscience of the Machine. Prometheus Books.
Moor, J. (2006). The Nature, Importance, and Difficulty of Machine Ethics. IEEE Intelligent Systems, 21(4), pp. 18–21.
Anderson, M. and Anderson, S. (2007). Creating an Ethical Intelligent Agent. AI Magazine, Volume 28(4).

External links

Machine Ethics, Interdisciplinary project on machine ethics.
The Machine Ethics Podcast, Podcast discussing Machine Ethics, AI and Tech ethics.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] Moor, J.H. (2006). "The Nature, Importance, and Difficulty of Machine Ethics". IEEE Intelligent Systems. 21 (4): 18–21. doi:10.1109/MIS.2006.80. S2CID 831873.

[2] Boyles, Robert James. "A Case for Machine Ethics in Modeling Human-Level Intelligent Agents" (PDF). Kritike. Retrieved 1 November 2019.

[3] Moor, James H. (2009). "Four Kinds of Ethical Robots". Philosophy Now.

[Waldrop1987-4] Waldrop, Mitchell (Spring 1987). "A Question of Responsibility". AI Magazine. 8 (1): 28–39. doi:10.1609/aimag.v8i1.572.

[5] Anderson, M., Anderson, S., and Armen, C. (2004) "Towards Machine Ethics" in Proceedings of the AAAI Workshop on Agent Organization: Theory and Practice, AAAI Press

[6] AAAI Workshop on Agent Organization: Theory and Practice, AAAI Press

[7] "Papers from the 2005 AAAI Fall Symposium". Archived from the original on 2014-11-29.

[Anderson2011-8] Anderson, Michael; Anderson, Susan Leigh, eds. (July 2011). Machine Ethics. Cambridge University Press. ISBN 978-0-521-11235-2.

[anderson-9] Anderson, M. and Anderson, S. (2007). Creating an Ethical Intelligent Agent. AI Magazine, Volume 28(4).

[Wallach&Allen2009-10] Wallach, Wendell; Allen, Colin (2009). Moral Machines: Teaching Robots Right from Wrong. Oxford University Press. ISBN 9780195374049.

[Anderson2006-11] Anderson, Michael; Anderson, Susan Leigh, eds. (July–August 2006). "Special Issue on Machine Ethics". IEEE Intelligent Systems. 21 (4): 10–63. doi:10.1109/mis.2006.70. ISSN 1541-1672. S2CID 9570832. Archived from the original on 2011-11-26.

[12] Siler, Cory (2015). "Review of Anderson and Anderson's Machine Ethics". Artificial Intelligence. 229: 200–201. doi:10.1016/j.artint.2015.08.013. S2CID 5613776.

[Tucker2014-13] Tucker, Patrick (13 May 2014). "Now The Military Is Going To Build Robots That Have Morals". Defense One. Retrieved 9 July 2014.

[14] "Best Selling Science Books". New York Times. September 8, 2014. Retrieved 9 November 2014.

[15] "European Parliament, Committee on Legal Affairs. Draft Report with recommendations to the Commission on Civil Law Rules on Robotics". European Commission. Retrieved January 12, 2017.

[16] Wakefield, Jane (2017-01-12). "MEPs vote on robots' legal status – and if a kill switch is required". BBC News. Retrieved 12 January 2017.

[17] "European Parliament resolution of 16 February 2017 with recommendations to the Commission on Civil Law Rules on Robotics". European Parliament. Retrieved 8 November 2019.

[18] Alan Winfield; Katina Michael; Jeremy Pitt; Vanessa Evers (March 2019). "Machine Ethics: The Design and Governance of Ethical AI and Autonomous Systems". Proceedings of the IEEE. 107 (3): 501–615. doi:10.1109/JPROC.2019.2898289.

[19] "Proceedings of the IEEE Addresses Machine Ethics". IEEE Standards Association. 30 August 2019. Archived from the original on December 4, 2022.

[superintelligence-20] Bostrom, Nick (2014). Superintelligence: Paths, Dangers, Strategies (First ed.). Oxford University Press. ISBN 978-0199678112.

[21] Bostrom, Nick; Yudkowsky, Eliezer (2011). "The Ethics of Artificial Intelligence" (PDF). Cambridge Handbook of Artificial Intelligence. Cambridge Press. Archived from the original (PDF) on 2016-03-04. Retrieved 2011-06-28.

[SantosLang2002-22] Santos-Lang, Chris (2002). "Ethics for Artificial Intelligences". Archived from the original on 2011-12-03.

[SantosLang2014-23] Santos-Lang, Christopher (2014). "Moral Ecology Approaches to Machine Ethics" (PDF). In van Rysewyk, Simon; Pontier, Matthijs (eds.). Machine Medical Ethics. Intelligent Systems, Control and Automation: Science and Engineering. Vol. 74. Switzerland: Springer. pp. 111–127. doi:10.1007/978-3-319-08108-3_8. ISBN 978-3-319-08107-6.

[PopSci_2009-08-18-24] Fox, Stuart (August 18, 2009). "Evolving Robots Learn To Lie To Each Other". Popular Science.

[nytimes_july09-25] Markoff, John (July 25, 2009). "Scientists Worry Machines May Outsmart Man". New York Times.

[26] Palmer, Jason (3 August 2009). "Call for debate on killer robots". BBC News.

[27] Mick, Jason (February 17, 2009). "New Navy-funded Report Warns of War Robots Going 'Terminator'". DailyTech (blog). Archived from the original on 2009-07-28.

[28] Flatley, Joseph L. (February 18, 2009). "Navy report warns of robot uprising, suggests a strong moral compass". Engadget.

[29] AAAI Presidential Panel on Long-Term AI Futures 2008–2009 Study, Association for the Advancement of Artificial Intelligence, Accessed 7/26/09.

[30] Sotala, Kaj; Yampolskiy, Roman V (2014-12-19). "Responses to catastrophic AGI risk: a survey". Physica Scripta. 90 (1): 8. doi:10.1088/0031-8949/90/1/018001. ISSN 0031-8949.

[NYT2016-31] Crawford, Kate (25 June 2016). "Artificial Intelligence's White Guy Problem". The New York Times.

[ProPublica2016-32] Julia Angwin; Surya Mattu; Jeff Larson; Lauren Kirchner (23 May 2016). "Machine Bias: There's Software Used Across the Country to Predict Future Criminals. And it's Biased Against Blacks". ProPublica.

[33] Thomas, C.; Nunez, A. (2022). "Automating Judicial Discretion: How Algorithmic Risk Assessments in Pretrial Adjudications Violate Equal Protection Rights on the Basis of Race". Law & Inequality. 40 (2): 371–407. doi:10.24926/25730037.649.

[34] Executive Office of the President (May 2016). "Big Data: A Report on Algorithmic Systems, Opportunity, and Civil Rights" (PDF). Obama White House.

[35] "Big Risks, Big Opportunities: the Intersection of Big Data and Civil Rights". Obama White House. 4 May 2016.

[:0-36] "How to Prevent Discriminatory Outcomes in Machine Learning". World Economic Forum. 12 March 2018. Retrieved 2018-12-11.

[37] Fjeld, Jessica; Achten, Nele; Hilligoss, Hannah; Nagy, Adam; Srikumar, Madhulika (2020). "Principled Artificial Intelligence: Mapping Consensus in Ethical and Rights-Based Approaches to Principles for AI". SSRN Working Paper Series. doi:10.2139/ssrn.3518482. ISSN 1556-5068. S2CID 214464355.

[38] Jobin, Anna; Ienca, Marcello; Vayena, Effy (2019). "The global landscape of AI ethics guidelines". Nature Machine Intelligence. 1 (9): 389–399. arXiv:1906.11668. doi:10.1038/s42256-019-0088-2. ISSN 2522-5839. S2CID 201827642.

[39] Anderson, Susan Leigh (2011): The Unacceptability of Asimov's Three Laws of Robotics as a Basis for Machine Ethics. In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 285–296. ISBN 9780511978036.

[40] Powers, Thomas M. (2011): Prospects for a Kantian Machine. In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 464–475.

[41] Muehlhauser, Luke, Helm, Louie (2012): Intelligence Explosion and Machine Ethics.

[42] Yudkowsky, Eliezer (2004): Coherent Extrapolated Volition.

[43] Guarini, Marcello (2011): Computational Neural Modeling and the Philosophy of Ethics. Reflections on the Particularism-Generalism Debate. In: Machine Ethics, ed. Michael Anderson, Susan Leigh Anderson. New York: Cambridge University Press. pp. 316–334.

[44] Hibbard, Bill (2014). "Ethical Artificial Intelligence". arXiv:1411.1373 [cs.AI].

[45] McLaren, Bruce M. (2003). "Extensionally defining principles and cases in ethics: An AI model". Artificial Intelligence. 150 (1–2): 145–181. doi:10.1016/S0004-3702(03)00135-8. S2CID 11588399.

[46] Wakefield, Jane (24 March 2016). "Microsoft chatbot is taught to swear on Twitter". BBC News. Retrieved 2016-04-17.

[47] Nazaretyan, A. (2014). A. H. Eden, J. H. Moor, J. H. Søraker and E. Steinhart (eds): Singularity Hypotheses: A Scientific and Philosophical Assessment. Minds & Machines, 24(2), pp.245–248.

[48] D’Amato, Kristian (2024-04-09). "ChatGPT: towards AI subjectivity". AI & Society. doi:10.1007/s00146-024-01898-z. ISSN 0951-5666.

[49] Brundage, Miles; Winterton, Jamie (17 March 2015). "Chappie and the Future of Moral Machines". Slate. Retrieved 30 October 2019.

[Asimov2008-50] Asimov, Isaac (2008). I, robot. New York: Bantam. ISBN 978-0-553-38256-3.
