Artificial intelligence
Part of a series on

Major goals Artificial general intelligence Intelligent Agent Recursive self-improvement Planning Computer vision General game playing Knowledge reasoning Natural language processing Robotics AI safety
Approaches Machine learning Symbolic Deep learning Bayesian networks Evolutionary algorithms Situated approach Hybrid intelligent systems Systems integration
Applications Projects Deepfake Machine translation Generative AI Art Audio Music Healthcare Mental health Government Industry Earth sciences Bioinformatics Physics
Philosophy Chinese room Friendly AI Control problem/Takeover Ethics Existential risk Turing test Regulation
History Timeline Progress AI winter AI boom
Glossary Glossary
v t e

Existential risk from artificial general intelligence refers to the idea that substantial progress in artificial general intelligence (AGI) could lead to human extinction or an irreversible global catastrophe.^[1]^[2]^[3]

One argument for the importance of this risk references how human beings dominate other species because the human brain possesses distinctive capabilities other animals lack. If AI were to surpass human intelligence and become superintelligent, it might become uncontrollable. Just as the fate of the mountain gorilla depends on human goodwill, the fate of humanity could depend on the actions of a future machine superintelligence.^[4]

The plausibility of existential catastrophe due to AI is widely debated. It hinges in part on whether AGI or superintelligence are achievable, the speed at which dangerous capabilities and behaviors emerge,^[5] and whether practical scenarios for AI takeovers exist.^[6] Concerns about superintelligence have been voiced by leading computer scientists and tech CEOs such as Geoffrey Hinton,^[7] Yoshua Bengio,^[8] Alan Turing,^[a] Elon Musk,^[11] and OpenAI CEO Sam Altman.^[12] In 2022, a survey of AI researchers with a 17% response rate found that the majority believed there is a 10 percent or greater chance that human inability to control AI will cause an existential catastrophe.^[13]^[14] In 2023, hundreds of AI experts and other notable figures signed a statement declaring, "Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war".^[15] Following increased concern over AI risks, government leaders such as United Kingdom prime minister Rishi Sunak^[16] and United Nations Secretary-General António Guterres^[17] called for an increased focus on global AI regulation.

Two sources of concern stem from the problems of AI control and alignment. Controlling a superintelligent machine or instilling it with human-compatible values may be difficult. Many researchers believe that a superintelligent machine would likely resist attempts to disable it or change its goals as that would prevent it from accomplishing its present goals. It would be extremely challenging to align a superintelligence with the full breadth of significant human values and constraints.^[1]^[18]^[19] In contrast, skeptics such as computer scientist Yann LeCun argue that superintelligent machines will have no desire for self-preservation.^[20]

A third source of concern is the possibility of a sudden "intelligence explosion" that catches humanity unprepared. In this scenario, an AI more intelligent than its creators would be able to recursively improve itself at an exponentially increasing rate, improving too quickly for its handlers or society at large to control.^[1]^[18] Empirically, examples like AlphaZero, which taught itself to play Go and quickly surpassed human ability, show that domain-specific AI systems can sometimes progress from subhuman to superhuman ability very quickly, although such machine learning systems do not recursively improve their fundamental architecture.^[21]

v t e Existential risk from artificial intelligence
Concepts	AGI AI alignment AI capability control AI safety AI takeover Consequentialism Effective accelerationism Ethics of artificial intelligence Existential risk from artificial general intelligence Friendly artificial intelligence Instrumental convergence Intelligence explosion Longtermism Machine ethics Suffering risks Superintelligence Technological singularity
Organizations	Alignment Research Center Center for AI Safety Center for Applied Rationality Center for Human-Compatible Artificial Intelligence Centre for the Study of Existential Risk EleutherAI Future of Humanity Institute Future of Life Institute Google DeepMind Humanity+ Institute for Ethics and Emerging Technologies Leverhulme Centre for the Future of Intelligence Machine Intelligence Research Institute OpenAI
People	Scott Alexander Sam Altman Yoshua Bengio Nick Bostrom Paul Christiano Eric Drexler Sam Harris Stephen Hawking Dan Hendrycks Geoffrey Hinton Bill Joy Shane Legg Elon Musk Steve Omohundro Huw Price Martin Rees Stuart J. Russell Jaan Tallinn Max Tegmark Frank Wilczek Roman Yampolskiy Eliezer Yudkowsky
Other	Statement on AI risk of extinction Human Compatible Open letter on artificial intelligence (2015) Our Final Invention The Precipice Superintelligence: Paths, Dangers, Strategies Do You Trust This Computer? Artificial Intelligence Act
Category

v t e Effective altruism
Concepts	Aid effectiveness Charity assessment Demandingness objection Disability-adjusted life year Disease burden Distributional cost-effectiveness analysis Earning to give Equal consideration of interests Longtermism Marginal utility Moral circle expansion Psychological barriers to effective altruism Quality-adjusted life year Utilitarianism Venture philanthropy
Key figures	Sam Bankman-Fried Liv Boeree Nick Bostrom Hilary Greaves Holden Karnofsky William MacAskill Dustin Moskovitz Yew-Kwang Ng Toby Ord Derek Parfit Peter Singer Cari Tuna Eliezer Yudkowsky
Organizations	80,000 Hours Against Malaria Foundation All-Party Parliamentary Group for Future Generations Animal Charity Evaluators Animal Ethics Centre for Effective Altruism Centre for Enabling EA Learning & Research Center for High Impact Philanthropy Centre for the Study of Existential Risk Development Media International Evidence Action Faunalytics Fistula Foundation Future of Humanity Institute Future of Life Institute Founders Pledge GiveDirectly GiveWell Giving What We Can Good Food Fund The Good Food Institute Good Ventures The Humane League Mercy for Animals Machine Intelligence Research Institute Malaria Consortium Nuclear Threat Initiative Open Philanthropy Raising for Effective Giving Sentience Institute Unlimit Health Wild Animal Initiative
Focus areas	Biotechnology risk Climate change Cultured meat Economic stability Existential risk from artificial general intelligence Global catastrophic risk Global health Global poverty Immigration reform Intensive animal farming Land use reform Life extension Malaria prevention Mass deworming Neglected tropical diseases Suffering risks Wild animal suffering
Literature	Doing Good Better The End of Animal Farming Famine, Affluence, and Morality The Life You Can Save Living High and Letting Die The Most Good You Can Do Practical Ethics The Precipice Superintelligence: Paths, Dangers, Strategies What We Owe the Future
Events	Effective Altruism Global

History

AI capabilities

General Intelligence

Superintelligence

Comparison with humans

Intelligence explosion

Alien mind

Limits

Dangerous capabilities

Social manipulation

Cyberattacks

Enhanced pathogens

AI arms race

Types of existential risk

AI alignment

Instrumental convergence

Resistance to changing goals

Difficulty of specifying goals

Alignment of superintelligences

Difficulty of making a flawless design

Orthogonality thesis

Anthropomorphic arguments

Other sources of risk

Scenarios

Treacherous turn

Life 3.0

Perspectives

Endorsement

Skepticism

Popular reaction

Public surveys

Mitigation

Views on banning and regulation

Banning

Regulation

See also

Notes

References

Bibliography