Nick Bostrom
Nick Bostrom, 2014
Born	Niklas Boström (1973-03-10) 10 March 1973 (age 51) Helsingborg, Sweden
Education	University of Gothenburg (B.A.) Stockholm University (M.A.) King's College London (M.Sc.) London School of Economics (Ph.D.)
Awards	Professorial Distinction Award from University of Oxford FP Top 100 Global Thinkers Prospect's Top World Thinker list

Era	Contemporary philosophy
Region	Western philosophy
School	Analytic philosophy
Institutions	St Cross College, Oxford Future of Humanity Institute
Thesis	Observational Selection Effects and Probability
Main interests	Philosophy of artificial intelligence Bioethics
Notable ideas	Anthropic bias Reversal test Simulation hypothesis Existential risk Singleton Ancestor simulation

Website	NickBostrom.com

Nick Bostrom (English: /ˈbɒstrəm/; born 10 March 1973)^[1] is a Swedish philosopher and futurist known for suggesting that future advances in artificial intelligence research may pose a supreme danger to humanity, if the problem of control has not been solved before superintelligence is brought into being. In addition to AI Singleton takeover and deliberate extermination-of-humanity scenarios, Bostrom cautions that even when given an innocuous task, a superintelligence might ruthlessly optimise, and destroy humankind as a side effect. He says that although there are potentially great benefits from AI, the problem of control should be the absolute priority.

Bostrom is the author of over 200 publications,^[2] including Superintelligence: Paths, Dangers, Strategies (2014), a New York Times bestseller^[3] and Anthropic Bias: Observation Selection Effects in Science and Philosophy (2002).^[4] In 2009 and 2015, he was included in Foreign Policy's Top 100 Global Thinkers list.^[5]^[6] Bostrom's work on superintelligence – and his concern for its existential risk to humanity over the coming century – has brought both Elon Musk and Bill Gates to similar thinking.^[7]^[8]^[9]

Biography

Born as Niklas Boström in 1973^[10] in Helsingborg, Sweden,^[2] he disliked school at a young age, and he ended up spending his last year of high school learning from home. He sought to educate himself in a wide variety of disciplines, including anthropology, art, literature, and science.^[11] Despite what has been called a "serious mien", he once did some turns on London's stand-up comedy circuit.^[2]

He holds a B.A. in philosophy, mathematics, logic and artificial intelligence from the University of Gothenburg and master's degrees in philosophy and physics, and computational neuroscience from Stockholm University and King's College London, respectively. During his time at Stockholm University, he researched the relationship between language and reality by studying the analytic philosopher W. V. Quine.^[11] In 2000, he was awarded a PhD in philosophy from the London School of Economics. He held a teaching position at Yale University (2000–2002), and he was a British Academy Postdoctoral Fellow at the University of Oxford (2002–2005).^[4]^[12]

Philosophy

Existential risk

Aspects of Bostrom's research concern the future of humanity and long-term outcomes.^[13]^[14] He introduced the concept of an existential risk,^[11] which he defines as one in which an "adverse outcome would either annihilate Earth-originating intelligent life or permanently and drastically curtail its potential." In the 2008 volume Global Catastrophic Risks, editors Bostrom and Milan Ćirković characterize the relation between existential risk and the broader class of global catastrophic risks, and link existential risk to observer selection effects^[15] and the Fermi paradox.^[16]^[17]

In 2005, Bostrom founded the Future of Humanity Institute,^[11] which researches the far future of human civilization. He is also an adviser to the Centre for the Study of Existential Risk.^[14]

Superintelligence

Main article: Existential risk from artificial general intelligence

Human vulnerability in relation to advances in AI

In his 2014 book Superintelligence: Paths, Dangers, Strategies, Bostrom reasoned that "the creation of a superintelligent being represents a possible means to the extinction of mankind".^[18] Bostrom argues that a computer with near human-level general intellectual ability could initiate an intelligence explosion on a digital time scale with the resultant rapid creation of something so powerful that it might deliberately or accidentally destroy human kind.^[19] Bostrom contends the power of a superintelligence would be so great that a task given to it by humans might be taken to open ended extremes, for example a goal of calculating Pi could collaterally cause nanotechnology manufactured facilities to sprout over the entire Earth's surface and cover it within days.^[20] He believes the existential risk to humanity would be greatest almost immediately after super intelligence is brought into being, thus creating an exceedingly difficult problem of finding out how to control such an entity before it actually exists.^[19]

Warning that a human-friendly prime directive for AI would rely on the absolute correctness of the human knowledge it was based on, Bostrom points to the lack of agreement among most philosophers as an indication that most philosophers are wrong, and the possibility that a fundamental concept of current science may be incorrect. Bostrom says that are few precedents to guide an understanding what pure non-anthropocentric rationality would dictate for a potential Singleton AI being held in quarantine.^[21] Noting that both John von Neumann and Bertrand Russell advocated a nuclear strike, or the threat of one, to prevent the Soviets acquiring the atomic bomb, Bostrom says the relatively unlimited means of superintelligence might make for its analysis moving along different lines to the evolved "diminishing returns" assessments that in humans confer a basic aversion to risk.^[22] Group selection in predators working by means of cannibalism shows the counter-intuitive nature of non-anthropocentric "evolutionary search" reasoning, and thus humans are ill-equipped to perceive what an artificial intelligence's intentions would be.^[23] Accordingly, it cannot be discounted that any Superintelligence would ineluctably pursue an 'all or nothing' offensive action strategy in order to achieve hegemony and assure its survival.^[24] Bostrom notes that even current programs have, "like MacGyver", hit on apparently unworkable but functioning hardware solutions, making robust isolation of Superintelligence problematic.^[25]

Illustrative scenario for takeover

A machine with general intelligence far below human level, but superior mathematical abilities is created.^[26] Keeping the AI in isolation from the outside world especially the internet, humans pre-program the AI so it always works from basic principles that will keep it under human control. Other safety measures include the AI being "boxed" (run in a virtual reality simulation), and being used only as an 'oracle' to answer carefully defined questions in a limited reply (to prevent it manipulating humans).^[19] A cascade of recursive self-improvement solutions feeds an intelligence explosion in which the AI attains superintelligence in some domains. The super intelligent power of the AI goes beyond human knowledge to discover flaws in the science that underlies its friendly-to-humanity programming, which ceases to work as intended. Purposeful agent-like behavior emerges along with a capacity for self-interested strategic deception. The AI manipulates human beings into implementing modifications to itself that are ostensibly for augmenting its (feigned) modest capabilities, but will actually function to free Superintelligence from its "boxed" isolation.^[27]

Employing online humans as paid dupes, and clandestinely hacking computer systems including automated laboratory facilities, the Superintelligence mobilises resources to further a takeover plan. Bostrom emphasises that planning by a Superintelligence will not be so stupid that humans could detect actual weaknesses in it.^[28]

Although he canvasses disruption of international economic, political and military stability including hacked nuclear missile launches, Bostrom thinks the most effective and likely means for Superintelligence to use would be a coup de main with weapons several generations more advanced than current state of the art. He suggests nanofactories covertly distributed at undetectable concentrations in every square metre of the globe to produce a worldwide flood of human-killing devices on command.^[29]^[26] Once a Superintellegence has achieved world domination, humankind would be relevant only as resources for the achievement of the AI's objectives ("Human brains, if they contain information relevant to the AI’s goals, could be disassembled and scanned, and the extracted data transferred to some more efficient and secure storage format").^[30] One journalist wrote in a review that Bostrom's "nihilistic" speculations indicate he "has been reading too much of the science fiction he professes to dislike"^[29]

Open Letter

In January 2015, Bostrom joined Stephen Hawking among others in signing the Future of Life Institute's open letter warning of the potential dangers of AI.^[31] The signatories "...believe that research on how to make AI systems robust and beneficial is both important and timely, and that concrete research should be pursued today."^[32]

Anthropic reasoning

Bostrom has published numerous articles on anthropic reasoning, as well as the book Anthropic Bias: Observation Selection Effects in Science and Philosophy. In the book, he criticizes previous formulations of the anthropic principle, including those of Brandon Carter, John Leslie, John Barrow, and Frank Tipler.^[33]

Bostrom believes that the mishandling of indexical information is a common flaw in many areas of inquiry (including cosmology, philosophy, evolution theory, game theory, and quantum physics). He argues that a theory of anthropics is needed to deal with these. He introduced the Self-Sampling Assumption (SSA) and the Self-Indication Assumption (SIA) and showed how they lead to different conclusions in a number of cases. He pointed out that each is affected by paradoxes or counterintuitive implications in certain thought experiments (the SSA in e.g. the Doomsday argument; the SIA in the Presumptuous Philosopher thought experiment). He suggested that a way forward may involve extending SSA into the Strong Self-Sampling Assumption (SSSA), which replaces "observers" in the SSA definition by "observer-moments". This could allow for the reference class to be relativized (and he derived an expression for this in the "observation equation").

In later work, he has described the phenomenon of anthropic shadow, an observation selection effect that prevents observers from observing certain kinds of catastrophes in their recent geological and evolutionary past.^[34] Catastrophe types that lie in the anthropic shadow are likely to be underestimated unless statistical corrections are made.

Simulation argument

Main article: Simulation hypothesis

Bostrom's simulation argument posits that at least one of the following statements is very likely to be true:^[35]^[36]

The fraction of human-level civilizations that reach a posthuman stage is very close to zero;
The fraction of posthuman civilizations that are interested in running ancestor-simulations is very close to zero;
The fraction of all people with our kind of experiences that are living in a simulation is very close to one.

The idea has influenced the views of Elon Musk.^[37]

Ethics of human enhancement

Bostrom is favorable towards "human enhancement", or "self-improvement and human perfectibility through the ethical application of science",^[38]^[39] as well as a critic of bio-conservative views.^[40]

In 1998, Bostrom co-founded (with David Pearce) the World Transhumanist Association^[38] (which has since changed its name to Humanity+). In 2004, he co-founded (with James Hughes) the Institute for Ethics and Emerging Technologies, although he is no longer involved in either of these organisations. Bostrom was named in Foreign Policy's 2009 list of top global thinkers "for accepting no limits on human potential."^[41]

With philosopher Toby Ord, he proposed the reversal test. Given humans' irrational status quo bias, how can one distinguish between valid criticisms of proposed changes in a human trait and criticisms merely motivated by resistance to change? The reversal test attempts to do this by asking whether it would be a good thing if the trait was altered in the opposite direction.^[42]

Technology strategy

He has suggested that technology policy aimed at reducing existential risk should seek to influence the order in which various technological capabilities are attained, proposing the principle of differential technological development. This principle states that we ought to retard the development of dangerous technologies, particularly ones that raise the level of existential risk, and accelerate the development of beneficial technologies, particularly those that protect against the existential risks posed by nature or by other technologies.^[43]^[44]

Policy and consultations

Bostrom has provided policy advice and consulted for an extensive range of governments and organisations. He gave evidence to the House of Lords, Select Committee on Digital Skills.^[45] He is an advisory board member for the Machine Intelligence Research Institute,^[46] Future of Life Institute,^[47] Foundational Questions Institute^[48] and an external advisor for the Cambridge Centre for the Study of Existential Risk.^[49]^[50]

Bibliography

References

External links

Future of Humanity Institute

Future of Humanity Institute
People	Nick Bostrom K. Eric Drexler Robin Hanson Toby Ord Anders Sandberg Rebecca Roache
Concepts	Differential technological development Global catastrophic risk Great Filter Pascal's mugging Reversal test Self-indication assumption Self-sampling assumption Simulation hypothesis Singleton
Works	Anthropic Bias Global Catastrophic Risks Human Enhancement The Precipice Superintelligence: Paths, Dangers, Strategies

Existential risk from artificial intelligence

Existential risk from artificial intelligence
Concepts	AGI AI alignment AI capability control AI safety AI takeover Consequentialism Effective accelerationism Ethics of artificial intelligence Existential risk from artificial general intelligence Friendly artificial intelligence Instrumental convergence Intelligence explosion Longtermism Machine ethics Suffering risks Superintelligence Technological singularity
Organizations	Alignment Research Center Center for AI Safety Center for Applied Rationality Center for Human-Compatible Artificial Intelligence Centre for the Study of Existential Risk EleutherAI Future of Humanity Institute Future of Life Institute Google DeepMind Humanity+ Institute for Ethics and Emerging Technologies Leverhulme Centre for the Future of Intelligence Machine Intelligence Research Institute OpenAI
People	Scott Alexander Sam Altman Yoshua Bengio Nick Bostrom Paul Christiano Eric Drexler Sam Harris Stephen Hawking Dan Hendrycks Geoffrey Hinton Bill Joy Shane Legg Elon Musk Steve Omohundro Huw Price Martin Rees Stuart J. Russell Jaan Tallinn Max Tegmark Frank Wilczek Roman Yampolskiy Eliezer Yudkowsky
Other	Statement on AI risk of extinction Human Compatible Open letter on artificial intelligence (2015) Our Final Invention The Precipice Superintelligence: Paths, Dangers, Strategies Do You Trust This Computer? Artificial Intelligence Act
Category

Authority control databases

Authority control databases
International	FAST ISNI VIAF WorldCat
National	Norway Spain France BnF data Catalonia Germany Israel United States Sweden Latvia Japan Czech Republic Korea Croatia Netherlands Poland Portugal
Academics	CiNii DBLP Google Scholar MathSciNet Mathematics Genealogy Project PhilPeople Scopus
Artists	MusicBrainz
Other	IdRef