All Events
governance
☆ LIEZHUAN

Asilomar AI Principles: The Research Community Sets Its Own Rules

Overview On January 8, 2017, approximately 1,000 AI researchers and thought leaders convened at the Asilomar Conference Grounds in Pacific Grove, California — the same location where, in 1975, biologists had gathered to set safety standards …

2017-01-08

Overview

On January 8, 2017, approximately 1,000 AI researchers and thought leaders convened at the Asilomar Conference Grounds in Pacific Grove, California — the same location where, in 1975, biologists had gathered to set safety standards for recombinant DNA technology. The conference was organized by the Future of Life Institute (FLI), co-founded by MIT physicist Max Tegmark and advised by figures including Stuart Russell, Nick Bostrom, and Elon Musk.

The outcome was the Asilomar AI Principles — 23 guidelines signed by more than 1,200 AI researchers and public intellectuals, representing the first broad scientific consensus statement on the development of beneficial AI.

The 23 Principles

The principles were organized into three clusters:

Research Issues (5 principles)

  • AI research should be for the benefit of humanity, not profit alone
  • AI researchers should maintain healthy communication about capabilities and limitations
  • Safety research investment should grow proportionally with capabilities investment
  • Failure mode research should be encouraged, not suppressed
  • Highly autonomous AI systems should be safe in the environments they operate in

Ethics and Values (13 principles)

  • Value alignment: AI systems should be designed so their goals align with human values
  • Transparency: AI should not design itself to deceive humans about its nature
  • Responsibility: Designers bear responsibility for misuse they could reasonably foresee
  • Privacy: AI should not violate the privacy of individuals
  • Liberty and privacy: AI should not undermine people’s autonomy or democratic institutions
  • Shared benefit: The economic gains from AI should be widely shared
  • Shared prosperity: AI capability should benefit all humanity, not concentrate power

Longer-Term Issues (5 principles)

  • Capability caution: Recursive self-improvement should be subject to strict safety constraints
  • Common good: Superintelligent AI should serve “widely shared ethical ideals”
  • Existential risk avoidance: Advanced AI poses potential risks that merit serious mitigation
  • Non-subversion: AI should not undermine legitimate democratic oversight

Who Signed

Signatories included: Stephen Hawking, Elon Musk (before his falling-out with OpenAI), Demis Hassabis (DeepMind), Yann LeCun (Facebook AI), Yoshua Bengio (Mila), Geoffrey Hinton (Google Brain), and hundreds of prominent academics.

Notably, several signatories had major commercial interests in AI acceleration — demonstrating that safety concerns were not limited to outsiders or critics.

Significance and Criticism

Why it mattered:

  • The first time the AI field as a collective formally addressed its own existential risks
  • Established “value alignment” as a legitimate research problem, not science fiction
  • Created a common vocabulary (beneficial AI, existential risk, alignment) that shaped subsequent discourse
  • Named documents that followed — including OpenAI’s charter, Anthropic’s constitution, EU AI Act principles — echo its language

Criticisms:

  • The principles were voluntary, aspirational, and unenforceable
  • Corporate signatories continued racing on capabilities regardless
  • Subsequent years saw the very signatories compete on speed, undermining the cooperative spirit
  • The 2017 principles did not anticipate the scale of disruption from language models (ChatGPT arrived 5 years later)

The Asilomar principles are best understood not as binding rules but as the moment the field acknowledged the stakes — the first time AI researchers collectively said “this matters beyond our careers.”

References