Skip to content

Value Lock-in

Value lock-in occurs when AI enables the permanent entrenchment of a particular set of values, making future change extremely difficult or impossible. This is a symmetric critical outcome—lock-in could preserve beneficial values (democratic norms, human rights, flourishing) or entrench harmful ones (authoritarianism, narrow corporate interests, destructive ideologies).

The key insight is that AI may create unprecedented stability. Throughout history, bad regimes eventually fell; bad ideas eventually lost influence. AI could change this by providing tools for permanent control or by optimizing systems so effectively that alternatives become unviable.


Symmetric: Can be good or bad.

PoleDescriptionExample
Positive lock-inBeneficial values become stable and protectedDemocratic norms, human rights, flourishing embedded in stable institutions
Negative lock-inHarmful values become permanentAuthoritarian control, corporate extraction, ideological extremism entrenched forever

The same mechanisms can produce either outcome. The key question is: which values get locked in?


Loading diagram...

1. Power Concentration AI capabilities may concentrate in the hands of a small number of actors (states, corporations, individuals). Once concentrated, power can be self-reinforcing—those with AI advantages can use them to maintain their position.

2. AI-Enabled Surveillance Unprecedented surveillance capabilities could make resistance to any regime impossible. Dissenters can be identified before they organize; alternative power bases can be dismantled before they form.

3. Optimization Lock-in AI systems optimizing for particular objectives create path dependencies. Once systems are built around certain assumptions, changing course becomes increasingly costly. Society may “optimize itself into a corner.”

4. Infrastructure Dependency Critical systems may become dependent on particular AI configurations. Changing values would require rebuilding infrastructure—potentially prohibitively expensive.


An authoritarian regime uses AI to make its control permanent:

  • Total surveillance eliminates privacy and organized resistance
  • AI-optimized propaganda shapes beliefs at individual level
  • Economic systems reward loyalty and punish dissent
  • Security forces can identify and neutralize threats preemptively

Historical analogy: Totalitarian regimes of the 20th century eventually fell. AI might make such regimes stable indefinitely.

Corporate interests become permanently entrenched:

  • Regulatory capture amplified by AI lobbying and influence
  • Market positions become unassailable through AI advantages
  • Consumer behavior optimally manipulated
  • Democratic oversight becomes nominal

A particular ideology or worldview becomes permanent:

  • AI-optimized memetic content entrenches beliefs
  • Alternative perspectives filtered or suppressed
  • Education and socialization controlled
  • Could be religious, political, or philosophical

Even without malice, values may simply freeze:

  • Current values encoded into long-lasting AI systems
  • No mechanism for moral progress
  • Humanity locked into 2020s ethics forever
  • Loss of the “moral circle expansion” that has characterized human history

ParameterDirectionImpact on Negative Lock-in
AI Control ConcentrationHigh → EnablesFew actors can entrench their values
Human AgencyLow → EnablesLess capacity to resist or change course
Governance CapacityLow → EnablesInstitutions can’t prevent capture
Societal ResilienceLow → EnablesLess ability to recover from bad lock-in
Epistemic HealthLow → EnablesHarder to recognize bad values being locked in

Lock-in directly determines the long-run trajectory—whose values shape the future:

  • Positive lock-in → Stable, flourishing future
  • Negative lock-in → Permanent dystopia or stagnation

Bad lock-in may be as catastrophic as extinction:

  • Permanent totalitarianism could be “worse than death”
  • Value stagnation could foreclose most of the future’s potential value
  • Lock-in typically happens gradually, but the moment of “no return” may be sharp

Signs of negative lock-in forming:

  1. Power concentration accelerating beyond historical norms
  2. Surveillance capabilities expanding without oversight
  3. Dissent becoming systematically difficult
  4. AI systems encoding current values without mechanisms for updating
  5. Alternative AI development being foreclosed
  6. Democratic institutions weakening while AI capabilities grow

Signs of positive lock-in potential:

  1. Democratic oversight of AI development
  2. Distributed AI capabilities
  3. Strong civil society engagement
  4. International cooperation on AI governance
  5. Explicit attention to value representation and updating

To prevent negative lock-in:

  • Distributed AI development (prevent concentration)
  • Strong democratic oversight of AI deployment
  • Privacy protections against surveillance
  • Antitrust enforcement in AI markets
  • International cooperation to prevent authoritarian AI exports

To enable positive lock-in:

  • Deliberate encoding of beneficial values in AI systems
  • Mechanisms for updating values as moral understanding improves
  • Broad stakeholder input into AI development
  • Constitutional protections adapted for AI era

Lock-in risk depends heavily on which values are currently winning:

ScenarioAssessment
Some form of lock-in occursLikely if AI continues advancing
Authoritarian lock-inSignificant risk in some regions; global unclear
Corporate lock-inAlready partially occurring in some domains
Positive lock-inPossible but requires deliberate effort


Ratings

MetricScoreInterpretation
Changeability30/100Hard to prevent or redirect
X-risk Impact50/100Meaningful extinction risk
Trajectory Impact95/100Major effect on long-term welfare
Uncertainty65/100Moderate uncertainty in estimates