
Field Building Analysis

Summary: Analysis of AI safety field-building interventions finds that the field grew from ~400 FTEs (2022) to ~1,100 FTEs (2025) at 21-30% annually, and that training programs achieve roughly 37% career conversion at $5,000-40,000 per career change. The key bottleneck identified is a talent pipeline over-optimized for researchers.

Field Building and Community

Category: Meta-level intervention
Time Horizon: 3-10+ years
Primary Mechanism: Human capital development
Key Metric: Researchers produced per year
Entry Barrier: Low to Medium
| Dimension | Assessment | Evidence |
|---|---|---|
| Field Size (2025) | 1,100 FTEs (600 technical, 500 non-technical) | AI Safety Field Growth Analysis 2025 |
| Annual Growth Rate | 21-30% since 2020 | Technical: 21% FTE growth; Non-technical: 30% |
| Total Philanthropic Funding | $110-130M/year (2024) | Overview of AI Safety Funding |
| Training Program Conversion | 37% work full-time in AI safety | BlueDot 2022 Cohort Analysis |
| Cost per Career Change | $5,000-40,000 depending on program | ARENA lower-touch, MATS higher-touch |
| Key Bottleneck | Talent pipeline over-optimized for researchers | EA Forum analysis |
| Tractability | Medium-High | Programs show measurable outcomes |

Field-building focuses on growing the AI safety ecosystem rather than doing direct research or policy work. The theory is that by increasing the number and quality of people working on AI safety, we multiply the impact of all other interventions.

This is a meta-level or capacity-building intervention—it doesn’t directly solve the technical or governance problems, but creates the infrastructure and talent pipeline that makes solving them possible.

The field has grown substantially: from approximately 400 full-time equivalents (FTEs) in 2022 to roughly 1,100 FTEs in 2025, with technical AI safety organizations growing at 24% annually and non-technical organizations at approximately 30% annually. However, this growth has created new challenges—the pipeline may be over-optimized for researchers while neglecting operations, policy, and other critical roles.


Key mechanisms:

  1. Talent pipeline: Train and recruit people into AI safety
  2. Knowledge dissemination: Spread ideas and frameworks
  3. Community building: Create support structures and networks
  4. Funding infrastructure: Direct resources to promising work
  5. Public awareness: Build broader support and understanding

Training and Education Programs

Goal: Teach AI safety concepts and skills to potential contributors.

| Program | Format | Duration | Scale | Cost/Participant | Placement Rate | Key Outcomes |
|---|---|---|---|---|---|---|
| MATS | Research mentorship | 3-4 months | 30-50/cohort | ~$20,000-40,000 | 75% publish results | Alumni at Anthropic, OpenAI, DeepMind; founded Apollo Research, Timaeus |
| ARENA | In-person bootcamp | 4-5 weeks | 20-30/cohort | ~$5,000-15,000 | 8 confirmed FT positions (5.0 cohort) | Alumni at Apollo Research, METR, UK AISI |
| BlueDot Impact | Online cohort-based | 8 weeks | 1,000+/year | ~$440/student | 37% work FT in AI safety | 6,000+ trained since 2022; 75% completion rate |
| SPAR | Part-time remote | Varies | 50+/cohort | Low (volunteer mentors) | Research output focused | Connects aspiring researchers with professionals |
| AI Safety Camp | Project-based | 1-2 weeks | 20-40/camp | Varies | Project completion | Multiple camps globally |

Key Programs in Detail:

MATS (ML Alignment & Theory Scholars):

  • Since 2021, has supported 298 scholars and 75 mentors
  • Summer 2024: 1,220 applicants, 3-5% acceptance rate (comparable to MIT admissions)
  • Spring 2024 Extension: 75% of scholars published results; 57% accepted to conferences
  • Notable: Nina Panickssery’s paper on steering Llama 2 won Outstanding Paper Award at ACL 2024
  • Alumni include researchers at Anthropic, OpenAI, and Google DeepMind
  • Received $23.6M in Open Philanthropy funding for general support

ARENA (Alignment Research Engineer Accelerator):

  • Runs 2-3 bootcamps per year, each 4-5 weeks, based at LISA in London
  • ARENA 5.0: 8 participants confirmed full-time AI safety positions post-program
  • Participants rate exercise enjoyment 8.7/10, LISA location value 9.6/10
  • Alumni quote: “ARENA was the most useful thing that could happen to someone with a mathematical background who wants to enter technical AI safety research”
  • Claims to be among the most cost-effective technical AI safety training programs

BlueDot Impact (formerly AI Safety Fundamentals):

  • Trained 6,000+ professionals worldwide since 2022
  • 2022 cohort analysis: 123 alumni (37% of 342) now work full-time on AI safety
  • 20 alumni would not be working on AI safety were it not for the course (counterfactual impact)
  • 75% completion rate (vs. 20% for typical Coursera courses)
  • Raised $34M total funding, including $25M in 2025
  • Alumni at Anthropic, Google DeepMind, UK AI Security Institute

Theory of change: Train people in AI safety → some pursue careers → net increase in research capacity

Effectiveness considerations:

  • High leverage: One good researcher can contribute for decades
  • Measurable conversion: BlueDot shows 37% career conversion; ARENA shows 8+ direct placements per cohort
  • Counterfactual question: BlueDot estimates 20 counterfactual career changes from 2022 cohort
  • Quality vs. quantity: More selective programs (MATS, ARENA) show higher placement rates
Cost per Career Change

Estimated cost to move one person into an AI safety career via training programs:

| Source | Estimate | Notes |
|---|---|---|
| ARENA (successful cases) | $5,000-15,000 | Direct program costs per career change |
| MATS | $20,000-40,000 | Higher touch, research mentorship |
| BlueDot Impact | $440-2,000 | Scalable online; 37% conversion rate |
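
As a rough illustration of how these per-career figures can be derived, here is a back-of-envelope sketch using the BlueDot 2022 cohort numbers quoted above (342 participants, ~$440 per student, 123 full-time entrants, 20 self-reported counterfactual career changes). It ignores staff overhead and the value of partial contributions, so treat it as illustrative rather than an official cost-effectiveness estimate.

```python
# Back-of-envelope cost per career change from the BlueDot 2022 cohort
# figures quoted above. Illustrative only: ignores overheads and partial
# contributions, and assumes the ~$440/student figure covers all costs.

students = 342          # 2022 cohort size
cost_per_student = 440  # approximate cost per participant (USD)
full_time = 123         # alumni now working full-time on AI safety (~37%)
counterfactual = 20     # alumni who credit the course for their career change

total_cost = students * cost_per_student
print(f"Total cohort cost:              ${total_cost:,}")
print(f"Cost per full-time entrant:     ${total_cost / full_time:,.0f}")
print(f"Cost per counterfactual change: ${total_cost / counterfactual:,.0f}")
```

The gap between the second and third numbers is why the counterfactual question matters: naive conversion rates overstate impact if many alumni would have entered the field anyway.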

Who’s doing this:

  • ARENA (Redwood Research / independent)
  • MATS (independent, Lightcone funding)
  • BlueDot Impact
  • Various university courses and programs

Public Communication and Awareness

Goal: Increase general understanding of AI risk and build support for safety efforts.

Approaches:

Popular Media:

  • Podcasts (Lex Fridman, Dwarkesh Patel, 80K Hours)
  • Books (Superintelligence, The Alignment Problem, The Precipice)
  • Documentaries and videos
  • News articles and op-eds
  • Social media presence

High-Level Engagement:

  • Statement on AI Risk (May 2023): Geoffrey Hinton, Yoshua Bengio, Demis Hassabis, Sam Altman, Dario Amodei signed
    • “Mitigating the risk of extinction from AI should be a global priority”
    • Raised public and elite awareness
  • Expert testimony to governments
  • Academic conferences and workshops
  • Industry events and presentations

Accessible Explanations:

  • Robert Miles YouTube channel
  • AI Safety memes and infographics
  • Explainer articles
  • University lectures and courses

Theory of change: Awareness → political will for governance + cultural shift toward safety + talent recruitment

Effectiveness:

  • Uncertain impact on x-risk: Unclear if awareness translates to action
  • Possible downsides:
    • AI hype and race dynamics
    • Association with less credible narratives
    • Backlash and polarization
  • Possible upsides:
    • Political support for regulation
    • Recruitment to field
    • Cultural shift in labs

Who’s doing this:

  • Individual communicators (Miles, Yudkowsky, Christiano, etc.)
  • Organizations (CAIS, FLI)
  • Journalists covering AI
  • Academics doing public engagement

Funding Infrastructure

Goal: Direct resources to high-impact work and people.

| Funding Source | Amount (2024) | % of Total | Key Recipients |
|---|---|---|---|
| Open Philanthropy | ~$63.6M | 49% | CAIS ($8.5M), Redwood ($6.2M), MIRI ($4.1M) |
| Individual Donors (e.g., Jaan Tallinn) | ~$20M | 15% | Various orgs and researchers |
| Government Funding | ~$32.4M | 25% | AI Safety Institutes, university research |
| Corporate External Investment | ~$8.2M | 6% | Frontier Model Forum AI Safety Fund |
| Academic Endowments | ~$6.8M | 5% | University centers |
| Total Philanthropic | $110-130M | 100% | |

Source: Overview of AI Safety Funding Situation

Note: This excludes internal corporate safety research budgets, estimated at greater than $500M annually across major AI labs. Total ecosystem funding including corporate is approximately $600-650M/year.

Context: Philanthropic funding for climate risk mitigation was approximately $9-15 billion in 2023, roughly 100 times philanthropic AI safety funding. With over $189 billion in AI investment projected for 2024, safety funding (even including corporate budgets) remains well under 1% of total AI investment.
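
A quick check of the ratios implied by these figures. Collapsing the quoted ranges to midpoints is an assumption made here for illustration only.

```python
# Ratios implied by the funding figures above. Ranges are collapsed to
# rough midpoints purely for illustration.

philanthropic_safety = 120e6   # ~$110-130M/year philanthropic AI safety funding
total_safety = 625e6           # ~$600-650M/year including corporate safety budgets
climate_philanthropy = 12e9    # ~$9-15B climate philanthropy (2023)
ai_investment = 189e9          # projected 2024 AI investment

print(f"Climate vs. AI safety philanthropy: ~{climate_philanthropy / philanthropic_safety:.0f}x")
print(f"Philanthropic safety share of AI investment: {philanthropic_safety / ai_investment:.2%}")
print(f"Total safety share of AI investment:         {total_safety / ai_investment:.2%}")
```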

Major Funders:

Open Philanthropy:

  • Largest AI safety funder (~$50-65M/year to technical AI safety)
  • 2025 Technical AI Safety RFP: Expected to spend ~$40M over 5 months
  • Key 2024-25 grants: MATS ($23.6M), CAIS ($8.5M), Redwood Research ($6.2M)
  • Self-assessment: “Rate of spending was too slow” in 2024; committed to expanding support
  • Supporting work on AI safety since 2015

AI Safety Fund (Frontier Model Forum):

  • $10M+ collaborative initiative established October 2023
  • Founding members: Anthropic, Google, Microsoft, OpenAI
  • Philanthropic partners: Patrick J. McGovern Foundation, Packard Foundation, Schmidt Sciences, Jaan Tallinn

Survival and Flourishing Fund (SFF):

  • ~$30-50M/year
  • Broad AI safety focus
  • Supports unconventional projects
  • Smaller grants, more experimental

Effective Altruism Funds (Long-Term Future Fund):

  • ~$10-20M/year to AI safety
  • Small to medium grants
  • Individual researchers and projects
  • Lower bar for experimental work

Grantmaking Strategies:

Hits-based giving:

  • Accept high failure rate for potential breakthroughs
  • Fund unconventional approaches
  • Support early-stage ideas

Ecosystem development:

  • Fund infrastructure (ARENA, MATS, etc.)
  • Support conferences and gatherings
  • Build community spaces

Diversification:

  • Support multiple approaches
  • Don’t cluster too heavily
  • Hedge uncertainty

Theory of change: Capital → enables people and orgs to work on AI safety → research and policy progress


Who should consider this:

  • Program officers at foundations
  • Individual donors with wealth
  • Fund managers
  • Requires: wealth or institutional position + good judgment + network

Community Building

Goal: Create infrastructure that supports AI safety work.

Activities:

Gatherings and Conferences:

  • EA Global (AI safety track)
  • AI Safety conferences
  • Workshops and retreats
  • Local meetups
  • Online forums (Alignment Forum, LessWrong, Discord servers)

Career Support:

  • 80,000 Hours career advising
  • Mentorship programs
  • Job boards and hiring pipelines
  • Introductions and networking

Research Infrastructure:

  • Alignment Forum (discussion platform)
  • ArXiv overlays and aggregation
  • Compute access programs
  • Shared datasets and benchmarks

Emotional and Social Support:

  • Community spaces
  • Mental health resources
  • Peer support for difficult work
  • Social events

Theory of change: Supportive community → people stay in field longer → more cumulative impact + better mental health

Challenges:

  • Insularity: Echo chambers and groupthink
  • Barrier to entry: Can feel cliquish to newcomers
  • Time investment: Social events vs. object-level work
  • Ideological narrowness: Lack of diversity in perspectives

Who’s doing this:

  • CEA (Centre for Effective Altruism)
  • Local EA groups
  • Lightcone Infrastructure (LessWrong, Alignment Forum)
  • Individual organizers

Academic Field-Building

Goal: Establish AI safety as a legitimate academic field.

University Centers and Programs:

| Institution | Center/Program | Focus | Status |
|---|---|---|---|
| UC Berkeley | CHAI (Center for Human-Compatible AI) | Foundational alignment research | Active |
| Oxford | Future of Humanity Institute | Existential risk research | Closed 2024 |
| MIT | AI Safety Initiative | Technical safety, governance | Growing |
| Stanford | HAI (Human-Centered AI) | Broad AI policy, some safety | Active |
| Carnegie Mellon | AI Safety Research | Technical safety | Active |
| Cambridge | LCFI, CSER | Existential risk, policy | Active |

Key Developments (2024-2025):

Academic Incentives:

  • Tenure-track positions in AI safety emerging
  • PhD programs with safety focus
  • Grants for safety research (NSF, etc.)
  • Prestigious publication venues (NeurIPS safety track, ICLR)
  • Academic conferences (AI Safety research conferences)


Challenges:

  • Slow timelines: Academic careers are 5-10 year investments
  • Misaligned incentives: Publish or perish vs. impact
  • Capabilities research: Universities also advance capabilities
  • Brain drain: Best people leave for industry/nonprofits (frontier labs pay 2-5x academic salaries)

Benefits:

  • Legitimacy: Academic credibility helps policy
  • Training: PhD pipeline
  • Long-term research: Can work on harder problems
  • Geographic distribution: Not just SF/Bay Area

Theory of change: Academic legitimacy → more talent + more funding + political influence → field growth


Field Growth Over Time

The AI safety field has grown substantially since 2020, with acceleration around 2023 coinciding with increased public attention following ChatGPT's release.

| Year | Technical AI Safety FTEs | Non-Technical AI Safety FTEs | Total FTEs | Organizations |
|---|---|---|---|---|
| 2015 | ~50 | ~20 | ~70 | ~15 |
| 2020 | ~150 | ~50 | ~200 | ~30 |
| 2022 | ~300 | ~100 | ~400 | ~50 |
| 2024 | ~500 | ~400 | ~900 | ~65 |
| 2025 | ~600-645 | ~500 | ~1,100 | ~70 |

Source: AI Safety Field Growth Analysis 2025

Growth rates:

  • Technical AI safety organizations: 24% annual growth
  • Technical AI safety FTEs: 21% annual growth
  • Non-technical AI safety: approximately 30% annual growth (accelerating since 2023)
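
Growth-rate figures depend heavily on the window and subset of the field being measured. As a sketch of how a compound annual growth rate (CAGR) can be read off the total-FTE table above (my own calculation from the table, not a figure from the source):

```python
# Compound annual growth rate (CAGR) implied by the total-FTE table above.
# Different start years give different rates, which is one reason quoted
# growth figures vary with the window and subfield chosen.

total_ftes = {2015: 70, 2020: 200, 2022: 400, 2024: 900, 2025: 1100}

def cagr(start: int, end: int) -> float:
    """Annualized growth rate between two years in the table."""
    return (total_ftes[end] / total_ftes[start]) ** (1 / (end - start)) - 1

for start in (2015, 2020, 2022, 2024):
    print(f"{start}-2025: {cagr(start, 2025):.0%} per year")
```

On these numbers, the single-year 2024-2025 growth (~22%) is closest to the headline 21-30% per-subfield rates, while windows that include the 2023 surge imply faster average growth.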

Top research areas by FTEs:

  1. Miscellaneous technical safety (scalable oversight, adversarial robustness, jailbreaks)
  2. LLM safety
  3. Interpretability

Methodology note: These estimates may undercount people working on AI safety since many work at organizations that don’t explicitly brand themselves as AI safety organizations, particularly in technical safety in academia.


Conditions for High Impact

For field-building to be high impact:

  1. Talent is bottleneck: More people actually means more progress (vs. “too many cooks”)
  2. Sufficient time: Field-building is multi-year investment; need time before critical period
  3. Quality maintained: Growth doesn’t dilute quality or focus
  4. Absorptive capacity: Ecosystem can integrate new people
  5. Right people: Recruiting those with high potential for contribution
  6. Complementarity: New people enable work that wouldn’t happen otherwise

The AI safety field faces several structural challenges that limit the effectiveness of field-building efforts:

Pipeline Over-Optimization for Researchers


According to analysis on the EA Forum, the AI safety talent pipeline is over-optimized for researchers:

  • The majority of AI safety talent pipelines are optimized for selecting and producing researchers
  • Research is not the most neglected talent type in AI safety
  • This leads to research-specific talent being over-represented in the community
  • Supporting programs strongly select for research skills, missing other crucial roles

Neglected roles: Operations, program management, communications, policy implementation, organizational leadership.

There’s a massive gap between awareness-level training and the expertise required for selective research fellowships:

  • BlueDot plans to train 100,000 people in AI safety fundamentals over 4.5 years
  • But few programs bridge from introductory courses to elite research fellowships
  • Need scalable programs for the “missing middle”
  • Not enough talented founders are building AI safety organizations
  • Catalyze’s pilot program incubated 11 organizations, with participants reporting the program accelerated progress by an average of 11 months
  • Open positions often don’t exist because organizations haven’t been founded

AI Safety Institutes and external research struggle to compete with frontier AI companies:

  • Frontier companies offer substantially higher compensation packages
  • AISIs must appeal to researchers’ desire for public service and impact
  • Some approaches: joint university appointments, research sabbaticals, rotating fellowships

Dilution of quality:

  • Too many people with insufficient expertise
  • “Alignment washing” - superficial engagement
  • Noise drowns out signal

Mitigation: Selective programs, emphasis on quality, mentorship

Information hazards:

  • Publicly discussing AI capabilities could accelerate them
  • Spreading awareness of potential attacks
  • Attracting bad actors

Mitigation: Careful communication, expert judgment on what to share

Hype and race dynamics:

  • Public attention accelerates AI development
  • Creates FOMO (fear of missing out)
  • Geopolitical competition

Mitigation: Frame carefully, emphasize cooperation, private engagement

Community insularity:

  • Groupthink and echo chambers
  • Lack of ideological diversity
  • Social dynamics override epistemic rigor
  • Cult-like dynamics

Mitigation: Encourage disagreement, diverse perspectives, epistemic humility

If timelines are long:

Impact: Very High

  • Time for field-building to compound
  • Training pays off over decades
  • Can build robust institutions
  • Best time to invest in human capital

If timelines are short:

Impact: Low-Medium

  • Insufficient time for new people to become experts
  • Better to leverage existing talent
  • Exception: rapid deployment of already-skilled people

If talent is the key bottleneck:

Impact: High

  • Every good researcher counts
  • Ecosystem effects are strong
  • More perspectives improve solutions

If talent is not the key bottleneck:

Impact: Low

  • Talent bottleneck is overstated
  • Coordination costs dominate
  • Focus on existing excellent people
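
One way to combine these scenarios is a simple expected-value sketch. The 2x2 combination, probabilities, and 0-10 impact scores below are placeholder assumptions introduced for illustration, not estimates from this page; the point is only that, under numbers like these, most of field-building's expected value comes from long-timeline, talent-constrained worlds.

```python
# Toy expected-impact sketch combining the timeline and talent-bottleneck
# scenarios above. All probabilities and 0-10 impact scores are placeholder
# assumptions for illustration, not estimates from the text.

scenarios = {
    "long timelines, talent-constrained":  (0.30, 9),
    "long timelines, idea-constrained":    (0.20, 4),
    "short timelines, talent-constrained": (0.30, 3),
    "short timelines, idea-constrained":   (0.20, 1),
}

expected = sum(p * impact for p, impact in scenarios.values())
for name, (p, impact) in scenarios.items():
    print(f"{name:<38} p={p:.2f} impact={impact}")
print(f"Expected relative impact: {expected:.1f} / 10")
```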

Strong fit if you:

  • Enjoy teaching, mentoring, organizing
  • Good at operations and logistics
  • Strong communication skills
  • Can evaluate talent and potential
  • Patient with long timelines
  • Value community and culture

Specific roles:

  • Program manager: Run training programs (ARENA, MATS, etc.)
  • Grantmaker: Evaluate and fund projects
  • Educator: Teach courses, create content
  • Community organizer: Events, spaces, support
  • Communicator: Explain AI safety to various audiences

Backgrounds:

  • Education / pedagogy
  • Program management
  • Operations
  • Communications
  • Community organizing
  • Content creation

Entry paths:

  • Staff role at training program
  • Local group organizer → full-time
  • Teaching assistant → program lead
  • Communications role
  • Grantmaking entry programs

Less good fit if:

  • Prefer direct object-level work
  • Impatient with meta-level interventions
  • Don’t enjoy working with people
  • Want immediate measurable impact

Key organizations:

  • ARENA (Redwood / independent)
  • MATS (independent)
  • BlueDot Impact (running AGI Safety Fundamentals)
  • AI Safety Camp
  • Centre for Effective Altruism (CEA)
    • EAG conferences
    • University group support
    • Community health
  • Lightcone Infrastructure
    • LessWrong, Alignment Forum
    • Conferences and events
    • Office spaces
  • Open Philanthropy (largest funder)
  • Survival and Flourishing Fund
  • EA Funds - Long-Term Future Fund
  • Founders Pledge
  • CHAI (UC Berkeley)
  • Various university groups
  • Individual content creators
  • Center for AI Safety (CAIS) (public advocacy)
  • Journalists and media

Upsides of field-building careers:

  • Leveraged impact: Enable many others
  • People-focused: Work with smart, motivated people
  • Varied work: Teaching, organizing, strategy
  • Lower barrier: Don’t need research-level technical skills
  • Rewarding: See people grow and succeed

Downsides:

  • Hard to measure: Impact is indirect and delayed
  • Meta-level: One step removed from object-level problem
  • Uncertain: May not produce expected talent
  • Community dependent: Success depends on others
  • Burnout risk: Emotionally demanding

Typical compensation:

  • Program staff: $10-100K
  • Directors: $100-150K
  • Grantmakers: $80-150K
  • Community organizers: $40-80K (often part-time)

Note: Field-building often pays less than technical research but more than pure volunteering

Useful skills:

  • Program management
  • Teaching and mentoring
  • Evaluation and judgment
  • Operations
  • Communication

Field-building enables and amplifies:

  • Technical research: Creates researcher pipeline
  • Governance: Trains policy experts
  • Corporate influence: Provides talent to labs
  • All interventions: Increases capacity across the board

Key Questions

Is AI safety talent-constrained or idea-constrained?
Should we prioritize growth or quality in field-building?

If you want to contribute to field-building:

  1. Understand the field first:

    • Learn AI safety yourself
    • Engage with community
    • Understand current state
  2. Identify your niche:

    • Teaching? → Develop curriculum, TA for programs
    • Organizing? → Start local group, help with events
    • Funding? → Learn grantmaking, advise donors
    • Communication? → Write, make videos, explain concepts
  3. Start small:

    • Volunteer for existing programs
    • Organize local reading group
    • Create content
    • Help with events
  4. Build track record:

    • Demonstrate impact
    • Get feedback
    • Iterate and improve
  5. Scale up:

    • Apply for staff roles
    • Launch new programs
    • Seek funding for initiatives

Resources:

  • CEA community-building resources
  • 80,000 Hours on field-building
  • Alignment Forum posts on field growth
  • MATS/ARENA/BlueDot as examples


Field-building affects the AI Transition Model through multiple factors:

| Factor | Parameter | Impact |
|---|---|---|
| Misalignment Potential | Safety-Capability Gap | Grew field from 400 to 1,100 FTEs (2022-2025) at 21-30% annually |
| Misalignment Potential | Alignment Robustness | Training programs achieve 37% career conversion at $5K-40K per career change |
| Civilizational Competence | Institutional Quality | Builds capacity across labs, government, and advocacy organizations |

The key bottleneck is a talent pipeline over-optimized for researchers; the field needs more governance, policy, and operations professionals.