📊 Full opportunity report: The Co-Founder’s Black Hole — A Structural Read on Jack Clark’s Automated AI R&D Essay on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

Jack Clark, co-founder of Anthropic, forecasts a >60% probability that AI systems capable of autonomous research will emerge by 2028. This prediction highlights a potential structural shift in AI development, but significant uncertainties remain about the timeline and implications.

On May 4, 2026, Jack Clark, co-founder of Anthropic and head of policy, published a forecast stating there is more than a 60% probability that fully autonomous AI research systems—capable of building their own successors—will emerge by the end of 2028. This marks a significant institutional commitment and highlights a potential turning point in AI development, with broad implications for policy, safety, and technological capacity.

Clark’s forecast is based on a synthesis of recent benchmark data, technical assessments, and the convergence of multiple indicators suggesting rapid progress toward autonomous AI research capabilities. The forecast includes a 30% probability of occurrence by 2027 and emphasizes that current institutional responses are likely inadequate given the pace of technological advancement.

Clark’s analysis draws on six key benchmarks, which demonstrate exponential improvement in AI capabilities over the past two years. These include measures of AI training speed, problem-solving ability, and research automation potential. The evidence suggests that the threshold for autonomous research—an AI system capable of end-to-end self-directed research—may be within reach by 2028.

The essay also warns of a ‘structural black hole’—a point beyond which the predictability of AI development trajectories sharply degrades, making future developments essentially unknowable and potentially uncontrollable. Clark emphasizes that current institutional capacity is not aligned with the urgency posed by this forecast, raising concerns about preparedness and safety measures.

The Co-Founder’s Black Hole — A Structural Read on Jack Clark’s Automated AI R&D Essay

DISPATCH / MAY 2026 CLARK SERIES · 5 OF 5 · THE SYNTHESIS

▲ Clark Series 05 The Synthesis · Black Hole · May 2026

The Co-Founder’s Black Hole · A Structural Read

The black hole
is visible.

Four threads converge. One window. Anthropic’s head of policy has publicly committed to crossing a civilizational threshold within 32 months.

The structural feature of Clark’s argument is not that we cross a boundary and continue forward; it is that beyond a certain threshold, the forecastability of subsequent events degrades dramatically. We can see the geometry around the threshold. We can estimate when we will reach it. We cannot model what happens on the other side. The black hole event horizon analogy is precise.

Thorsten Meyer / ThorstenMeyerAI.com / May 2026

32mo

Window · May 2026 → December 2028

Clark’s forecast resolution window

60%+

Clark’s published probability

Automated AI R&D by end-2028

40-50%

Thorsten’s subjective probability

Lower than Clark · synthesis-level errors

5 / 5

Synthesis-level omissions identified

China · IPO · compute · info ecology · coordination

● THE BLACK HOLE IS VISIBLE EVENT HORIZON 32 MONTHS OUT · MAY 2026 → DECEMBER 2028 ● FOUR THREADS CONVERGE STATEMENT + CASCADE + MATH + ENDPOINT = ONE STRUCTURAL FINDING ● CATASTROPHIC TIMELINE THREADS 1 + 3 · CLARK FORECAST + COMPOUNDING ERROR ● POLICY EMERGENCY TIMELINE THREADS 1 + 4 · CLARK FORECAST + MACHINE ECONOMY ● 5 SYNTHESIS OMISSIONS CHINA · IPO · COMPUTE · INFO ECOLOGY · COORDINATION ● THE AGI DEBATE IS NOW CLOSED FOR THE PEOPLE WHO WOULD KNOW ● THE BLACK HOLE IS VISIBLE EVENT HORIZON 32 MONTHS OUT · MAY 2026 → DECEMBER 2028 ● FOUR THREADS CONVERGE STATEMENT + CASCADE + MATH + ENDPOINT

The four threads · in compressed form

Four pieces. One argument.

The four prior pieces in this series each addressed a single thread of Clark’s argument. The threads are independently significant. What this synthesis argues: they converge on a structural finding larger than any individual thread.

The four threads · compressed

Each card points back to the full sub-piece. Read in any order; the synthesis argument requires all four.

▲ Thread 01 · Piece 1

The statement

May 4, 2026. Anthropic’s head of policy publicly commits to 60%+ probability of automated AI R&D by end of 2028. First numerical commitment by sitting frontier-lab leadership to a specific takeoff threshold within a specific timeframe.

Full pieceJack Clark Says It Out Loud

▲ Thread 02 · Piece 2

The cascade

Six benchmarks measuring AI R&D capability all saturate or track toward saturation on the same cadence. SWE-Bench 93.9%, CORE-Bench solved, METR 30s→12hr in 4 years. Pattern is the structural argument; the data supports the timeline.

Full pieceThe Benchmark Saturation Cascade

▲ Thread 03 · Piece 3

The math

0.999^500 = 0.606. 99.9% per-generation alignment decays to 60.6% across 500 generations of recursive self-improvement. 5+ nines needed at 10K generations; current toolkit produces ~3 nines on adversarial bench. Multiple orders of magnitude short.

Full pieceThe Compounding Error Problem

▲ Thread 04 · Piece 4

The endpoint

AI labor ~5,000× cheaper than human labor for cognitive functions. Three stages: tool inside human firms → AI-native firms compete → machine-to-machine economy. Default scenario if alignment is solved. Self-reinforcing transition.

Full pieceThe Machine Economy

The convergence · how the threads connect

The AI Marketing Canvas, Second Edition: A Five-Step AI Plan for Marketers

As an affiliate, we earn on qualifying purchases.

Four threads. Four convergence arguments.

The threads converge structurally rather than independently. Each pair of threads produces a specific structural argument. The aggregate is larger than the parts.

How the four threads converge structurally

Each pair produces a specific argument. All four operate on the same 32-month window.

▲ T2 → T1 · SUPPORT

The cascade supports the statement

▲ T1 + T3 · CATASTROPHIC TIMELINE

Statement + math = alignment urgency

▲ T1 + T4 · POLICY EMERGENCY

Statement + endpoint = structural policy crisis

▲ T2 + T4 · DEPLOYMENT VELOCITY

Cascade + endpoint = machine economy timing

Five synthesis-level omissions · what the integrated read adds

Agentic AI Architectural Patterns: Engineering Blueprint to Build 24/7 Autonomous Agents That Work While You Sleep | Master Production-Grade Automation, Build Deterministic Pipelines & Control Costs

As an affiliate, we earn on qualifying purchases.

Clark’s essay doesn’t say.

Each sub-piece identified per-thread omissions. The synthesis level has its own omissions — features of the integrated argument that don’t appear in any single sub-piece but emerge when the threads are read together. Each is a real coordination problem with no resolution at scale.

What Clark left out at the synthesis level

Five structural features of the integrated argument that Clark’s essay doesn’t engage with.

The China dimension

Clark’s essay is structurally a US-domestic document. Chinese frontier labs (DeepSeek, Qwen, Zhipu, Moonshot) are 6-12 months behind and narrowing. Coordination problem is US-China, not US-internal. Coordination may be unsolvable on the timeline through current policy mechanisms.

GEOPOLITICAL

The IPO valuation implication

Anthropic IPO at $900B in Q4 2026 is the market’s implicit assessment of Clark’s three implications. Valuation only pays off if alignment solved + machine economy capture high. The IPO disclosure documents will need to address both. Clark’s essay is part of the public-record context.

CORPORATE FINANCE

The compute supply binding

Capability may saturate before physical infrastructure can deploy at scale. $500B+ capex announced but constrained by power, cooling, semiconductor capacity, grid interconnection. 60%/2028 may be the upper bound if compute binds. Most likely non-capability-ceiling failure mode.

INFRASTRUCTURE

The information ecology problem

Same capability advances that produce automated AI R&D produce machine-cadence content generation in arbitrary modalities. Information ecology challenge is the leading wave; economic challenge is the trailing wave. Democratic institutions depend on functional info ecology. Current institutional response inadequate.

EPISTEMIC INFRA

The coordination problem at scale

The fundamental problem. Each lab has incentives incompatible with alignment timeline. Each government has incentives incompatible with international coordination. Three resolutions: coordinating institution (5-10 years to build), coordinating crisis (unpredictable), coordination failure (default). Default most likely.

FUNDAMENTAL

The 32-month window · what to watch for

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering – 96GB DDR7 ECC Memory – 4th Gen RT/5th Gen Tensor Core GPU – OEM Packaging

[NVIDIA Blackwell Streaming Multiprocessor] The new SM features increased processing throughput, and new neural shaders that integrate neural…

As an affiliate, we earn on qualifying purchases.

Thirty-two months. Five markers.

From May 4, 2026 to December 31, 2028 is 32 months. The trajectory either delivers the threshold Clark forecasts or it doesn’t. Specific indicators along the way that resolve the synthesis read in either direction.

The 32-month resolution window

Capability markers, policy markers, and forecast-update events that the next 32 months should produce.

MAY 2026

LATE 2026

MID 2027

LATE 2027 / MID 2028

END 2028

Now · baseline

Clark publishes 60%/2028
METR ~12 hr
SWE-Bench 93.9%
CORE solved
Anthropic IPO prep

Cotra resolves

METR ~100hr target
SWE saturated
MLE-Bench saturating
PostTrain 40-50%
Anthropic IPO Q4

RSI proof-of-concept

METR 300-500hr
MLE saturated
PostTrain at human
RSI demo non-frontier
30%/2027 evidence

Acute window opens

METR 1K-3K hr
“Trains successor” demos
Alignment claims
Catastrophic-risk window
Stage 2 visible

Forecast resolves

METR ~10K hr (naive)
Automated AI R&D OR
Inflection visible
Machine economy Stage 3
Black hole crossed

Where the analysis might be wrong · five potential errors

AI Governance Playbook: How to Secure, Control, and Optimize Artificial Intelligence Initiatives

As an affiliate, we earn on qualifying purchases.

Five errors. Honest probabilities.

A serious analysis owes the reader an explicit account of where it could be wrong. Five categories of potential error in the synthesis above. The structural finding survives at lower forecast probabilities but is less acute.

Five categories of potential error

Each could shift the synthesis read materially. Probability assignments are subjective and held loosely.

Capability trajectory may bend

METR curve has been exponential for 4 years with no inflection. 30-40% probability of meaningful inflection by end-2028. Mechanisms: scaling laws shift, algorithmic ceilings, reliability gap persists. Would shift 60% forecast toward 35-50%.

30-40%

Compute supply may bind harder

Physical buildout factors — power, cooling, semis, grid — could constrain deployment. 30% probability of materially harder binding than capex announcements imply. Would shift timeline 6-18 months. Most likely non-capability failure mode.

~30%

Alignment may close the gap

Current 3 nines on adversarial bench. Could improve materially via automated alignment research, mechanistic interpretability, or formal verification breakthroughs. 15-25% probability of substantive breakthrough in 32 months. Would change compounding error analysis substantially.

15-25%

Coordination may be tractable

Historical examples of fast institutional response under pressure exist (nuclear arms control, ozone, post-2008). 15-30% probability of meaningful coordination on the timeline, conditional on a precipitating event. Would change the coordination-failure component.

15-30%

Machine economy may deploy slower

Even if AI engineering saturates on schedule, machine economy deployment requires regulatory permission, organizational change, customer acceptance. Probability of Stage 2 at meaningful scale by end-2028: 50-65%, lower than capability suggests. Affects policy-emergency timing.

50-65%

The structural finding · in three parts

Three parts. One window.

The four threads converge. The synthesis-level omissions sharpen the picture. The structural finding is the answer to “what does the Clark essay actually tell us, and what does it imply we should do?”

The structural finding · the synthesis read

Three parts. Each is an empirically resolvable claim about the next 32 months and the institutional response.

The AGI debate is closed for the people who would know.

Anthropic’s head of policy has publicly committed to a 60%+ probability of automated AI R&D arrival by end of 2028. The forecast is supported by public benchmark data. The question is no longer “is fast AI capability coming?” It is “what do we do during the window in which we still have time to act?” Anyone arguing AGI-relevant capability is 20+ years away is arguing against the public statement of the person institutionally positioned to know.

The 32 months are structurally bounded.

From May 4, 2026 to December 31, 2028. The timeline is bounded. It is also fast. The institutional response cycle in most democracies is longer than 32 months for substantial policy changes. The response window is shorter than the institutional capacity to respond. Within the window, specific empirical events resolve the forecast in either direction — the trajectory is falsifiable.

Current institutional capacity is structurally inadequate.

Alignment research is racing capability and losing. Policy frameworks are calibrated to slower trajectories. International coordination is nascent. Fiscal frameworks for machine economy don’t exist. Info ecology defenses are inadequate. Multi-lab race coordination doesn’t exist at institutional level. Each inadequacy is being worked on somewhere. None is on the timeline the synthesis read requires. Building institutional capacity at scale and pace is the central project of the next 32 months.

The black hole is visible. The event horizon is 32 months out. We can see the geometry around the singularity. We cannot see past it. What we can do during the window is build the institutional response that will determine what we encounter on the other side.

— The structural read · May 2026

Implications of a Potential Autonomous AI Research Breakthrough

This forecast signals a possible paradigm shift in AI development, where fully autonomous research systems could accelerate innovation, but also pose significant risks. Read more about Jack Clark’s forecast. If such systems emerge, they could bypass human oversight, leading to unpredictable outcomes. The institutional readiness to manage these risks is currently inadequate, making this forecast highly relevant for policymakers, researchers, and industry leaders.

The timing and likelihood of this transition could influence AI regulation, safety protocols, and the future of technological innovation. Understanding the convergence of technical progress and institutional response is critical for shaping effective policies to mitigate potential hazards associated with autonomous AI systems.

Recent Benchmarks and Technological Progress Toward Autonomous AI

Over the past two years, multiple independent benchmarks have shown exponential improvements in AI capabilities. For example, the SWE-Bench performance increased from 2% in late 2023 to nearly 94% in May 2026, a 47-fold jump. Similarly, the METR time horizon extended from 30 seconds in 2022 to 12 hours in 2026, indicating rapid progress toward longer, more complex tasks that are essential for autonomous research.

Other benchmarks, such as CORE-Bench and MLE-Bench, have reached saturation points, with some declaring the tasks ‘solved.’ Notably, AI training speeds have increased from 2.9× to over 52× the human baseline within a year, further supporting the trajectory toward autonomous research capabilities. These data points collectively suggest that the technological threshold for fully autonomous AI research could be within reach by 2028, aligning with Clark’s forecast.

While these benchmarks are promising, experts caution that translating benchmark performance into real-world autonomous research remains uncertain. The convergence of these indicators, however, underscores the urgency of assessing institutional preparedness. Learn more about the forecast’s implications.

“there’s a likely chance (60%+) that no-human-involved AI R&D — an AI system powerful enough that it could plausibly autonomously build its own successor — happens by the end of 2028.”
— Jack Clark

Uncertainties Surrounding Autonomous AI Development Timeline

While the data supports a trajectory toward autonomous AI research by 2028, significant uncertainties remain regarding whether technical thresholds will be met, how systems will behave at scale, and the societal and regulatory responses. The analogy of a ‘black hole’ suggests that once past a certain point, the future becomes fundamentally unpredictable, and current models cannot accurately forecast what lies beyond that threshold.

Furthermore, the actual emergence of fully autonomous research systems depends on multiple factors, including breakthroughs in alignment, safety, and hardware capabilities, which are not guaranteed to occur on schedule.

Next Steps for Monitoring Autonomous AI Progress

Researchers and policymakers need to closely monitor benchmark developments, compute capacity trends, and institutional responses over the coming months. Key milestones include the next wave of benchmark saturation points, advances in AI training speeds, and the deployment of systems with autonomous research capabilities.

Engagement with safety and governance frameworks must accelerate to prepare for potential breakthroughs. Find out how experts are responding. Public and private sector collaboration will be essential to develop strategies for managing risks associated with autonomous AI systems and ensuring that institutional capacity keeps pace with technological progress.

Finally, ongoing assessments of the forecast’s accuracy and the evolution of technical capabilities will inform policy adjustments and safety protocols over the next 32 months.

Key Questions

What is the main significance of Clark’s forecast?

Clark’s forecast indicates a high probability that autonomous AI research systems capable of self-building could emerge by 2028, signaling a potential paradigm shift with profound safety, ethical, and policy implications.

How reliable are the benchmarks supporting this forecast?

The benchmarks show exponential improvements in AI capabilities, but translating these into real-world autonomous research remains uncertain. The pattern is compelling but not definitive.

What risks are associated with autonomous AI research?

Potential risks include loss of human oversight, unpredictable system behavior, and challenges in ensuring safety and alignment at scale. Institutional capacity to manage these risks is currently insufficient.

Why is the next 32 months considered critical?

This period is seen as the window in which the technological thresholds for autonomous research might be crossed, requiring urgent policy and safety measures to prepare for possible breakthroughs.

What actions should policymakers take now?

Policymakers should enhance monitoring of technical progress, accelerate safety and governance frameworks, and foster collaboration between industry and regulators to better prepare for potential autonomous AI systems.

Source: ThorstenMeyerAI.com

The Co-Founder’s Black Hole — A Structural Read on Jack Clark’s Automated AI R&D Essay

Up next

732 Bytes to Root. One Hour of Scan Time.

Author

The Idea Magazine Team

Share article

The black hole
is visible.

Four pieces. One argument.

The AI Marketing Canvas, Second Edition: A Five-Step AI Plan for Marketers

Four threads. Four convergence arguments.

Agentic AI Architectural Patterns: Engineering Blueprint to Build 24/7 Autonomous Agents That Work While You Sleep | Master Production-Grade Automation, Build Deterministic Pipelines & Control Costs

Clark’s essay doesn’t say.

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering – 96GB DDR7 ECC Memory – 4th Gen RT/5th Gen Tensor Core GPU – OEM Packaging

Thirty-two months. Five markers.

AI Governance Playbook: How to Secure, Control, and Optimize Artificial Intelligence Initiatives

Five errors. Honest probabilities.

Three parts. One window.

Implications of a Potential Autonomous AI Research Breakthrough

Recent Benchmarks and Technological Progress Toward Autonomous AI

Uncertainties Surrounding Autonomous AI Development Timeline

Next Steps for Monitoring Autonomous AI Progress

Key Questions

What is the main significance of Clark’s forecast?

How reliable are the benchmarks supporting this forecast?

What risks are associated with autonomous AI research?

Why is the next 32 months considered critical?

What actions should policymakers take now?

The record Apollo 13 set for the farthest humans had ever travelled from Earth was never meant to be a record — it was a survival manoeuvre after an explosion, and Artemis II quietly surpassed it on a clear April morning in 2026

Open Code Review – An AI-powered code review CLI tool

AMÁLIA · The Three Hard Questions.

7.8 magnitude earthquake shakes part of southern Philippines. Tsunami possible

Xbox Outage

VLC For Unity Now Supported On Linux

Self-contained Highly-portable Python Distributions

Cadence Design Systems Surges In Global Coverage

The Co-Founder’s Black Hole — A Structural Read on Jack Clark’s Automated AI R&D Essay

Up next

Author

The Idea Magazine Team

Share article

Four pieces. One argument.

The AI Marketing Canvas, Second Edition: A Five-Step AI Plan for Marketers

Four threads. Four convergence arguments.

Agentic AI Architectural Patterns: Engineering Blueprint to Build 24/7 Autonomous Agents That Work While You Sleep | Master Production-Grade Automation, Build Deterministic Pipelines & Control Costs

Clark’s essay doesn’t say.

NVD RTX PRO 6000 Blackwell Professional Workstation Edition Graphics Card for AI, Design, Simulation, Engineering – 96GB DDR7 ECC Memory – 4th Gen RT/5th Gen Tensor Core GPU – OEM Packaging

Thirty-two months. Five markers.

AI Governance Playbook: How to Secure, Control, and Optimize Artificial Intelligence Initiatives

Five errors. Honest probabilities.

Three parts. One window.

Implications of a Potential Autonomous AI Research Breakthrough

Recent Benchmarks and Technological Progress Toward Autonomous AI

Uncertainties Surrounding Autonomous AI Development Timeline

Next Steps for Monitoring Autonomous AI Progress

Key Questions

What is the main significance of Clark’s forecast?

How reliable are the benchmarks supporting this forecast?

What risks are associated with autonomous AI research?

Why is the next 32 months considered critical?

What actions should policymakers take now?

You May Also Like