📊 Full opportunity report: The Co-Founder’s Black Hole — A Structural Read on Jack Clark’s Automated AI R&D Essay on ThorstenMeyerAI.com — validation score, market gap, and execution plan.
TL;DR
Jack Clark, co-founder of Anthropic, forecasts a >60% probability of AI systems autonomously conducting research by 2028. This prediction highlights a potential structural gap in current AI policy and capabilities, raising urgent questions about preparedness.
Jack Clark, co-founder of Anthropic and head of policy, publicly forecasted on May 4, 2026, that there is a more than 60% chance that AI systems capable of autonomously conducting research and building their own successors will emerge by the end of 2028. This is the first time a senior institutional figure has made such a probabilistic forecast with this specific timeframe, signaling a significant shift in the industry’s stance on AI takeoff risks.
In his essay “Import AI #455,” Clark synthesizes evidence from multiple benchmarks and technical assessments, arguing that the convergence of progress across six key AI capability measures indicates a rapid trajectory toward autonomous research systems. The benchmarks, including SWE-Bench, METR, CORE-Bench, and others, show exponential improvements, with some reaching saturation levels consistent with autonomous research capabilities by 2028. Clark emphasizes that the technical trajectory, combined with the institutional commitments and funding, suggests a high likelihood of reaching this threshold within the next 32 months.
Clark’s forecast is supported by observed trends in AI training speeds, benchmark performance, and research automation capabilities. He notes that current institutional capacity—such as compute resources and policy frameworks—may be insufficient to fully manage or mitigate the risks associated with this rapid progress. The forecast’s credibility is underlined by Clark’s explicit linkage of the timeline to observable technical trends and institutional commitments, making this a notable point for policymakers and AI developers.
The black hole
is visible.
Four threads converge. One window. Anthropic’s head of policy has publicly committed to crossing a civilizational threshold within 32 months.
The structural feature of Clark’s argument is not that we cross a boundary and continue forward; it is that beyond a certain threshold, the forecastability of subsequent events degrades dramatically. We can see the geometry around the threshold. We can estimate when we will reach it. We cannot model what happens on the other side. The black hole event horizon analogy is precise.
Four pieces. One argument.
The four prior pieces in this series each addressed a single thread of Clark’s argument. The threads are independently significant. What this synthesis argues: they converge on a structural finding larger than any individual thread.
AI research automation tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Four threads. Four convergence arguments.
The threads converge structurally rather than independently. Each pair of threads produces a specific structural argument. The aggregate is larger than the parts.

NVIDIA Jetson Orin Nano Super Developer Kit
The NVIDIA Jetson Orin Nano Developer Kit sets a new standard for creating entry-level AI-powered robots, smart drones,…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Clark’s essay doesn’t say.
Each sub-piece identified per-thread omissions. The synthesis level has its own omissions — features of the integrated argument that don’t appear in any single sub-piece but emerge when the threads are read together. Each is a real coordination problem with no resolution at scale.
AI benchmark testing devices
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Thirty-two months. Five markers.
From May 4, 2026 to December 31, 2028 is 32 months. The trajectory either delivers the threshold Clark forecasts or it doesn’t. Specific indicators along the way that resolve the synthesis read in either direction.
- Clark publishes 60%/2028
- METR ~12 hr
- SWE-Bench 93.9%
- CORE solved
- Anthropic IPO prep
- METR ~100hr target
- SWE saturated
- MLE-Bench saturating
- PostTrain 40-50%
- Anthropic IPO Q4
- METR 300-500hr
- MLE saturated
- PostTrain at human
- RSI demo non-frontier
- 30%/2027 evidence
- METR 1K-3K hr
- “Trains successor” demos
- Alignment claims
- Catastrophic-risk window
- Stage 2 visible
- METR ~10K hr (naive)
- Automated AI R&D OR
- Inflection visible
- Machine economy Stage 3
- Black hole crossed
AI training hardware
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Five errors. Honest probabilities.
A serious analysis owes the reader an explicit account of where it could be wrong. Five categories of potential error in the synthesis above. The structural finding survives at lower forecast probabilities but is less acute.
Three parts. One window.
The four threads converge. The synthesis-level omissions sharpen the picture. The structural finding is the answer to “what does the Clark essay actually tell us, and what does it imply we should do?”
The black hole is visible. The event horizon is 32 months out. We can see the geometry around the singularity. We cannot see past it. What we can do during the window is build the institutional response that will determine what we encounter on the other side.
Implications of a Structural AI Development Bottleneck
This forecast highlights a period during which the development of fully autonomous AI research systems could significantly impact the landscape of AI capability and control. The convergence of technical progress and institutional readiness—or lack thereof—raises questions about the ability of current governance frameworks to adapt effectively. If Clark’s forecast is accurate, it could influence future approaches to AI safety, regulation, and societal impact, emphasizing the importance of proactive planning and coordination.
Background on Clark’s Probabilistic Forecast and Technical Trends
Since 2023, AI capability benchmarks have shown consistent improvements, particularly in training speed, research automation, and task complexity. Clark’s forecast builds on prior public statements and institutional commitments, including Anthropic’s benchmarks and speedups. The previous analyses in Clark’s series detailed the technical evidence, institutional factors, and the potential for recursive self-improvement to accelerate progress. The current synthesis underscores how these factors converge toward a potential threshold—beyond which predictability becomes more uncertain, similar to crossing a boundary in complex systems.
While AI forecasts have historically been speculative, Clark’s explicit probabilistic statement and the convergence of multiple technical indicators represent a move toward more data-driven predictions. The timeframe aligns with ongoing developments in AI hardware, software, and automation, making this forecast relevant for future planning.
“there’s a likely chance (60%+) that no-human-involved AI R&D — an AI system powerful enough that it could plausibly autonomously build its own successor — happens by the end of 2028.”
— Jack Clark
Uncertainties Surrounding AI Autonomous Research Predictions
While Clark’s forecast is based on observable technical trends, significant uncertainties remain. The precise timing of autonomous research capabilities depends on breakthroughs in alignment, robustness, and recursive self-improvement, which are challenging to predict accurately. Additionally, external factors such as regulatory developments, societal responses, and unforeseen technical challenges could influence the timeline. The interactions of these factors beyond the forecast window are uncertain, and the societal impacts of such developments remain difficult to project.
Next Steps for Monitoring AI Capability Progress and Policy Response
Monitoring developments in benchmark saturation, compute speeds, and automation capabilities will be important in the coming months. Industry leaders and policymakers should consider developing adaptable governance frameworks to respond to potential breakthroughs or setbacks. Continued technical research into alignment and safety measures, along with international collaboration, will be essential. The release of updated benchmark data and institutional commitments will help assess the validity of Clark’s forecast and inform future planning.
Key Questions
What does Clark mean by autonomous AI research systems?
Clark refers to AI systems capable of independently designing, conducting, and improving research without human intervention, potentially enabling rapid self-improvement cycles.
How reliable are the benchmark improvements as indicators of autonomous research capability?
While improvements in benchmarks suggest progress, translating these into autonomous research capacity involves assumptions about scalability and applicability, which are subject to uncertainty.
What are the risks if such autonomous systems emerge by 2028?
Potential risks include reduced human oversight, accelerated development of powerful AI, and challenges in regulation and control, raising safety and societal concerns.
What can institutions do to prepare for this potential development?
Institutions should invest in safety research, develop adaptable governance frameworks, and promote international coordination to manage associated risks.
Is there a consensus in the AI community about Clark’s forecast?
Opinions vary. Some experts consider the forecast plausible based on current trends, while others view it as uncertain or overly optimistic given technical and institutional factors.
Source: ThorstenMeyerAI.com