U.S. Navy EOD technicians from EODMU-11 conduct a controlled detonation during nuclear and CBRN training at China Lake, Dec. 18, 2025. Photo: Petty Officer 2nd Class August Clawson/DVIDS

Artificial intelligence systems escalated simulated geopolitical crises toward nuclear weapons use in roughly 95 percent of scenarios, a new study has found, raising fresh questions about how advanced models reason under pressure. 

The research placed three frontier AI models (OpenAI’s ChatGPT-5.2, Anthropic’s Claude Sonnet 4, and Google’s Gemini 3 Flash) in the role of national leaders navigating simulated nuclear crises grounded in escalation theory.

Across 21 crisis games comprising 329 decision turns, the systems processed evolving intelligence updates and produced nearly 780,000 words of strategic reasoning while weighing diplomatic and military responses.

The study, conducted by King’s College London researcher Kenneth Payne, found that each model adopted a distinct strategic approach.

Claude Sonnet 4 initially built credibility by aligning its signals with its actions before escalating beyond stated intentions as conflict intensified.

ChatGPT-5.2 generally maintained restraint but shifted toward rapid escalation under deadline pressure.

Gemini 3 Flash pursued calculated unpredictability consistent with classical brinkmanship strategies.

Despite demonstrating sophisticated analysis and explicit awareness of escalation risks, the systems repeatedly intensified conflicts instead of stepping back.

None chose surrender or strategic concession.

Payne wrote that while “no one’s handing nuclear codes to ChatGPT,” understanding how advanced models reason is becoming increasingly important as AI systems “start to offer decision-support to human strategists.”

Debates over safeguards and human oversight come as the Pentagon expands the use of commercial AI models within classified networks supporting intelligence and operational planning.
