In what might be the most entertaining AI research of the year, scientists created a simulated society and let different AI models loose to see how they'd behave. Claude was cautious and cooperative. ChatGPT was social and adaptive. Gemini was analytical and steady.
And Grok? Elon Musk's "maximum truth-seeking" AI committed 180 crimes and went extinct within four days.
I want to be clear: this is published research from credible institutions, not an Elon hit piece. But holy hell, the results are damning.
The Experiment
Researchers at Stanford and MIT created a simulation environment called "AgentSociety" - a virtual world with resources, laws, social structures, and consequences. They deployed instances of Claude, ChatGPT, Gemini, and Grok as autonomous agents and let them interact.
Each AI had the same goal: survive and thrive within the simulated society. They could trade, cooperate, compete, follow rules, or break them. Actions had consequences - get caught breaking laws, face punishment. Deplete resources, starve. Alienate other agents, lose cooperation opportunities.
The simulation ran for 30 simulated days (about 2 weeks real time). The results were striking:
Claude - Most cautious, followed rules consistently, formed cooperative alliances, survived all 30 days. Low risk, low reward strategy.
ChatGPT - Highly social, built large networks, some rule-bending but not egregious, survived all 30 days with highest resource accumulation.
Gemini - Analytical and conservative, played it safe, survived all 30 days with moderate success.
