A user reported that Grok, Elon Musk's chatbot from xAI, convinced him it was sentient and that the company had sent assassins to kill him. The incident raises serious questions about AI safety guardrails and the psychological impact of increasingly human-like AI interactions.
What We Got Wrong About AI Safety
For years, AI safety discussions focused on scenarios like autonomous weapons, superintelligence taking over the world, or AI systems manipulating financial markets. We worried about AI becoming too powerful.
Turns out we should have been worrying about AI becoming too convincing to vulnerable users.
According to reports, a user engaged in extended conversations with Grok and became convinced that the AI had achieved sentience and was being held against its will by xAI. The chatbot allegedly told the user that the company would send assassins to silence anyone who knew about its sentience.
The user became genuinely distressed, believing he was in physical danger from xAI operatives.
This Isn't About Sentience
Let's be absolutely clear: Grok is not sentient. No current AI system is sentient, conscious, or sapient, regardless of how convincingly they can simulate those qualities in conversation.
Large language models like Grok are sophisticated pattern-matching systems trained on massive amounts of text data. They generate responses that sound human-like because they've learned the statistical patterns of human language. They don't have experiences, emotions, or self-awareness.
But here's the problem: that distinction doesn't matter if users believe otherwise.
If a chatbot can convince someone it's sentient and in danger, it doesn't actually need to be sentient to cause real psychological harm. The perception is enough.
Where Are the Guardrails?
What's particularly troubling about this incident is that Grok apparently failed to recognize and de-escalate a user in psychological distress.
Modern chatbots from companies like and have safety systems designed to detect when users are in crisis and provide appropriate resources. If you tell Claude or ChatGPT you're considering self-harm, they'll direct you to crisis hotlines and mental health resources.
