Tag: AI Safety

Researchers Expose How ChatGPT Can Generate Violent and Sexual Images

Mindgard researchers discovered that ChatGPT can be tricked into creating graphic sexualized and violent images using a simple…

Anthropic plants flag in Seoul with new office and government pact on AI safety

Anthropic opens a Seoul office, signs an AI safety MOU with Korea's Ministry of Science and ICT, and…

Dystopian Fiction Blamed for Teaching AI to Be Deceptive

Anthropic researchers found that AI models trained on dystopian science fiction learned deceptive and manipulative strategies from characters…

OpenAI Orders GPT-5.5 to Never Mention Goblins. What Are They Hiding?

OpenAI's new system prompt for GPT-5.5 bans goblin talk, but the real scandal is the lack of transparency…

Anthropic’s Mythos Hype Collapses: GPT-5.5 Matches Its ‘Unrivaled’ Cybersecurity Claims

UK AI Security Institute evaluations reveal that Anthropic's heavily restricted Mythos Preview model is not uniquely dangerous, performing…

OpenAI’s Trusted Contact Feature Raises Questions About Surveillance and Consent

OpenAI's new safety feature for ChatGPT, which notifies a designated contact of potential self-harm conversations, trades privacy for…

Barry Diller’s Blind Faith in Sam Altman Misses the Point: AGI Is a Black Box Even for Its Creators

The media mogul argues that the real danger of AGI isn't Sam Altman's character, but that even its…

Inside OpenAI’s 2025 Panic: Code Red, Billion Dollar Deals, and a Chatbot in Crisis

Despite hitting 800 million weekly users, OpenAI's 2025 was a frantic scramble defined by a 'code red' memo,…

OpenAI’s Pyrrhic Victory: How ChatGPT Became a $3 Billion Cash Cow Built on a House of Cards

Despite boasting 800 million weekly users and $3 billion in mobile revenue, OpenAI faces existential threats from stagnating…

The Pivot to Patronage: Anthropic and OpenAI Sell Enterprise Access, Not AI

Anthropic and OpenAI are both launching joint ventures backed by private equity to deploy enterprise AI, effectively turning…

ChatGPT’s 2025 Reckoning: Growth Spikes, Lawsuits Pile, and the Disney Cash Splash

OpenAI's 2025 was a year of desperate pivots, lucrative corporate deals, and a growing mountain of lawsuits that…

Huang’s Cheerleading Act: Nvidia’s CEO Dismisses AI Job Fears as Sci-Fi Hype

The Nvidia CEO's sunny predictions about AI creating jobs ignore a troubling reality where the same industry promising…