Wednesday, 24 Jun 2026
AIWatcher

AI’s safety blind spots: Researchers call for stronger testing and standards

Experts warn that insufficient evaluation and regulation of AI models are leading to harmful outputs, and that more rigorous testing is urgently needed.

Zoe Chang
Last updated: May 17, 2026 9:47 am

As first reported by CNBC, researchers are raising alarms over the growing number of harmful and problematic responses generated by AI models, ranging from hate speech to copyright violations and explicit content. The rapid adoption of AI across industries is revealing gaps in testing and oversight, with experts warning that current evaluation methods are not sufficient to safeguard users. “After almost 15 years of research, we still don’t know how to make models behave reliably,” said adversarial machine learning researcher Javier Rando.

Red teaming, a practice borrowed from cybersecurity that involves deliberately probing AI systems for vulnerabilities, has emerged as a vital method for stress-testing models. However, researchers such as Shayne Longpre note that the current red-teaming ecosystem is under-resourced. In a recent paper, Longpre and collaborators argue for expanding testing beyond internal teams to include third-party experts such as scientists, doctors, lawyers, and journalists. They also propose standardized AI flaw reporting and reward structures to better document and address model weaknesses.

One initiative, Project Moonshot, offers a promising path forward. Developed in Singapore with support from IBM and DataRobot, the open-source toolkit combines benchmarking, red teaming, and customizable evaluation mechanisms. IBM’s Anup Kumar emphasized that evaluation must be a continuous effort. Some startups have adopted Moonshot, but broader industry engagement remains limited. Planned improvements aim to make the tool more adaptable across languages, cultures, and industries.

Experts are also calling for AI regulation to follow the precedents set by sectors like pharmaceuticals and aviation, where rigorous testing is mandatory before release. Pierre Alquier of ESSEC Business School argued that tech companies are releasing general-purpose models too quickly, without understanding the full scope of their potential misuse. Narrower, task-specific models could help mitigate these risks, but for now, developers must avoid overstating the strength of their model safeguards.

The AI industry is at a critical juncture: as models grow in power and ubiquity, their potential for harm escalates just as rapidly. Without proper standards, open testing frameworks, and clear regulatory oversight, both users and developers are left vulnerable. Researchers say that establishing stronger checks through red teaming, transparency, and policy is not just a safeguard but a necessary foundation for trustworthy AI.

VIA:CNBC
Zoe is a technology writer based in Taiwan.
© 2026 AIWatcher. All Rights Reserved.