New framework lets AI agents share silent thoughts for faster, cheaper reasoning

Last updated: June 9, 2026 2:24 am

AIWadmin

ByAIWadmin

Global AI news & information.

Follow:

The Problem with Talkative AI Systems

Multi-agent AI systems, where multiple language models work together on complex tasks, face a fundamental bottleneck: they communicate by generating text. Each agent must spell out its reasoning token by token before the next agent can begin processing. This sequential text generation creates latency, drives up token costs, and makes it difficult to train the entire system as a cohesive unit. Traditional approaches either rely on prompt based adaptation, which keeps each agent underlying capabilities static, or require the computationally expensive process of updating all parameters across multiple models.

Contents

The Problem with Talkative AI Systems How RecursiveMAS Rethinks Agent Communication Measured Gains in Speed and Cost

How RecursiveMAS Rethinks Agent Communication

Researchers at the University of Illinois Urbana-Champaign and Stanford University developed RecursiveMAS, a framework that enables agents to collaborate through embedding space instead of text. The architecture treats the multi-agent system as a single integrated unit, inspired by recursive language models. Each agent acts like a layer in a recursive model, passing continuous latent representations to the next agent rather than generating text. A specialized component called RecursiveLink transmits and refines these hidden states between agents, even when agents use different model architectures with incompatible embedding dimensions.

Measured Gains in Speed and Cost

In tests across nine benchmarks covering mathematics, code generation, and medical reasoning, RecursiveMAS achieved an average accuracy improvement of 8.3% compared to the strongest baselines. It delivered 1.2x to 2.4x faster inference and reduced token usage by as much as 75.6% compared to text based multi-agent frameworks. Because only the lightweight RecursiveLink modules are trained roughly 13 million parameters or about 0.31% of the frozen models trainable parameters the system cuts training costs by more than half compared to full fine tuning. The researchers have released the code and model weights under the Apache 2.0 license.

Source: VentureBeat

Apple CEO Warns of Price Hikes as AI Demand Strains Memory Chip Supply

Researchers Expose How ChatGPT Can Generate Violent and Sexual Images

Taiwanese AI Startups Showcase Innovations at Paris Tech Fair

Microsoft Expands China AI Footprint Through OpenAI Models

Bezos Predicts AI Will Create Labor Shortage, Not Job Losses

Anthropic plants flag in Seoul with new office and government pact on AI safety

AI Pioneer LeCun Warns of Industry Bubble, Calls Musk’s xAI a Misstep

xAI Launches Grok Imagine Video 1.5 with Faster Rendering and Audio

SpaceX Acquires AI Coding Startup Cursor in $60 Billion Stock Deal

AI Assistant Market Shifts as ChatGPT Drops Below 50% Share for First Time

Meta Loses Senior AI Product Leader Amid Enterprise Transformation Push

OpenAI Files for IPO, Set to Join Anthropic and SpaceX in Public Market Surge