Tuesday, 23 Jun 2026
Subscribe to AIWatcher
AIWatcher
  • Home
  • News

    Apple CEO Warns of Price Hikes as AI Demand Strains Memory Chip Supply

    By
    AIWadmin

    Researchers Expose How ChatGPT Can Generate Violent and Sexual Images

    By
    AIWadmin

    Taiwanese AI Startups Showcase Innovations at Paris Tech Fair

    By
    AIWadmin

    Microsoft Expands China AI Footprint Through OpenAI Models

    By
    AIWadmin

    Bezos Predicts AI Will Create Labor Shortage, Not Job Losses

    By
    AIWadmin

    Anthropic plants flag in Seoul with new office and government pact on AI safety

    By
    AIWadmin
  • Articles

    AI Pioneer LeCun Warns of Industry Bubble, Calls Musk’s xAI a Misstep

    By
    AIWadmin

    xAI Launches Grok Imagine Video 1.5 with Faster Rendering and Audio

    By
    AIWadmin

    SpaceX Acquires AI Coding Startup Cursor in $60 Billion Stock Deal

    By
    AIWadmin

    AI Assistant Market Shifts as ChatGPT Drops Below 50% Share for First Time

    By
    AIWadmin

    Meta Loses Senior AI Product Leader Amid Enterprise Transformation Push

    By
    AIWadmin

    OpenAI Files for IPO, Set to Join Anthropic and SpaceX in Public Market Surge

    By
    AIWadmin
  • Spotlight

    New framework lets AI agents share silent thoughts for faster, cheaper reasoning

    By
    AIWadmin

    NVIDIA Jetson Gains Agentic AI with JetPack 7.2 and NemoClaw Framework

    By
    AIWadmin

    How OpenAI’s Algebraic Gambit Toppled a 50-Year-Old Number Theory Giant

    By
    AIWadmin

    Apple’s iOS 27 Siri Overhaul: A Strategic Pivot to AI Brokerage, Not Innovation

    By
    AIWadmin

    OpenAI Publishes Governance Framework as California and EU AI Laws Take Shape

    By
    AIWadmin

    Anthropic Unveils Dynamic Workflows for Claude Code: Parallel AI Agents at Scale

    By
    AIWadmin
  • Events
  • More
    • About
    • Services
    • Contact
  • 🔥
  • Alerts
  • Alignment
  • Explainability
  • Legal/Compliance
  • Startups
  • Safety
  • Chips
  • Mobility
  • Vision
  • Robotics
  • Research
  • Medical/Healthcare
Font ResizerAa
AIWatcherAIWatcher
  • Home
  • News
  • Articles
  • Spotlight
  • Events
  • About
Search
  • Quick Links
    • Home
    • News
    • Articles
    • Spotlight
    • Events
  • About AIWatcher
    • Mission
    • Services
    • Contact
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
News

Google Splits Its Next Gen TPU: One Chip for Training, One for Agents

AIWadmin
Last updated: May 17, 2026 9:45 am
AIWadmin
ByAIWadmin
Global AI news & information.
Follow:
Share
SHARE

A Tale of Two Tensor Processors

Google has officially unveiled its eighth generation of Tensor Processing Units, but this time it is not a single chip. Instead, the company has split its custom AI hardware into two distinct variants. The TPU 8t is built exclusively for the massive computational demands of training frontier models. The TPU 8i is designed specifically for the inference phase, where models generate responses and power autonomous agents.

Contents
A Tale of Two Tensor ProcessorsPerformance and Efficiency at ScaleInfrastructure Designed for a Cloud Native Future

This bifurcation reflects Google’s belief that the industry is entering the “agentic era.” The company argues that running specialized agents requires a fundamentally different hardware architecture than training a monolithic model. By separating the workloads, Google aims to maximize efficiency across the entire AI lifecycle, allowing cloud customers to pay only for the capability they actually need.

Performance and Efficiency at Scale

For training, the TPU 8t pods pack 9,600 chips with two petabytes of shared high bandwidth memory, delivering 121 FP4 EFlops per pod. Google claims a linear scaling capability that can link up to one million chips in a single logical cluster, a feat designed to shrink training timelines from months to weeks. The company also reports a “goodpute” rate of 97 percent, meaning the chips spend almost all their time on meaningful computation rather than waiting on memory or faults.

On the inference side, the TPU 8i triples on chip SRAM to 384 MB to keep larger key value caches local, which speeds up models with long context windows. These chips run in pods of 1,152 units and are the first Google accelerators to rely solely on the custom Axion ARM CPU host, with one CPU per two TPUs. Google claims this full stack ARM approach delivers twice the performance per watt compared to the previous Ironwood generation.

Infrastructure Designed for a Cloud Native Future

Beyond the chips themselves, Google has co designed its data centers around the new TPUs. Integrating networking directly onto the compute die and optimizing pod layouts has reportedly increased compute per unit of electricity by six times. To handle the intense heat, the company deployed fourth generation liquid cooling with actively controlled valves that adjust water flow based on real time workload demands.

Both the TPU 8t and TPU 8i support standard developer frameworks such as JAX, MaxText, PyTorch, SGLang, and vLLM. This ensures that third party developers can immediately leverage the new hardware without rearchitecting their pipelines. While Nvidia briefly saw a 1.5 percent stock dip on the news, Google’s announcement signals its commitment to building an end to end AI infrastructure that challenges the dominant GPU paradigm.

Source: Arstechnica

TAGGED:Agentic AICustom SiliconData CenterGoogleHardwareTPU
Share This Article
Email Copy Link Print
ByAIWadmin
Follow:
Global AI news & information.
Previous Article GitHub Overhauls Copilot Pricing: Usage Based Billing Goes Live in June
Next Article Google Commits Up to $40 Billion to Anthropic in Cloud Compute Deal
Ad imageAd image

You Might Also Like

News

Musk’s OpenAI Bid Reveals His Need for Absolute Control

By
AIWadmin
News

Apple’s iOS 27 Siri Overhaul: A Strategic Pivot to AI Brokerage, Not Innovation

By
AIWadmin
News

OpenAI and Anthropic Clone Each Other’s Enterprise Playbook, Wall Street Foots the Bill

By
AIWadmin
News

Google’s Gemini trap: dark patterns designed to hoover your data

By
AIWadmin
AIWatcher
Facebook Twitter Youtube Linkedin Rss

Global AI News and Information
AIWatcher is your definitive source for AI updates worldwide, from Silicon Valley to Shanghai.
Our industry coverage keeps you in the loop with the latest news and trends shaping the future of AI.

Quick Links
  • News
  • Articles
  • Spotlight
  • Events
About Us
  • Mission
  • Services
  • Contact
  • Privacy Policy
  • Legal

© 2026 AIWatcher. All Rights Reserved.