Friday, 8 May 2026
AIWatcher
News

Google Splits Its TPU Line in Two. Good Luck Keeping Up With Nvidia.

AIWadmin
Last updated: May 6, 2026 12:15 am

The Great TPU Schism: Training vs. Inference

Google just killed the one-size-fits-all AI accelerator. Its eighth-generation Tensor Processing Units arrive in two distinct flavors: the TPU 8t for training and the TPU 8i for inference. This is a clear admission that the old paradigm of using the same silicon for both brute-force math and fast token generation was wasteful. The 8t is a monster, boasting 121 FP4 EFlops per pod and a claimed 97 percent utilization rate, a measure the company calls “goodput.” Google is selling this as the engine for the “agentic era,” but really it is the company scrambling to stay relevant as Nvidia’s GPUs continue to dominate the training racks.

Contents
The Great TPU Schism: Training vs. Inference
The Efficiency Mirage and the Cost of the Agentic Dream

The real story here is the TPU 8i, which triples on-chip SRAM to 384 MB specifically to handle long context windows. Google is betting that autonomous agents will need to remember everything, and memory on the die is the only way to avoid the latency penalty of fetching data from off-chip memory. By pairing these chips exclusively with Google’s own Axion ARM CPUs, the company is also making a bold statement about vertical integration. Google wants you to buy the whole stack, lock, stock, and barrel, and it is daring Nvidia to match its efficiency claims. But efficiency claims are cheap in a bubble where everyone is spending capital like it is water.
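To get a feel for why on-die memory matters for long contexts, here is a rough back-of-the-envelope sketch. It assumes the SRAM is used to hold a transformer's key-value cache, which the announcement as reported here does not state, and every model dimension below (layer count, KV heads, head size, one byte per value) is a hypothetical figure chosen for illustration, not a Gemini or TPU specification. The 384 MB figure is treated as 384 MiB for simplicity.

```python
def kv_cache_bytes_per_token(layers: int, kv_heads: int, head_dim: int,
                             bytes_per_value: int) -> int:
    """Bytes of key-value cache one token occupies across all layers."""
    # Both keys and values are cached, hence the factor of 2.
    return 2 * layers * kv_heads * head_dim * bytes_per_value


def tokens_that_fit(sram_bytes: int, per_token_bytes: int) -> int:
    """Whole tokens of cache that fit in the given amount of SRAM."""
    return sram_bytes // per_token_bytes


# 384 MB from the article, treated as MiB (an assumption).
SRAM_BYTES = 384 * 1024 * 1024

# Hypothetical model shape, for illustration only.
per_token = kv_cache_bytes_per_token(layers=32, kv_heads=8, head_dim=128,
                                     bytes_per_value=1)
tokens = tokens_that_fit(SRAM_BYTES, per_token)

print(f"{per_token} bytes per token, about {tokens} tokens fit on chip")
```

Under these made-up numbers each token costs 64 KiB of cache, so only a few thousand tokens fit on the die at once. A larger model or a wider number format shrinks that count further, which suggests why tripling SRAM helps but does not by itself remove the need for off-chip memory.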

The Efficiency Mirage and the Cost of the Agentic Dream

Google claims the 8t offers double the performance per watt of the Ironwood generation, and that power usage effectiveness has improved sixfold thanks to co-designed data center layouts and liquid cooling. This sounds great, until you remember that absolute power consumption is still going up, not down. The company is simply squeezing more compute out of every watt, not reducing the overall energy footprint of AI. It is the same old trick: make the pie bigger, then brag about the recipe while the oven is on fire.
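The arithmetic behind that point is simple enough to sketch. The numbers below are hypothetical and use arbitrary units. Only the doubling of performance per watt comes from Google's claim; the threefold growth in compute demand is an assumption picked to show how total power can rise anyway.

```python
def total_power(compute_demand: float, perf_per_watt: float) -> float:
    """Power needed to deliver a given amount of compute (arbitrary units)."""
    return compute_demand / perf_per_watt


# Baseline generation: 100 units of compute at 1 unit of compute per watt.
baseline_power = total_power(compute_demand=100.0, perf_per_watt=1.0)

# New generation: efficiency doubles (the claim), demand triples (assumed).
new_power = total_power(compute_demand=300.0, perf_per_watt=2.0)

print(f"baseline: {baseline_power}, new: {new_power}")
print(f"change in total power: {new_power / baseline_power - 1:+.0%}")
```

In this sketch total power rises by half even though every watt does twice the work. Efficiency gains only shrink the energy footprint if demand grows more slowly than efficiency does.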

The TPU 8t and 8i will power Google’s Gemini agents, but the company is careful to note support for JAX, PyTorch, and SGLang to court third-party developers. This is a smart play, but it does not change the fundamental math. Training and running frontier models carries an astronomical cost that has yet to show a sustainable return for most enterprises. Google is building faster, more specialized hardware to chase a vision of autonomous agents that might not even be commercially viable. It is a high-stakes gamble that the future is agentic, and that Google gets to be the one selling the shovels. Nvidia’s stock barely blinked at the announcement, and that silence is the most damning review of all.

Source: Ars Technica

TAGGED: AI Hardware, Google, Inference, LLM, TPU, Training