Tuesday, 23 Jun 2026
AIWatcher
News

Alibaba’s New Image Model Doubles Compression and Speeds Up Generation

By AIWadmin
Last updated: May 22, 2026 11:58 pm

Stronger Compression for Faster Training

Alibaba’s new Qwen-Image-2.0 model achieves a major efficiency gain by using a variational autoencoder (VAE) that downsamples images sixteenfold along each spatial dimension, double the per-side factor used by most open-source models. Standard models such as FLUX.1-dev and HunyuanVideo typically rely on eightfold spatial downsampling. Compressing this aggressively usually sacrifices fine detail, but the Qwen team countered that by adding skip connections that preserve fine-grained information around the VAE’s bottleneck layers. They also shaped the latent space during early training to capture semantically meaningful structures, giving the image transformer a cleaner workspace.
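To make the compression figures concrete, the short sketch below counts latent grid positions for eightfold and sixteenfold downsampling. The 1024x1024 input size and the function names are assumptions chosen for illustration only; they do not come from Alibaba's report, and the sketch ignores latent channel counts and any further patching inside the transformer.

```python
def latent_grid(height: int, width: int, factor: int) -> tuple[int, int]:
    """Return the latent grid size after downsampling each side by `factor`."""
    if height % factor or width % factor:
        raise ValueError("image sides must be divisible by the downsampling factor")
    return height // factor, width // factor


def latent_positions(height: int, width: int, factor: int) -> int:
    """Count the spatial positions in the latent grid."""
    rows, cols = latent_grid(height, width, factor)
    return rows * cols


# Hypothetical 1024x1024 input, used only to illustrate the arithmetic.
eightfold = latent_positions(1024, 1024, 8)     # 128 * 128 grid
sixteenfold = latent_positions(1024, 1024, 16)  # 64 * 64 grid

print(eightfold, sixteenfold, eightfold // sixteenfold)  # 16384 4096 4
```

Because the factor applies to both height and width, doubling it from 8 to 16 leaves one quarter as many latent positions for the transformer to process.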


Architectural Changes Speed Up Inference

The transformer at the core of Qwen-Image-2.0 processes both image and text tokens in a single stream, using frozen weights from Alibaba’s Qwen3-VL vision-language model for text conditioning. The team made two structural modifications to prevent unstable activations: they simplified an internal scaling mechanism and stabilized the final layer normalization. These changes allow the model to generate high-quality photorealistic images in as few as four generation steps, down from the typical 40 steps required by earlier systems. Alibaba’s technical report notes that the model’s outputs include portraits, animal close-ups, nature scenes, and game screenshots with legible on-screen text.
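As a rough guide to what the step reduction means, the sketch below compares the number of transformer forward passes for 40 and 4 generation steps. It assumes every step costs the same and runs one forward pass per step; `passes_per_step` is a hypothetical parameter standing in for sampler details (such as guidance) that the article does not describe.

```python
def forward_passes(steps: int, passes_per_step: int = 1) -> int:
    """Total transformer forward passes for one image.

    `passes_per_step` is an illustrative assumption, not a documented setting.
    """
    return steps * passes_per_step


baseline = forward_passes(40)  # typical step count cited for earlier systems
few_step = forward_passes(4)   # lowest step count cited for Qwen-Image-2.0

print(baseline, few_step, baseline / few_step)  # 40 4 10.0
```

Under these assumptions, four-step generation needs one tenth of the forward passes. Actual wall-clock gains depend on hardware and sampler settings that are not given here.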

Source: The-Decoder

TAGGED: AI Model, Alibaba, Efficiency, Image Generation, Transformer, VAE