Exclusive: Anthropic in Talks With Samsung to Manufacture Custom AI Chip Save 25% to unlock this story

Sign in
Subscribe

    Data Tools

    • About Pro
    • The Executives Leading the Data Center Race
    • The Next GPs 2026
    • The Next GPs 2025
    • The Rising Stars of AI Research
    • Leaders of the AI Shopping Revolution
    • Enterprise Software Startup Takeover List
    • Org Charts
    • The Information 50 2025
    • Generative AI Takeover List
    • Generative AI Database
    • AI Chip Database
    • AI Data Center Database
    • Tech IPO Tracker
    • Tech Sentiment Tracker
    • Gigafactory Database

    Special Projects

    • The Information 50 Database
    • VC Diversity Index
    • Enterprise Tech Powerlist
  • Org Charts
  • Deep Research
  • Tech
  • Finance
  • Weekend
  • Charts
  • Events
  • TITV
    • Directory

      Search, find and engage with others who are serious about tech and business.

    • Forum

      Follow and be a part of discussions about tech, finance and media.

    • Brand Partnerships

      Premium advertising opportunities for brands

    • Group Subscriptions

      Team access to our exclusive tech news

    • Newsletters

      Journalists who break and shape the news, in your inbox

    • Video

      Catch up on conversations with global leaders in tech, media and finance

    • Partner Content

      Explore our recent partner collaborations

      XFacebookLinkedInThreadsInstagram
    • Help & Support
    • RSS Feed
    • Careers
    Sign in
  • About Pro
  • The Executives Leading the Data Center Race
  • The Next GPs 2026
  • The Next GPs 2025
  • The Rising Stars of AI Research
  • Leaders of the AI Shopping Revolution
  • Enterprise Software Startup Takeover List
  • Org Charts
  • The Information 50 2025
  • Generative AI Takeover List
  • Generative AI Database
  • AI Chip Database
  • AI Data Center Database
  • Tech IPO Tracker
  • Tech Sentiment Tracker
  • Gigafactory Database

SPECIAL PROJECTS

  • The Information 50 Database
  • VC Diversity Index
  • Enterprise Tech Powerlist
Deep Research
TITV
Tech
Finance
Weekend
Charts
Events
Newsletters
  • Directory

    Search, find and engage with others who are serious about tech and business.

  • Forum

    Follow and be a part of discussions about tech, finance and media.

  • Brand Partnerships

    Premium advertising opportunities for brands

  • Group Subscriptions

    Team access to our exclusive tech news

  • Newsletters

    Journalists who break and shape the news, in your inbox

  • Video

    Catch up on conversations with global leaders in tech, media and finance

  • Partner Content

    Explore our recent partner collaborations

Subscribe
  • Sign in
  • Search
  • Opinion
  • Venture Capital
  • Artificial Intelligence
  • Startups
  • Market Research
    XFacebookLinkedInThreadsInstagram
  • Help & Support
  • RSS Feed
  • Careers

In-depth insights in seconds. Ask Deep Research.

AI Agenda

XAI Shows How Hard It Is to Use a Lot of GPUs at Once

Art by Mike Sullivan
By
Stephanie Palazzolo
[email protected]Profile and archive

AI developers have been desperately scrambling to get ahold of Nvidia server chips lately, as we wrote last week. When developers do get the graphics processing units, they’re under a lot of pressure to wring as much performance as possible from that expensive hardware.

That’s easier said than done. Training AI models can be “bursty,” meaning that there can be sudden spikes in GPU usage followed by periods of lower activity when researchers analyze the results and decide what to do next. This leads to what researchers refer to as a lower utilization rate, meaning they aren’t getting the most bang for their GPU buck. (This is less of a problem in AI inference involving finished models, when developers can run them in more predictable or consistent ways.)

Even the biggest AI firms have problems in this regard. Elon Musk’s xAI, for instance, has around 500,000 Nvidia GPUs, one of the largest collections among AI developers based on what they’ve publicly disclosed. But xAI’s Model Flops Utilization—a measure of exactly how much computing power it can eke out of those chips—was around 11% in recent weeks, according to a person who saw the data in an internal memo. (Business Insider earlier reported on the memo.) The MFU rate is an indicator of how effectively a developer is utilizing its chips—a rate of 100%, for instance, would imply full utilization.

To be fair to xAI, everyone struggles with GPU utilization, and a researcher at a rival firm said cracking 40% was difficult for most of xAI’s competitors. But a rate of 11% is appallingly low, the researcher said. And it’s especially surprising given that xAI has a reputation of setting up GPUs in a way that Nvidia recommends. 

Recommended