We evolved for a linear world. If you walk for an hour, you cover a certain distance. Walk for two hours and you cover double that distance. This intuition served us well on the savannah. But it catastrophically fails when confronting AI and the core exponential trends at its heart.

From the time I began work on AI in 2010 to now, the amount of training data that goes into frontier AI models has grown by a staggering 1 trillion times—from roughly 10¹⁴ flops (floating-point operations‚ the core unit of computation) for early systems to over 10²⁶ flops for today’s largest models. This is an explosion. Everything else in AI follows from this fact.

The skeptics keep predicting walls. And they keep being wrong in the face of this epic generational compute ramp. Often, they point out that Moore’s Law is slowing. They also mention a lack of data, or they cite limitations on energy.

But when you look at the combined forces driving this revolution, the exponential trend seems quite predictable. To understand why, it’s worth looking at the complex and fast-moving reality beneath the headlines.

Think of AI training as a room full of people working calculators. For years, adding computational power meant adding more people with calculators to that room. Much of the time those workers sat idle, drumming their fingers on desks, waiting for the numbers to come through for their next calculation. Every pause was wasted potential. Today’s revolution goes beyond more and better calculators (although it delivers those); it is actually about ensuring that all those calculators never stop, and that they work together as one.

Three advances are now converging to enable this. First, the basic calculators got faster. Nvidia’s chips have delivered an over sevenfold increase in raw performance in just six years, from 312 teraflops in 2020 to 2,250 teraflops today. Our own Maia 200 chip, launched this January, delivers 30% better performance per dollar than any other hardware in our fleet. Second, the numbers arrive faster thanks to a technology called HBM, or high bandwidth memory, which stacks chips vertically like tiny skyscrapers; the latest generation, HBM3, triples the bandwidth of its predecessor, feeding data to processors fast enough to keep them busy all the time. Third, the room of people with calculators became an office and then a whole campus or city. Technologies like NVLink and InfiniBand connect hundreds of thousands of GPUs into warehouse-size supercomputers that function as single cognitive entities. A few years ago this was impossible.

These gains all come together to deliver dramatically more compute. Where training a language model took 167 minutes on eight GPUs in 2020, it now takes under four minutes on equivalent modern hardware. To put this in perspective: Moore’s Law would predict only about a 5x improvement over this period. We saw 50x. We’ve gone from two GPUs training AlexNet, the image recognition model that kicked off the modern boom in deep learning in 2012, to over 100,000 GPUs in today’s largest clusters, each one individually far more powerful than its predecessors.

Then there’s the revolution in software. Research from Epoch AI suggests that the compute required to reach a fixed performance level halves approximately every eight months, much faster than the traditional 18-to-24-month doubling of Moore’s Law. The costs of serving some recent models have collapsed by a factor of up to 900 on an annualized basis. AI is becoming radically cheaper to deploy.

The numbers for the near future are just as staggering. Consider that leading labs are growing capacity at nearly 4x annually. Since 2020, the compute used to train frontier models has grown 5x every year. Global AI-relevant compute is forecast to hit 100 million H100-equivalents by 2027, a tenfold increase in three years. Put all this together and we’re looking at something like another 1,000x in effective compute by the end of 2028. It’s plausible that by 2030 we’ll bring an additional 200 gigawatts of compute online every year—akin to the peak energy use of the UK, France, Germany, and Italy put together.

What does all this get us? I believe it will drive the transition from chatbots to nearly human-level agents—semiautonomous systems capable of writing code for days, carrying out weeks- and months-long projects, making calls, negotiating contracts, managing logistics. Forget basic assistants that answer questions. Think teams of AI workers that deliberate, collaborate, and execute. Right now we’re only in the foothills of this transition, and the implications stretch far beyond tech. Every industry built on cognitive work will be transformed.

The obvious constraint here is energy. A single refrigerator-size AI rack consumes 120 kilowatts, equivalent to 100 homes. But this hunger collides with another exponential: Solar costs have fallen by a factor of nearly 100 over 50 years; battery prices have dropped 97% over three decades. There is a pathway to clean scaling coming into view.

The capital is deployed. The engineering is delivering. The $100 billion clusters, the 10-gigawatt power draws, the warehouse-scale supercomputers … these are no longer science fiction. Ground is being broken for these projects now across the US and the world. As a result, we are heading toward true cognitive abundance. At Microsoft AI, this is the world our superintelligence lab is planning for and building.

Skeptics accustomed to a linear world will continue predicting diminishing returns. They will continue being surprised. The compute explosion is the technological story of our time, full stop. And it is still only just beginning.

Mustafa Suleyman is CEO of Microsoft AI.

Read more

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology.

Desalination plants in the Middle East are increasingly vulnerable 

As the conflict in Iran has escalated, a crucial resource is under fire: the desalinization technology that supplies water in the region. 
 
President Donald Trump has threatened to destroy “possibly all desalinization plants” in Iran if the Strait of Hormuz is not reopened. The impact on farming, industry, and—crucially—drinking in the Middle East could be severe. Find out why

—Casey Crownhart 

This story is part of MIT Technology Review Explains, our series untangling the complex, messy world of technology to help you understand what’s coming next. You can read more from the series here. 

AI is changing how small online sellers decide what to make 

For small entrepreneurs, deciding what to sell and where to make it has traditionally been a slow, labor-intensive process. Now that work is increasingly being done by AI.   

Tools like Alibaba’s Accio compress weeks of product research and supplier hunting into a single chat. Business owners and e-commerce experts say they’re making sourcing more accessible—and slashing the time from product idea to launch.  

Read the full story on how AI is leveling the path to global manufacturing

—Caiwei Chen 

The gig workers who are training humanoid robots at home 

When Zeus, a medical student in Nigeria, returns to his apartment from a long day at the hospital, he straps his iPhone to his forehead and records himself doing chores.  
 
Zeus is a data recorder for Micro1, which sells the data he collects to robotics firms. As these companies race to build humanoids, videos from workers like Zeus have become the hottest new way to train them.   
 
Micro1 has hired thousands of them in more than 50 countries, including India, Nigeria, and Argentina. The jobs pay well locally, but raise thorny questions around privacy and informed consent. The work can be challenging—and weird. Read the full story.  

—Michelle Kim 

This is our latest story to be turned into an MIT Technology Review Narrated podcast, which we’re publishing each week on Spotify and Apple Podcasts. Just navigate to MIT Technology Review Narrated on either platform, and follow us to get all our new content as it’s released. 

The must-reads 

I’ve combed the internet to find you today’s most fun/important/scary/fascinating stories about technology. 

1 Anthropic’s new model found security problems in every OS and browser 
Claude Mythos has been heralded as a cybersecurity “reckoning.” (The Verge)  
+ Anthrophic is limiting the rollout over hacking fears. (CNBC
+ It’s also launching a project that lets Mythos flag vulnerabilities. (Gizmodo
+ Apple, Google, and Microsoft have joined the initiative. (ZDNET

2 Iranian hackers are targeting American critical infrastructure 
Their focus is on energy and water infrastructure. (Wired
+ They’re targeting industrial control devices. (TechCrunch)  

3 Google’s AI Overviews deliver millions of incorrect answers per hour 
Despite a 90% accuracy rate. (NYT $) 
+ AI means the end of internet search as we’ve known it. (MIT Technology Review

4 Elon Musk is trying to oust OpenAI CEO Sam Altman in a lawsuit 
As remedies for Altman allegedly defrauding him. (CNBC
+ Musk wants any damages given to OpenAI’s nonprofit arm. (WSJ $) 

5 ICE has admitted it’s using powerful spyware 
The tools that can intercept encrypted messages. (NPR
+ Immigration agencies are also weaponizing AI videos. (MIT Technology Review

6 Greece has joined the countries banning kids from social media 
Under-15s will be blocked from 2027. (Reuters
+ Australia introduced the world’s first social media ban for children. (Guardian
+ Indonesia recently rolled out the first one in Southeast Asia. (DW)  
+ Experts say they’re a lazy fix. (CNBC

7 Intel will help Elon Musk build his Terafab in Texas 
They aim to manufacture chips for AI projects. (Engadget
+ Musk says it will be the largest-ever semiconductor factory. (Engadget
+ Future AI chips could be built on glass. (MIT Technology Review)  

8 TikTok is building a second billion-euro data center in Finland 
It’s moving data storage for European users. (Reuters
+ Finland has become a magnet for data centers. (Bloomberg $) 
+ But nobody wants one in their backyard. (MIT Technology Review

9 Plans for Canada’s first “virtual gated community” have sparked a row 
The AI-powered surveillance system has divided neighbors. (Guardian
+ Is the Pentagon allowed to surveil Americans with AI? (MIT Technology Review

10 The high-tech engineering of the “space toilet” has been revealed 
Artemis II is the first mission to carry one around the world. (Vox

Quote of the day 

“This case has always been about Elon generating more power and more money for what he wants. His lawsuit remains nothing more than a harassment campaign that’s driven by ego, jealousy and a desire to slow down a competitor.” 

—OpenAI criticizes Musk’s legal action in an X post

One More Thing 

USWDS

Inside the US government’s brilliantly boring websites 

You may not notice it, but your experience on every US government website is carefully crafted. 

Each site aligns an official web design and a custom typeface. They aim to make government websites not only good-looking but accessible and functional for all. 

MIT Technology Review dug into the system’s history and features. Find out what we discovered

—Jon Keegan 

We can still have nice things 

A place for comfort, fun and distraction to brighten up your day. (Got any ideas? Drop me a line.) 

+ Rejoice in the splendor of the “Earthset” image captured by Artemis II. 
+ Meet the fearless cat chasing off bears. 
+ This document vividly explains what makes the octopus so unique. 
+ Revealed: the rhythmic secret that makes emo music so angsty

Read more
1 66 67 68 69 70 3,184