Apple Teams Up With NVIDIA to Speed Up AI Language Models

Apple has shared details on a collaboration with NVIDIA to speed up large language model (LLM) inference using a new text generation technique that substantially accelerates AI applications.

Apple earlier this year published and open-sourced Recurrent Drafter (ReDrafter), an approach that combines beam search and dynamic tree attention methods to accelerate text generation. Beam search explores multiple potential text sequences at once for better results, while tree attention organizes and removes redundant overlaps among these sequences to improve efficiency.
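To make the "redundant overlaps" idea concrete, here is a toy sketch in Python. It is not Apple's implementation: the function names and the example beams are hypothetical, and it only shows how candidate sequences that share a prefix can be merged into a tree so that each shared token is stored (and, in a real model, processed) once.

```python
def build_prefix_tree(beams):
    """Merge token sequences into a nested-dict trie (hypothetical helper)."""
    tree = {}
    for seq in beams:
        node = tree
        for tok in seq:
            # Reuse the existing child if another beam already created it.
            node = node.setdefault(tok, {})
    return tree


def count_nodes(tree):
    """Count the tokens stored in the trie, one per node."""
    return sum(1 + count_nodes(child) for child in tree.values())


# Three illustrative beam candidates that share prefixes.
beams = [
    ["the", "cat", "sat"],
    ["the", "cat", "ran"],
    ["the", "dog", "sat"],
]

flat_tokens = sum(len(seq) for seq in beams)    # tokens if each beam is handled separately
unique_tokens = count_nodes(build_prefix_tree(beams))  # tokens after merging shared prefixes

print(f"separate beams: {flat_tokens} tokens, merged tree: {unique_tokens} tokens")
```

In this example the shared "the" and "the cat" prefixes are kept once each, so the merged tree holds fewer tokens than the three beams handled separately.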

Apple has now integrated the technology into NVIDIA's TensorRT-LLM framework, which optimizes LLMs running on NVIDIA GPUs, where it achieved "state of the art performance," according to Apple. In testing with a production model containing tens of billions of parameters, the technique delivered a 2.7x increase in tokens generated per second.
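As a rough illustration of what a 2.7x throughput gain means for response time, the short Python calculation below applies the reported factor to a made-up baseline. The 20 tokens-per-second baseline and the 500-token response length are assumptions chosen for the example, not figures from Apple or NVIDIA, and the estimate ignores prompt processing and other fixed overheads.

```python
SPEEDUP = 2.7  # tokens-per-second improvement reported by Apple


def generation_time(num_tokens, tokens_per_second):
    """Seconds needed to generate num_tokens at a steady rate."""
    return num_tokens / tokens_per_second


baseline_tps = 20.0     # assumed baseline throughput (hypothetical)
response_tokens = 500   # assumed response length (hypothetical)

baseline_s = generation_time(response_tokens, baseline_tps)
faster_s = generation_time(response_tokens, baseline_tps * SPEEDUP)
time_saved_fraction = 1 - 1 / SPEEDUP  # share of generation time removed

print(f"baseline: {baseline_s:.1f} s, with 2.7x speedup: {faster_s:.1f} s")
print(f"generation time reduced by about {time_saved_fraction:.0%}")
```

Whatever baseline is assumed, a 2.7x throughput increase cuts pure generation time by roughly 63 percent, since the time scales as 1/2.7 of the original.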

Apple says the improved performance not only reduces user-perceived latency but also leads to decreased GPU usage and power consumption. From Apple's Machine Learning Research blog:

"LLMs are increasingly being used to power production applications, and improving inference efficiency can both impact computational costs and reduce latency for users. With ReDrafter's novel approach to speculative decoding integrated into the NVIDIA TensorRT-LLM framework, developers can now benefit from faster token generation on NVIDIA GPUs for their production LLM applications."
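For readers unfamiliar with speculative decoding, the Python sketch below shows the generic draft-and-verify loop in its simplest greedy form. It is not ReDrafter and does not use TensorRT-LLM: both "models" are hypothetical toy functions, and the draft is rigged to disagree with the target at fixed positions purely so the example is deterministic.

```python
def target_next(prefix):
    """Stand-in for the large model: deterministic next token (toy)."""
    return (sum(prefix) * 31 + len(prefix)) % 10


def draft_next(prefix):
    """Stand-in for the cheap draft model: wrong whenever len(prefix) % 4 == 3."""
    tok = target_next(prefix)  # toy shortcut to simulate a mostly-correct draft
    return (tok + 1) % 10 if len(prefix) % 4 == 3 else tok


def greedy_generate(prompt, n_new):
    """Baseline: one target call per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out


def speculative_generate(prompt, n_new, k=4):
    """Draft k tokens, keep the prefix the target agrees with, fix the first miss."""
    out = list(prompt)
    verify_passes = 0
    while len(out) - len(prompt) < n_new:
        draft = []
        for _ in range(k):
            draft.append(draft_next(out + draft))
        # A real system would check all k positions in one batched target pass.
        verify_passes += 1
        accepted = []
        for tok in draft:
            expected = target_next(out + accepted)
            accepted.append(expected)
            if tok != expected:
                break  # discard the rest of the draft after the first mismatch
        out.extend(accepted)
    return out[: len(prompt) + n_new], verify_passes


tokens, passes = speculative_generate([1, 2, 3], 12)
assert tokens == greedy_generate([1, 2, 3], 12)  # same output as plain greedy decoding
print(f"generated 12 tokens with {passes} verification passes")
```

The point of the sketch is that the output matches ordinary greedy decoding while the expensive model is consulted in fewer, larger steps; the actual speedup in practice depends on how often the draft is accepted.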

Developers interested in implementing ReDrafter can find detailed information on both Apple's website and NVIDIA's developer blog.

Top Rated Comments

attohs
15 months ago
NVidia? Did hell freeze over again?
Score: 37 Votes
vegetassj4
15 months ago
NVIDIA and Apple??!!? Working together again?
Score: 13 Votes
Delgibbons
15 months ago
Can't wait to put a 5090 in my Ma....

oh.
Score: 12 Votes
redbeard331
15 months ago
Good we have to hurry this up.
Score: 9 Votes
lilkwarrior
15 months ago
What would be an even better collaboration would be Apple enabling Nvidia GPU options again—at least for the Mac Pro.

It would be AWESOME to be able to use Nvidia’s ray-tracing and tensor cores with my creative professional and AI problems with Titan-class/Prosumer/workstation GPUs (x90 and up) again without having to switch to my PC.

A Nvidia MPX GPU module as capable as a 5090 with no wires and Thunderbolt 5 support would be a nirvana-like outcome—especially if Microsoft, Apple, and/or Valve enables a way to dual boot to Windows on ARM and SteamOS.

While I love building a liquid-cooled PC, I and various prosumers would finally have a choice to stop buying PCs altogether
Score: 7 Votes
Unregistered 4U
15 months ago
Since Apple now produces its own GPUs there is no need for hell to freeze over. Do you even remember the reason Apple and Nvidia parted ways? It was over Nvidia wanting complete access to macOS’s core. Apple said no way.
And, we’ve since had a REALLY good example (CrowdStrike) of why this would have been a baaaad idea.
Score: 4 Votes