Apple-Nvidia collaboration triples speed of AI model production

Author: news@appleinsider.com (Malcolm Owen)

Published: Dec, 19 2024 15:59

Training models for machine learning is a processor-intensive task. Apple's latest machine learning research could make creating models for Apple Intelligence faster, by coming up with a technique to almost triple the rate of generating tokens when using Nvidia GPUs.

One of the problems in creating large language models (LLMs) for tools and apps that offer AI-based functionality, such as Apple Intelligence, is inefficiencies in producing the LLMs in the first place. Training models for machine learning is a resource-intensive and slow process, which is often countered by buying more hardware and taking on increased energy costs.

Earlier in 2024, Apple published and open-sourced Recurrent Drafter, known as ReDrafter, a method of speculative decoding to improve performance in training. It used an RNN (Recurrent Neural Network) draft model combining beam search with dynamic tree attention for predicting and verifying draft tokens from multiple paths.

This sped up LLM token generation by up to 3.5 times per generation step versus typical auto-regressive token generation techniques. In a post to Apple's Machine Learning Research site, it explained that alongside existing work using Apple Silicon, it didn't stop there. The new report published on Wednesday detailed how the team applied the research in creating ReDrafter to make it production-ready for use with Nvidia GPUs.

Nvidia GPUs are often employed in servers used for LLM generation, but the high-performance hardware often comes at a hefty cost. It's not uncommon for multi-GPU servers to cost in excess of $250,000 apiece for the hardware alone, let alone any required infrastructure or other connected costs.

More for You

Nvidia brings Blackwell to your desk - Project DIGITS mini PC is more like a mini supercomputer

Nvidia brings Blackwell to your desk - Project DIGITS min... Nvidia brings Blackwell to your desk - Project DIGITS mini PC is more like a mini supercomputer Techradar

Nvidia is preparing for the post-GPU AI era as it is reportedly recruits ASIC engineers to fend off competition from Broadcom and Marvell

Nvidia is preparing for the post-GPU AI era as it is repo... Nvidia is preparing for the post-GPU AI era as it is reportedly recruits ASIC engineers to fend off competition from Broadcom and Marvell Techradar

Nvidia unveils GB200 NVL4 with two Grace CPUs and four Blackwell GPUs for modern data center workloads

Nvidia unveils GB200 NVL4 with two Grace CPUs and four Bl... Nvidia unveils GB200 NVL4 with two Grace CPUs and four Blackwell GPUs for modern data center workloads Techradar

Biggest Nvidia takeaways from Jensen Huang's CES 2025 keynote Biggest Nvidia takeaways from Jensen Huang's CES 2025 keynote The Independent

Microsoft backed a tiny hardware startup that just launched its first AI processor that does inference without GPU or expensive HBM memory and a key Nvidia partner is collaborating with it

Microsoft backed a tiny hardware startup that just launch... Microsoft backed a tiny hardware startup that just launched its first AI processor that does inference without GPU or expensive HBM memory and a key Nvidia partner is collaborating with it Techradar

Top Followed

Sky teases launch of new ‘smarter, brighter and better than ever’ mystery product coming in weeks

Sky teases launch of new ‘smarter, brighter and better th... Sky teases launch of new ‘smarter, brighter and better than ever’ mystery product coming in weeks The Sun

Everyone is making the same joke about forecast map showi... Everyone is making the same joke about forecast map showing odd-shaped winter storm Mail Online

Apple's bad blood with Nvidia continues, after decades of... Apple's bad blood with Nvidia continues, after decades of fighting Apple Insider

Why has Elon Musk changed his X name to Kekius Maximus? Reason behind meme-inspired crypto token Twitter handle

Why has Elon Musk changed his X name to Kekius Maximus? R... Why has Elon Musk changed his X name to Kekius Maximus? Reason behind meme-inspired crypto token Twitter handle The Sun

Scientist challenges 'out of Africa' theory with new orig... Scientist challenges 'out of Africa' theory with new origin for modern humans Mail Online

Apple-Nvidia collaboration triples speed of AI model production

Share:

Share:

More for You

Top Followed

Trending

You Might Also Like