On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

On-device AI models have stayed small because the entire weight set has to live in DRAM, capping practical parameter counts well below what server-side deployments use. Enterprise architects evaluating agentic workloads have had to choose between capable cloud-dependent models and limited on-device ones. Apple's third-generation foundation models, announced at WWDC26, break that constraint by movi

Source

VentureBeat

Read full article at VentureBeat

Opens original article in a new tab

Can tech companies learn to love cheaper AI models?

If those same AI workloads can be handled by cheaper models without affecting quality, it would mean a massive shift in the economics of AI.

about 2 hours agoRead more →

Ars Technica

NASA assigns crew for Artemis III, sets aggressive timeline for flying it

"Artemis III will be an extraordinary demonstration of what is possible."

about 2 hours agoRead more →

ZDNet

Amazon just slashed SSD prices for Prime Day - these are the 5 best deals I've found

I track SSD deals, and found huge markdowns from top brands like WD, Samsung, and more ahead of Amazon Prime Day.

about 2 hours agoRead more →

TechCrunch

WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence, and more

Apple primarily made the case for an improved experience with its longstanding Siri assistant, which like most other announcements had a hefty helping of AI.

about 3 hours agoRead more →

Related Tech Stories

Can tech companies learn to love cheaper AI models?

NASA assigns crew for Artemis III, sets aggressive timeline for flying it

Amazon just slashed SSD prices for Prime Day - these are the 5 best deals I've found

WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence, and more