The AI Podcast (NVIDIA)

Podcast Active

Total Items 1

Priority 4 / 5

Last Checked Never

Health Healthy

https://blogs.nvidia.com/ai-podcast/

Associated Themes

▲ Rising 0 mentions 7.5

◆ Emerging 0 mentions 5.5

Recent Items

On-Device AI: Running Llama on a Phone

We have quantised Llama 3 down to 4-bit precision and it runs at 30 tokens per second on a flagship Android device. The quality loss is surprisingly s...

09 Mar 2026 71% Medium

The AI Podcast (NVIDIA)

Associated Themes

AI-Driven Cost Reduction

Edge AI & On-Device Inference

Recent Items

On-Device AI: Running Llama on a Phone