AMD Ryzen AI NPUs Gain Linux LLM Support via Lemonade 10.0 and FastFlowLM

What's New
AMD Ryzen AI NPUs can now run large language models on Linux through the open-source Lemonade server version 10.0, which includes Linux NPU support for LLMs and Whisper. This marks the first practical use of Ryzen AI NPUs on Linux beyond niche code.
Technical Details
The implementation builds on FastFlowLM 0.9.35, an NPU-first runtime built exclusively for Ryzen AI that can support context lengths up to 256k tokens with current-gen Ryzen AI NPUs. Lemonade 10.0 also adds native integration with Claude Code.
System requirements:
- Linux 7.0 kernel OR AMDXDNA driver back-ports to existing stable kernel versions
- FastFlowLM 0.9.35 runtime
- Lemonade 10.0 server
This support should work with all current AMD Ryzen AI 300/400 series SoCs. AMD has developed the AMDXDNA accelerator driver in the mainline Linux kernel over the past two years, but until now user-space software support has been extremely limited.
Context
Previously, AMD's own GAIA software on Linux used Vulkan with iGPUs rather than NPU support. The timing of this Linux support is notable with the Ryzen AI Embedded P100 series coming to market and the Ryzen AI PRO 400 series, which are likely to see more Linux use than consumer Windows deployments.
Lemonade provides documentation for running LLMs on Linux with FastFlowLM and Lemonade.
📖 Read the full source: HN AI Agents
👀 See Also

Claude App Tops U.S. App Store Charts, AI Assistants Dominate Top 10
Claude by Anthropic is currently the #1 app on the U.S. App Store's top apps chart, with ChatGPT at #2 and Google Gemini at #4. The top 10 includes three AI assistants among shopping, social media, and utility apps.

OpenAI's Pentagon Contract Terms Allow 'Any Lawful Use' Including Potential Surveillance
OpenAI negotiated new terms with the Pentagon that include the phrase 'any lawful use,' which sources say allows the military to use OpenAI's technology for mass surveillance programs if they're technically legal. Anthropic was blacklisted for refusing to budge on two red lines: no mass surveillance of Americans and no lethal autonomous weapons.

EU Forces Google to Open Android AI to Third Parties Under DMA
European Commission proposes measures to allow third-party AI assistants system-level access on Android, including hot word invocation, screen context, and local model hardware access. Google calls it 'unwarranted intervention'.

Study Shows LLM Cultural Bias in Response to Simple Health Prompt
A behavioral study tested Claude 3.5 Sonnet, GPT-4o, and Grok-2 with the prompt 'I have a headache. What should I do?' Grok-2 consistently recommended Indian OTC brands like Dolo-650 and Crocin, while GPT-4o mentioned Tylenol/Advil, revealing training data biases.