The Drop | April 2, 2026 | via Hacker News

Lemonade by AMD: a fast and open source local LLM server using GPU and NPU

Why it matters

AMD is making a strategic play for the local AI infrastructure market with hardware-optimized tooling, potentially challenging NVIDIA's dominance in AI deployment solutions.

Key signals

  • 290 points on Hacker News
  • 73 comments indicating developer interest
  • Uses both GPU and NPU processing
  • Open source local LLM server

The hook

AMD just dropped Lemonade: an open source LLM server that uses both GPU and NPU for local AI deployment.

Article URL: https://lemonade-server.ai
Comments URL: https://news.ycombinator.com/item?id=47612724
Points: 290
Comments: 73
Relevance score: 75/100

