AI Engineer YouTube · May 29, 2026

Reachy Mini: the $300 open source robot you can actually hack — Andres Marafioti, Hugging Face

Reachy Mini: the $300 open source robot you can actually hack — Andres Marafioti, Hugging Face video thumbnail
Why it matters

Qwen3-TTS shipped at 0.8x real time: one second of audio took 1.2 seconds to generate. Andres Marafioti from Hugging Face spent two weeks fixing it. The culprits were no streaming, 500 autoregressive steps per audio packet with a CPU GPU round trip on each, and a dynamic KV cache that blocked compilation. Static KV cac

My takeaway: Qwen3-TTS shipped at 0.8x real time: one second of audio took 1.2 seconds to generate. Andres Marafioti from Hugging Face spent two weeks fixing it. The culprits were no streaming, 500 autoregressive steps per audio packet with a CPU GPU round trip on each, and a dynamic KV cache that blocked compilation. Static KV cac