Latest in AI

Showing:speech-to-speechDevelopersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Reachy Mini goes fully local
Hugging Face Blog62 days agoHardware
Hugging Face published a tutorial for running Reachy Mini conversations without cloud audio processing or API keys. The setup uses its speech-to-speech library as a cascaded VAD, STT, LLM, and TTS pipeline exposed through a Realtime API-compatible WebSocket. Recommended defaults include llama.cpp with Gemma 4, Silero VAD, Parakeet-TDT, and Qwen3-TTS, while allowing swaps to vLLM, MLX, Transformers, or hosted Responses API providers.
在 Hugging Face 上部署語音對語音 (Speech-to-Speech) 模型★ 75
Hugging Face Blog644 days agoTutorial
As real-time voice interaction technologies like GPT-4o become more widespread, the open-source community is also actively developing speech-to-speech (S2S)…