🔁 Hugging Face 转推了
Vaibhav (VB) Srivastav @reach_vb
Kyutai released their Streaming Text to Speech model, ~2B param model, ultra low latency (220ms), CC-BY-4.0 license 🔥
Trained on 2.5 Million Hours of audio, it can serve up to 32 users w/ less than 350ms latency on a SINGLE L40 🤯
Incredible release by kyutai folks, go check
Trained on 2.5 Million Hours of audio, it can serve up to 32 users w/ less than 350ms latency on a SINGLE L40 🤯
Incredible release by kyutai folks, go check