🔁 Hugging Face retweeted
Tri Dao @tri_dao
Crazy that we now have an open-source model with 13B params that’s competitive w/ o1. And Mamba layers help bring much higher inference throughput.