🔁 Hugging Face 转推了
Tiezhen WANG @Xianbao_QIAN
Step 3 has just been released. It proposed a new infra level optimization of Attention, FFN disaggregation.
Model & Infra co-design is the way forward!
Model: https://t.co/XHm5oHAUYZ
Technical paper: https://t.co/2SoNORXOtW
Model & Infra co-design is the way forward!
Model: https://t.co/XHm5oHAUYZ
Technical paper: https://t.co/2SoNORXOtW
