🔁 Hugging Face 转推了
Gabriele Berton @gabriberton
Interesting how the original Pixel Shuffle was created to increase resolution (increase res, reduce channels) in order to go sub-pixel (image 1)
and now it's used the other way around (reduce res, increase channels), mostly to reduce number of tokens to feed to the VLM (image 2)
and now it's used the other way around (reduce res, increase channels), mostly to reduce number of tokens to feed to the VLM (image 2)

