🔁 Hugging Face retweeted
DailyPapers @HuggingPapers
Hugging Face releases FineWeb2! 🥂
A new 20TB multilingual dataset, supporting 1000+ languages, with a data processing pipeline that can be automatically adapted to support any language.
