MarkTechPost@AI, October 27, 2024
Cohere for AI Releases Aya Expanse (8B & 32B): A State-of-the-Art Multilingual Family of Models to Bridge the Language Gap in AI

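Language technology has advanced rapidly, yet many languages remain underrepresented, with most NLP progress concentrated on well-resourced languages such as English. Cohere for AI has released Aya Expanse, a family of open-weight models intended to expand language coverage and inclusivity. The models suit a range of natural language tasks, come in sizes that fit different use cases, perform strongly on multilingual understanding benchmarks, and help narrow the language gap.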

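🎯 Aya Expanse is a family of open-weight models from Cohere for AI aimed at closing the language gap. Researchers and developers worldwide can access and build on the models, which handle natural language tasks such as text generation, translation, and summarization.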

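💪 Aya Expanse comes in multiple sizes, including Aya Expanse-8B and Aya Expanse-32B, covering use cases from large-scale applications to lightweight deployments. The models use an advanced transformer architecture and handle multilingual scenarios effectively.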

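🌟 Aya Expanse-32B performs notably well on multilingual understanding benchmarks, surpassing models such as Gemma 2 27B, Mixtral 8x22B, and Llama 3.1 70B; Aya Expanse-8B likewise outperforms other leading models in its parameter class.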

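🎉 By rethinking the core building blocks of machine learning, Cohere for AI has made an important contribution to closing the language gap, and the open model weights foster an inclusive research and development ecosystem.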

Despite rapid advancements in language technology, significant gaps in representation persist for many languages. Most progress in natural language processing (NLP) has focused on well-resourced languages like English, leaving many others underrepresented. This imbalance means that only a small portion of the world’s population can fully benefit from AI tools. The absence of robust language models for low-resource languages, coupled with unequal AI access, exacerbates disparities in education, information accessibility, and technological empowerment. Addressing these challenges requires a concerted effort to develop and deploy language models that serve all communities equitably.

Cohere for AI has introduced Aya Expanse, an open-weights, state-of-the-art family of models intended to help close the language gap in AI. Aya Expanse is designed to expand language coverage and inclusivity by providing open-weight models that researchers and developers worldwide can access and build upon. Available in multiple sizes, including Aya Expanse-8B and Aya Expanse-32B, the models adapt to a wide range of natural language tasks, such as text generation, translation, and summarization. The different sizes offer flexibility for various use cases, from large-scale applications to lighter deployments. Aya Expanse uses an advanced transformer architecture to capture linguistic nuances and semantic richness, and it is fine-tuned to handle multilingual scenarios effectively. The models leverage diverse datasets from low-resource languages like Swahili, Bengali, and Welsh to ensure equitable performance across linguistic contexts.

Aya Expanse plays a crucial role in bridging linguistic divides, giving underrepresented languages the tools needed to benefit from AI advancements. The Aya Expanse-32B model, in particular, has demonstrated significant improvements on multilingual understanding benchmarks, outperforming models such as Gemma 2 27B, Mixtral 8x22B, and Llama 3.1 70B, a model more than twice its size. In evaluations, Aya Expanse-32B achieved a 25% higher average accuracy across low-resource language benchmarks compared to other leading models. Similarly, Aya Expanse-8B outperforms leading models in its parameter class, including Gemma 2 9B, Llama 3.1 8B, and the recently released Ministral 8B, with win rates ranging from 60.4% to 70.6%. These results highlight Aya Expanse's potential to support underserved communities and foster better language inclusivity.
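
A win rate of this kind is usually the share of head-to-head comparisons in which one model's response is preferred over another's. The article does not describe the evaluation protocol, so the following is only a minimal sketch under stated assumptions: the function name `win_rate`, the outcome labels, the sample data, and the choice to count ties as non-wins are all illustrative and not taken from the source.

```python
def win_rate(outcomes):
    """Return the percentage of head-to-head comparisons that were won.

    `outcomes` holds one label per prompt ("win", "loss", or "tie"),
    from the point of view of the model being evaluated.
    Assumption of this sketch: ties count as non-wins.
    """
    if not outcomes:
        raise ValueError("need at least one comparison")
    wins = sum(1 for outcome in outcomes if outcome == "win")
    return 100.0 * wins / len(outcomes)


# Hypothetical judgments for 10 prompts (illustrative data, not from the article).
sample = ["win"] * 7 + ["loss"] * 2 + ["tie"]
print(f"{win_rate(sample):.1f}%")  # prints 70.0%
```

Other conventions, such as dropping ties or scoring them as half a win, would shift the figure, which is why the tie rule matters when comparing reported win rates.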

The improvements in Aya Expanse stem from Cohere for AI’s sustained focus on expanding how AI serves languages around the world. By rethinking the core building blocks of machine learning breakthroughs, including data arbitrage, preference training for general performance and safety, and model merging, Cohere for AI has made a significant contribution to bridging the language gap. Making the model weights openly available encourages an inclusive ecosystem of researchers and developers, ensuring language modeling becomes a community-driven effort rather than one controlled by a few entities.
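
Model merging, one of the building blocks named above, broadly means combining the parameters of several trained checkpoints into a single model. The article does not say which merging method was used, so the sketch below shows only the simplest generic variant, a weighted linear average of matching parameters. The function `merge_weights`, the parameter name `layer.w`, and the toy values are hypothetical.

```python
def merge_weights(checkpoints, coeffs):
    """Linearly average parameters across checkpoints.

    checkpoints: list of dicts mapping parameter name -> list of floats,
                 all sharing the same names and shapes.
    coeffs:      one non-negative weight per checkpoint; normalised here
                 so that the weights sum to 1.
    """
    if not checkpoints or len(checkpoints) != len(coeffs):
        raise ValueError("need one coefficient per checkpoint")
    total = sum(coeffs)
    if total <= 0:
        raise ValueError("coefficients must sum to a positive value")
    norm = [c / total for c in coeffs]

    merged = {}
    for name in checkpoints[0]:
        # Walk the same position of this parameter in every checkpoint.
        columns = zip(*(ckpt[name] for ckpt in checkpoints))
        merged[name] = [sum(w * v for w, v in zip(norm, col)) for col in columns]
    return merged


# Toy checkpoints with a single two-element parameter (illustrative only).
ckpt_a = {"layer.w": [1.0, 2.0]}
ckpt_b = {"layer.w": [3.0, 6.0]}
print(merge_weights([ckpt_a, ckpt_b], [1, 1]))  # {'layer.w': [2.0, 4.0]}
```

Real checkpoints hold tensors and not Python lists, and practical merging schemes can be more elaborate than a plain average; the sketch is meant only to make the idea concrete.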

In conclusion, Aya Expanse represents a significant step towards democratizing AI and addressing the language gap in NLP. By providing powerful, multilingual language models with open weights, Cohere for AI advances language technology while promoting inclusivity and collaboration. Aya Expanse enables developers, educators, and innovators from diverse linguistic backgrounds to create applications that are accessible and beneficial to a broader population, ultimately contributing to a more connected and equitable world. This move aligns well with the core values of artificial intelligence—accessibility, inclusiveness, and innovation without borders.



The post Cohere for AI Releases Aya Expanse (8B & 32B): A State-of-the-Art Multilingual Family of Models to Bridge the Language Gap in AI appeared first on MarkTechPost.
