MarkTechPost@AI 2024年07月22日
Athene-Llama3-70B Released: An Open-Weight LLM Trained through RLHF based on Llama-3-70B-Instruct
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Nexusflow 发布了 Athene-Llama3-70B,这是一个基于 Meta AI 的 Llama-3-70B 微调的开源权重聊天模型。Athene-70B 在 Arena-Hard-Auto 得分中取得了 77.8% 的成绩,与 GPT-4o 和 Claude-3.5-Sonnet 等专有模型相媲美。这标志着其前身 Llama-3-70B-Instruct(得分 46.6%)的显著改进。这种提升源于 Nexusflow 的目标后训练管道,旨在改进特定模型行为。Athene-70B 目前正在 Chatbot Arena 上进行公开测试。

🚀 **目标后训练:** Nexusflow 开发了内部基准,评估 LLM 在指令遵循、编码、创意写作和多语言任务方面的能力。基于这些评估,为目标强化学习从人类反馈 (RLHF) 中精心策划了高质量的偏好数据。该管道导致与 Llama-3-70B-Instruct 相比性能大幅提升。这些增强涵盖了关键方面,例如精确的指令遵循、数学和推理、全面的编码辅助、灵感创意写作和多语言精通。

💡 **企业级 AI 解决方案:** Athene-70B 展示了 Nexusflow 通过目标后训练为特定企业需求定制模型的能力。Nexusflow 在 Starling-7B 和 NexusRaven-V2 方面的先前成功基础上,致力于将其实现的模型提升到满足企业级应用程序标准。该公司提供定制解决方案,帮助企业在 GenAI 副驾驶和代理技术方面脱颖而出。Nexusflow 邀请组织探索 Athene-70B 如何通过联系他们以获取更多信息和合作机会来增强其 AI 计划。

🌐 **开源权重:** Athene-Llama3-70B 是一款开源权重聊天模型,这意味着研究人员和开发人员可以自由地访问和使用其权重。这将促进基于 LLM 的应用程序的创新和发展。开源模型的可用性有助于降低进入门槛,并使更多人能够参与 AI 研究和开发。

Nexusflow has released Athene-Llama3-70B, an open-weight chat model fine-tuned from Meta AI’s Llama-3-70B. Athene-70B has achieved an Arena-Hard-Auto score of 77.8%, rivaling proprietary models like GPT-4o and Claude-3.5-Sonnet. This marks a significant improvement from its predecessor, Llama-3-70B-Instruct, which scored 46.6%. The enhancement stems from Nexusflow’s targeted post-training pipeline, designed to improve specific model behaviors. Athene-70B is currently undergoing public testing on Chatbot Arena.

To maximize Llama-3-70B’s potential, Nexusflow developed internal benchmarks evaluating LLM capabilities in instruction following, coding, creative writing, and multilingual tasks. Based on these evaluations, high-quality preference data was curated for targeted Reinforcement Learning from Human Feedback (RLHF). This pipeline resulted in substantial performance improvements compared to Llama-3-70B-Instruct. The enhancements span key aspects such as precise instruction following, math and reasoning, comprehensive coding assistance, inspired creative writing, and multilingual mastery.

Athene-70B demonstrates Nexusflow’s capability to customize models for specific enterprise requirements through targeted post-training. Building on previous successes with Starling-7B and NexusRaven-V2, Nexusflow aims to advance its models to meet enterprise-grade application standards. The company offers tailored solutions to help businesses excel in GenAI copilot and agent technologies. Nexusflow invites organizations to explore how Athene-70B can enhance their AI initiatives by contacting them for further information and collaboration opportunities.

Athene-Llama3-70B, an open-weights chat model developed by Nexusflow, demonstrates significant improvements over its predecessor. The model achieves competitive performance compared to proprietary models in the Arena-Hard-Auto benchmark. Nexusflow’s targeted post-training pipeline, utilizing internal benchmarks and Reinforcement Learning from Human Feedback, has enhanced the model’s capabilities across various domains, including instruction following, math and reasoning, coding, creative writing, and multilingual tasks. This advancement showcases Nexusflow’s ability to tailor models for enterprise needs, building on their previous successes. The company positions itself as a provider of customized enterprise-grade AI solutions, inviting organizations to explore the potential of Athene-70B for their AI initiatives.


Check out the Model Card. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 46k+ ML SubReddit

Find Upcoming AI Webinars here

The post Athene-Llama3-70B Released: An Open-Weight LLM Trained through RLHF based on Llama-3-70B-Instruct appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Athene-Llama3-70B 开源权重 LLM 强化学习 企业级 AI
相关文章