MarkTechPost@AI 04月06日 04:00
Meta AI Just Released Llama 4 Scout and Llama 4 Maverick: The First Set of Llama 4 Models
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Meta AI 发布了新一代多模态模型 Llama 4,包括 Scout 和 Maverick 两个版本。 Scout 拥有 170 亿参数,擅长处理长文本,在多项基准测试中表现优异。Maverick 同样基于 170 亿参数,专注于视觉理解,在多模态推理任务中表现出色,并具有出色的性价比。这两款模型均受益于正在训练的更强大的 Llama 4 Behemoth 模型。Llama 4 的发布标志着 Meta AI 在多模态 AI 领域的技术进步。

🧠 Llama 4 Scout: 是一款拥有 170 亿参数的模型,拥有 16 个专家模块,上下文窗口可容纳 1000 万个 tokens,在处理长文档、复杂代码库和详细对话任务中表现出色,优于 Gemma 3、Gemini 2.0 Flash-Lite 和 Mistral 3.1 等模型。

👁️ Llama 4 Maverick: 同样基于 170 亿参数,拥有 128 个专家模块,专门增强视觉理解能力,能够精确对齐文本提示和视觉元素。在多模态推理任务中表现出色,超越了 GPT-4o 和 Gemini 2.0 Flash,并在推理和编码基准测试中与 DeepSeek v3 表现相当,但仅使用约一半的参数。

💰 Maverick 的效率: 在 LMArena 平台上,Maverick 的聊天优化版本获得了 1417 的 Elo 评级,表明其在对话和多模态环境中的计算效率和实用性。

💡 Llama 4 Behemoth 的影响: Scout 和 Maverick 的开发受益于 Meta 更强大的模型 Llama 4 Behemoth 的蒸馏技术。Behemoth 仍在积极训练中,初步结果显示其在 STEM 相关基准测试中,相对于 GPT-4.5、Claude Sonnet 3.7 和 Gemini 2.0 Pro 有显著优势。

Today, Meta AI announced the release of its latest generation multimodal models, Llama 4, featuring two variants: Llama 4 Scout and Llama 4 Maverick. These models represent significant technical advancements in multimodal AI, offering improved capabilities for both text and image understanding.

Llama 4 Scout is a 17-billion-active-parameter model structured with 16 expert modules. It introduces an extensive context window capable of accommodating up to 10 million tokens. This substantial context capacity enables the model to manage and interpret extensive textual content effectively, beneficial for long-form document processing, complex codebases, and detailed dialogue tasks. In comparative evaluations, Llama 4 Scout has demonstrated superior performance relative to contemporary models such as Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across recognized benchmark datasets.

Parallel to Scout, Llama 4 Maverick, also built upon a 17-billion-active-parameter architecture, incorporates 128 expert modules explicitly designed to enhance visual grounding. This design facilitates precise alignment between textual prompts and associated visual elements, enabling targeted responses grounded accurately to specific image regions. Maverick exhibits robust performance in comparative assessments, surpassing GPT-4o and Gemini 2.0 Flash, particularly in multimodal reasoning tasks. Additionally, Maverick has achieved comparable outcomes to DeepSeek v3 on reasoning and coding benchmarks while employing approximately half the active parameters.

A key feature of Maverick is its noteworthy performance-to-cost efficiency. Benchmarking efforts, specifically on the LMArena platform, have recorded an Elo rating of 1417 for Maverick’s chat-optimized version, indicating its computational efficiency and practical applicability in conversational and multimodal contexts.

The development of Scout and Maverick draws heavily from distillation techniques derived from the ongoing training of Meta’s more powerful model, Llama 4 Behemoth. Behemoth, which remains under active training, has preliminarily shown significant advantages over established models such as GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro, particularly within STEM-focused benchmarks. The insights and advanced methodologies from Behemoth have been instrumental in refining Scout and Maverick’s technical capabilities.

With the introduction of Llama 4, Meta AI advances multimodal artificial intelligence through highly refined and technically sophisticated models capable of deep semantic understanding and precise multimodal alignment. This release further exemplifies Meta AI’s ongoing commitment to fostering innovation and maintaining open accessibility for researchers, developers, and enterprise applications.

Future progress in multimodal AI is anticipated with the finalization and public release of Llama 4 Behemoth. Initial results indicate Behemoth’s potential to set new standards within multimodal performance, particularly in STEM applications and computational reasoning tasks. Meta AI plans to disclose detailed technical specifications and performance metrics upon completion of the Behemoth model.

The announcement underscores Meta AI’s dedication to pushing the technical limits of multimodal modeling, supporting the evolution of practical and research-oriented AI applications across diverse sectors including scientific research, education, and complex conversational systems. As Meta AI continues this trajectory, the technological advancements embodied in Llama 4 Scout, Maverick, and eventually Behemoth are expected to facilitate substantial progress in the computational and practical capabilities of multimodal AI.


Check out the Benchmarks and Download Llama 4. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 85k+ ML SubReddit.

[Register Now] miniCON Virtual Conference on OPEN SOURCE AI: FREE REGISTRATION + Certificate of Attendance + 3 Hour Short Event (April 12, 9 am- 12 pm PST) + Hands on Workshop [Sponsored]

The post Meta AI Just Released Llama 4 Scout and Llama 4 Maverick: The First Set of Llama 4 Models appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Meta AI Llama 4 多模态 AI 人工智能
相关文章