MarkTechPost@AI · September 6, 2024
Yi-Coder Released by 01.AI: A Powerful Small-Scale Code LLM Series, Delivering Exceptional Performance in Code Generation, Editing, and Long-Context Comprehension

Yi-Coder is a series of open-source language models released by 01.AI for efficient, powerful coding performance; despite their relatively small parameter counts, the models excel across many dimensions.

💻 Yi-Coder comes in several variants, including Yi-Coder-9B-Chat, Yi-Coder-9B, Yi-Coder-1.5B-Chat, and Yi-Coder-1.5B, which perform strongly across coding tasks such as text generation, code generation and comprehension, and code editing.

🚀 Yi-Coder-9B has a training advantage: trained on a large volume of high-quality tokens, it offers strong capabilities across many programming environments and excels at long-context modeling.

🎉 Yi-Coder performs exceptionally across evaluations: a high pass rate on LiveCodeBench, strong scores on standard code generation benchmarks such as HumanEval, MBPP, and CRUXEval-O, and impressive results in code editing, cross-file code completion, long-context understanding, and mathematical reasoning.

The landscape of large language models (LLMs) for coding has been enriched by the release of Yi-Coder from 01.AI, a series of open-source models designed for efficient, powerful coding performance. Despite its relatively small size, Yi-Coder delivers state-of-the-art results, positioning itself as a formidable player in code generation and completion. Available in two configurations, 1.5 billion and 9 billion parameters, Yi-Coder proves that bigger isn’t always better, offering an impressive range of capabilities for developers seeking high-performance models with minimal resource overhead. The four variants open-sourced on Hugging Face so far are:

- Yi-Coder-9B-Chat: Designed for text generation, focusing on code-related tasks and offering interactive, conversational capabilities. It delivers state-of-the-art performance in competitive programming and long-context code generation, and was recently updated to improve its efficiency.
- Yi-Coder-9B: The larger base model in the series, offering powerful code generation and comprehension across 52 programming languages. Updated to further optimize long-context processing, it excels at handling complex, project-level tasks with precision.
- Yi-Coder-1.5B-Chat: A smaller, lightweight model designed for chat-based coding tasks, delivering impressive results in code editing and interactive code completion. The recent update focuses on improving real-time performance and accuracy in conversational coding applications.
- Yi-Coder-1.5B: A base model offering an efficient option for developers who need fast code generation with fewer computational resources. The recent update improves its handling of basic programming tasks, making it a versatile tool for developers with limited hardware.
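
Since the variants are hosted on Hugging Face, a chat model can be loaded with the standard transformers workflow. The sketch below is illustrative, not the vendor's documented quickstart; the repo ID 01-ai/Yi-Coder-9B-Chat is inferred from the model names above.

```python
# Minimal sketch: loading the chat variant via Hugging Face transformers.
# The repo ID "01-ai/Yi-Coder-9B-Chat" is assumed from the naming above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "01-ai/Yi-Coder-9B-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [
    {"role": "user", "content": "Write a Python function that checks if a string is a palindrome."}
]
# Build the chat-formatted input and generate a completion.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)

# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```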

Yi-Coder-9B, the larger of the two models, stands out due to its advanced training. It builds on Yi-9B with an additional 2.4 trillion high-quality tokens, sourced from a repository-level code corpus on GitHub and from code-related data filtered out of CommonCrawl. These tokens cover 52 major programming languages, enabling Yi-Coder to offer broad proficiency across varied coding environments. With a maximum context window of 128K tokens, Yi-Coder is also well suited to complex, project-level code generation and comprehension tasks.

One of Yi-Coder’s most impressive aspects is its competitive performance, particularly with the Yi-Coder-9B-Chat model. In rigorous evaluations, Yi-Coder-9B-Chat achieved a 23.4% pass rate on LiveCodeBench, a platform designed to benchmark LLMs using real-time competitive programming problems sourced from LeetCode, AtCoder, and CodeForces. Notably, Yi-Coder’s performance surpassed much larger models, including DeepSeek-Coder-33B-Instruct and CodeGeex4-All-9B, making it the only model under 10 billion parameters to break the 20% threshold.

In addition to its competitive programming strengths, Yi-Coder excelled in standard code generation benchmarks such as HumanEval, MBPP, and CRUXEval-O. With an 85.4% pass rate on HumanEval and a 73.8% pass rate on MBPP, Yi-Coder-9B-Chat outperformed many of its peers, showcasing its ability to handle basic and complex coding tasks. It also became the first open-source LLM to surpass 50% accuracy on CRUXEval-O, further cementing its status as a high-performing model in the coding community.
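
For context, pass rates like these are conventionally pass@k figures (pass@1 here), computed with the unbiased estimator introduced alongside HumanEval (Chen et al., 2021); whether 01.AI used exactly this protocol is an assumption. A minimal sketch of the estimator:

```python
# Unbiased pass@k estimator from the HumanEval paper (Chen et al., 2021):
# with n samples per problem of which c pass, pass@k = 1 - C(n-c, k) / C(n, k),
# computed below in the numerically stable product form.
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Probability that at least one of k samples drawn from n is correct."""
    if n - c < k:
        return 1.0  # every size-k subset must contain a passing sample
    return 1.0 - float(np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))

# Example: 200 samples per problem, 172 pass -> pass@1 = 0.86
print(pass_at_k(n=200, c=172, k=1))
```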

Yi-Coder is not limited to code generation; it also excels in code editing tasks. On CodeEditorBench, a benchmark designed to evaluate a model’s ability to perform debugging, translation, language switching, and code polishing, Yi-Coder consistently outperformed its competitors, demonstrating impressive win rates against other open-source models, particularly in debugging and code translation. This makes Yi-Coder attractive for developers looking to streamline their code refinement processes.

Another critical area where Yi-Coder shines is cross-file code completion, a key requirement for modern Integrated Development Environments (IDEs). On the CrossCodeEval benchmark, which tests models’ ability to understand and complete code with cross-file dependencies, Yi-Coder outperformed similarly sized models in both retrieval and non-retrieval contexts. This result can be attributed to its extensive training on repository-level code corpora, allowing it to capture long-term dependencies and efficiently complete code tasks that span multiple files.
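
One common way to frame retrieval-augmented cross-file completion, the setting CrossCodeEval's retrieval track evaluates, is to prepend snippets retrieved from other files as comments ahead of the local file context. The format below is a generic convention for illustration, not Yi-Coder's documented prompt format:

```python
# Illustrative sketch of a retrieval-augmented cross-file completion prompt,
# in the spirit of CrossCodeEval's retrieval setting. The comment-based
# context format is a common convention, not a documented Yi-Coder format.

def build_prompt(retrieved: list[tuple[str, str]], in_file_prefix: str) -> str:
    """Prepend cross-file snippets (path, code) as comments, then local context."""
    parts = []
    for path, snippet in retrieved:
        commented = "\n".join(f"# {line}" for line in snippet.splitlines())
        parts.append(f"# Context from {path}:\n{commented}")
    parts.append(in_file_prefix)  # the model completes from here
    return "\n\n".join(parts)

prompt = build_prompt(
    retrieved=[
        ("utils/math_ops.py", "def clamp(x, lo, hi):\n    return max(lo, min(x, hi))")
    ],
    in_file_prefix="from utils.math_ops import clamp\n\ndef normalize(x):\n    return clamp(",
)
print(prompt)
```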

Long-context comprehension is one of Yi-Coder’s most unique strengths. In a synthetic task called “Needle in the code,” Yi-Coder demonstrated its ability to handle sequences as long as 128K tokens, twice the length used in comparable evaluations like those of CodeQwen1.5. The model flawlessly completed this task, demonstrating its proficiency in extracting key information from extensive codebases, a crucial skill for developers working on large-scale projects.
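
The exact construction 01.AI used is not described here, but needle-in-haystack tests for code generally follow the same recipe: hide one distinctive function inside a long span of synthetic filler and ask the model to recall it. A purely illustrative sketch:

```python
# Hedged sketch of a "Needle in the code"-style test: bury one distinctive
# function (the needle) inside many filler functions, then ask the model to
# recall it. The actual construction used by 01.AI is an assumption here.
import random

def make_haystack(n_filler: int, needle: str, seed: int = 0) -> str:
    random.seed(seed)
    filler = [
        f"def helper_{i}(x):\n    return x + {random.randint(1, 9)}\n"
        for i in range(n_filler)
    ]
    filler.insert(random.randrange(len(filler)), needle)  # hide the needle
    return "\n".join(filler)

needle = "def secret_checksum(data):\n    return sum(data) % 9973\n"
haystack = make_haystack(n_filler=5000, needle=needle)
# The full prompt (haystack + question) is then fed to the model, scaled to
# whatever sequence length is under test (e.g. up to 128K tokens).
prompt = haystack + "\n# Question: what does secret_checksum compute?\n"
```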

In addition to its coding capabilities, Yi-Coder has shown promise in mathematical reasoning. By leveraging program-aided language models (PAL), Yi-Coder-9B achieved an average accuracy of 70.3% across seven mathematical reasoning benchmarks, surpassing the performance of the larger DeepSeek-Coder-33B. This demonstrates that strong coding abilities can translate into other domains, such as solving complex mathematical problems.
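
In the PAL setup, the model does not answer a word problem in free-form text; it emits a small program, and an interpreter executes that program to produce the final answer. A minimal sketch of the execution loop, where generate_code is a hypothetical stand-in for any model call (e.g. Yi-Coder via transformers):

```python
# Minimal sketch of the PAL (program-aided language models) loop: the LLM is
# prompted to solve a math word problem by emitting Python, and the answer
# comes from running that program rather than from free-form text.

def generate_code(problem: str) -> str:
    # Hypothetical placeholder for a real model call; a model given PAL-style
    # few-shot prompts would emit a program like the one returned below.
    return (
        "eggs_per_day = 16\n"
        "eaten = 3\n"
        "baked = 4\n"
        "price = 2\n"
        "answer = (eggs_per_day - eaten - baked) * price\n"
    )

def solve_with_pal(problem: str) -> float:
    namespace: dict = {}
    exec(generate_code(problem), namespace)  # run the model-written program
    return namespace["answer"]

print(solve_with_pal("Janet's ducks lay 16 eggs per day..."))  # -> 18
```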

In conclusion, Yi-Coder’s release marks an important step forward in the evolution of code-focused LLMs. Despite its relatively small parameter count, the model offers a competitive edge over larger alternatives, excelling in long-context comprehension, mathematical reasoning, and code editing. Its availability in base and chat versions provides flexibility for users seeking efficient inference and training options. By open-sourcing Yi-Coder, 01.AI has made a significant contribution to the development community. The model’s remarkable performance across coding tasks and its efficient architecture position Yi-Coder as a powerful tool for developers looking to push the boundaries of what small LLMs can achieve in software development.


