MarkTechPost@AI 2024年08月13日
Researchers at FPT Software AI Center Introduce XMainframe: A State-of-the-Art Large Language Model (LLM) Specialized for Mainframe Modernization to Address the $100B Legacy Code Modernization
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

FPT 软件 AI 中心研究人员开发了 XMainframe,这是一个专门为主机遗留系统和 COBOL 代码库设计的最新大型语言模型 (LLM)。该模型旨在解决主机现代化中遇到的挑战,包括缺乏针对主机语言的训练数据、缺乏适当的基准以及代码生成以外的复杂性。XMainframe 在多个基准测试中优于现有的最先进的 LLM,例如多项选择题、问答和 COBOL 代码摘要。

🤔 XMainframe 旨在解决主机现代化中遇到的挑战,例如缺乏针对主机语言的训练数据、缺乏适当的基准以及代码生成以外的复杂性。

🚀 XMainframe 通过创建广泛的数据收集管道来解决这些挑战,该管道可以生成高质量的训练数据集,从而显着提高 XMainframe 在此专业领域的表现。

📈 XMainframe 在多个基准测试中优于现有的最先进的 LLM,例如多项选择题、问答和 COBOL 代码摘要。

📊 在多项选择题方面,XMainframe 的准确率比 DeepSeek-Coder 高 30%,在问答方面,XMainframe 的 BLEU 分数是 Mixtral-Instruct 8x7B 的两倍,在 COBOL 摘要方面,XMainframe 的得分是 GPT-3.5 的六倍。

💡 这些结果突出了 XMainframe 在管理和现代化遗留系统方面取得重大进展的潜力,最终将提高软件开发人员的生产力和节省时间。

Introduction

Mainframe operating systems, originating in the 1940s, remain essential to critical sectors such as finance and government. However, the vast legacy of COBOL code—estimated by IBM to be around 200 to 220 billion lines—needs to be migrated to modern platforms and rewritten in contemporary programming languages. This task is monumental, with the cost of rewriting COBOL code using human resources estimated at 32 to 50 cents per line, presenting a $100 billion challenge. The time required for a complete rewrite by human programmers is still uncertain. These systems are often perceived as outdated, requiring significant maintenance and modernization. Addressing this challenge demands innovative tools capable of understanding and interacting with legacy codebases, a long-standing obstacle for the industry. The advent of Large Language Models (LLMs) offers a potential solution to this enduring problem. However, there are several concerns when applying LLMs to mainframe modernization.

Challenges in Using LLMs for Mainframe Modernization:

1. Limited Training on Mainframe Languages: While existing LLMs are trained on a wide range of languages, both natural and programming, they lack sufficient training on languages used in mainframes, such as COBOL. The relatively small amount of COBOL code available online leads to inadequate understanding and reasoning in these models.. Additionally, organizations tend to keep their mainframe codebases private due to the high security demands of financial-critical sectors, further limiting the available training data.

2. Lack of Proper Benchmarks: The absence of comprehensive documentation and clear business goals for mainframe systems makes it difficult to develop benchmarks to evaluate the quality of LLMs in this domain. This hinders the ability to measure their effectiveness and reliability in mainframe modernization tasks.

3. Complexity Beyond Code Generation: LLMs for coding are primarily trained for code generation, the most common use case in software engineering tasks. However, mainframe modernization involves more than just generating COBOL code—organizations aim to migrate their systems to other languages. Thus, LLMs must possess knowledge beyond code generation to effectively modernize these systems.

XMainframe

To address these challenges, researchers at FPT Software AI Center have developed XMainframe, a state-of-the-art large language model (LLM) specifically designed with expertise in mainframe legacy systems and COBOL codebases. The solution includes the creation of an extensive data collection pipeline to produce high-quality training datasets, significantly enhancing XMainframe’s performance in this specialized domain. Additionally, they introduce MainframeBench, a comprehensive benchmark for evaluating mainframe knowledge through multiple-choice questions, question answering, and COBOL code summarization. Empirical evaluations show that XMainframe consistently outperforms existing state-of-the-art LLMs in these tasks, achieving 30% higher accuracy than DeepSeek-Coder on multiple-choice questions, doubling the BLEU score of Mixtral-Instruct 8x7B on question-answering, and scoring six times higher than GPT-3.5 on COBOL summarization. This work underscores XMainframe’s potential to drive significant advancements in managing and modernizing legacy systems, ultimately enhancing productivity and saving time for software developers.

Illustration of steps to collect data to build Mainframe:

Results on MCQ:

Results on Q&A

Results on Code Summarization:


Check out the Paper and GitHub. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 48k+ ML SubReddit

Find Upcoming AI Webinars here


Thanks to FPT Software AI Center for the thought leadership/ Resources for this article. FPT Software AI Center has supported us in this content/article.

The post Researchers at FPT Software AI Center Introduce XMainframe: A State-of-the-Art Large Language Model (LLM) Specialized for Mainframe Modernization to Address the $100B Legacy Code Modernization appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

XMainframe 大型语言模型 主机现代化 COBOL 遗留代码
相关文章