MarkTechPost@AI 2024年11月25日
SemiKong: An Open Source Foundation Model for Semiconductor Manufacturing Process
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

SemiKong是半导体领域首个行业特定的LLM,旨在解决半导体行业中LLM的专业知识需求问题。它通过详细研究和特定训练,提升了半导体制造的性能,虽处于初始阶段但意义重大。

🦾SemiKong是半导体领域首个行业特定LLM,为定制专有模型奠定基础。

💻训练过程分预训练和微调两个阶段,使用多种数据和工具。

📈评估涉及多个标准,预训练结合微调可提升性能,较大模型表现更佳。

🎉SemiKong虽初始阶段但意义重大,为半导体领域研究提供基线。

Semiconductors are essential in powering various electronic devices and driving development across telecommunications, automotive, healthcare, renewable energy, and IoT industries. In semiconductor manufacturing and design, the two main phases, FEOL and BEOL, present unique challenges. LLMs are trained on vast amounts of text data using self-supervised learning techniques that can capture rich domain knowledge.LLMs can also help in tasks like design rule checking, layout generation, and space exploration in Integrated Circuit (IC) design. LLMs allow the generation of new designs that adhere to the specified constraints and optimize for desired performance metrics, learning from large IC layouts and design rule datasets. However, most models are general and do not possess specific knowledge within the semiconductor industry. This reflects unique problems, such as complex physics and chemistry for semiconductor devices and processes.

Currently, LLMs are general-purpose models that, despite their power, need more specialized knowledge for tasks specific to the semiconductor industry. Artificial Intelligence (AI) improved semiconductor manufacturing by improving mask optimization and hotspot detection through machine learning, deep reinforcement learning, and datasets like LithoBench. In the semiconductor industry, domain-specific large language models (LLMs) such as ChipGPT and ChatEDA outperformed general models in tasks like code generation, debugging, and chatbot assistance. LLMs also evaluated natural language generation tasks, using expert feedback to improve benchmarks and address challenges in complex domain-specific evaluations. 

To integrate the power of LLMs in the semiconductor industry, researchers from Aitomatic Inc., FPT Software AI Center, and Tokyo Electron Ltd conducted detailed research and proposed SemiKong, the first industry-specific LLM for the semiconductor domain that provides a foundation for developing customized proprietary models. SemiKong 1.0 focuses on building a foundational model with an expert-level understanding of etching problems. This approach involves training models with comprehensive domain-specific data. The training process was divided into two stages: pretraining and fine-tuning.

There are very few high-quality datasets for the semiconductor domain. To address this, a large-scale text-based dataset focused on semiconductor concepts and etching problems emerged, including pretraining data from technical books, papers, and patents, along with instruction data featuring 50,000 questions. Tools like GPT-4o-mini handled formatting, while GPT-4o generated and answered some questions. The SemiKong model was trained in three steps. First, it was pre-trained using Llama3 checkpoints to learn about the semiconductor industry. Then, it went through supervised fine-tuning to improve its ability to handle tasks like answering questions and reasoning. Finally, the model was fine-tuned with quantization to make it ready for real-world use, gaining deeper knowledge about semiconductor manufacturing along the way. The researchers used 8 NVIDIA A100 80GB GPUs for training for better performance and training speed.

The evaluation of the SemiKong model involved comparing its performance across several criteria, including Clarity and Directness (C&D), Practicality and Immediate Usability (PIU), Efficiency and Brevity (E&B), Logical Flow and Coherence (LFC), Expert-to-Expert Communication (EEC), and Use of Examples and Specificity (UES). Experiments showed that fine-tuning alone did not significantly improve performance, as domain-specific knowledge was crucial. When pretraining was combined with fine-tuning, performance improved. Larger models with 70B parameters outperformed smaller ones, with the SemiKong 70B model excelling in all criteria. 

In summary, the proposed method provided a robust solution for integrating LLM technology with the semiconductor industry and achieved great performance. It performed better than the open-source foundation model. However, SemiKong is in its initial phase, and significant work remains. This work of integrating the latest LLM technology in manufacturing can act as a baseline for future research in the domain of semiconductors and change it forever!


Check out the Paper and GitHub Page. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

[FREE AI VIRTUAL CONFERENCE] SmallCon: Free Virtual GenAI Conference ft. Meta, Mistral, Salesforce, Harvey AI & more. Join us on Dec 11th for this free virtual event to learn what it takes to build big with small models from AI trailblazers like Meta, Mistral AI, Salesforce, Harvey AI, Upstage, Nubank, Nvidia, Hugging Face, and more.

The post SemiKong: An Open Source Foundation Model for Semiconductor Manufacturing Process appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

SemiKong 半导体制造 LLM技术 行业特定模型
相关文章