Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences

cs.AI updates on arXiv.org 07月08日 12:33

Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences

本文介绍了一种新型多模态大语言模型（MLLM）集成语义通信框架MLLM-SC，旨在解决6G网络在资源受限的无线通信系统中的挑战，并验证了其在AR/VR应用和图像生成中的有效性。

arXiv:2507.04621v1 Announce Type: cross Abstract: 6G networks promise revolutionary immersive communication experiences including augmented reality (AR), virtual reality (VR), and holographic communications. These applications demand high-dimensional multimodal data transmission and intelligent data processing in real-time, which is extremely challenging over resource-limited wireless communication systems. Moreover, a joint understanding of the environment, context, and user intent is essential to deliver task-relevant content effectively. This article presents a novel multimodal large language model (MLLM) integrated semantic communications framework, termed MLLM-SC, which fully leverages reasoning and generative capabilities of pre-trained foundation models for context-aware and task-oriented wireless communication. The MLLM-SC framework adopts a device-edge collaborative architecture. At the edge, MLLM-empowered semantic guidance module analyzes multimodal inputs, user intents, and channel conditions to generate importance-aware attention maps prioritizing semantically critical information. An importance-aware semantic encoder and a resource-adaptive semantic decoder are jointly designed and optimized, which can utilize the semantic guidance for adaptive bandwidth allocation and high-quality content reconstruction or generation. Extensive case studies on visual question answering for AR/VR applications and diffusion-driven image generation validate the effectiveness of MLLM-SC.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

6G网络语义通信 MLLM

相关文章

LeCun谢赛宁首发全新视觉多模态模型，等效1000张A100干翻GPT-4V

抢占 6G 制高点，我国发布国际首个 6G 外场试验网突破性成果

腾讯SEED-Story：生成丰富、叙事连贯及风格一致图文故事的大模型

大胆预测一下10年后，也就是2034年的中国，可能是这样的：1、大街上全是无人驾驶汽车。2、大街小巷，无人快递无人飞机穿梭不停。3、6G网络全面覆盖，7G技术开始...

From Diagrams to Solutions: MAVIS’s Three-Stage Framework for Mathematical AI

【财联社早知道】中国重大突破！3GPP启动首个6G标准项目，这家华为鸿蒙合作商展开了6G的研究工作；这家公司参与鸿蒙生态应用开发，并与华为钱包合作提供一站式交...

【6G网络产业化正式开启，机构扎堆看好这些概念股】9月14日讯，数据宝统计，截至目前，今年以来，6G概念股中，共34股迎来机构调研。7只概念股获机构调研5次以上...

938 Gbps！初探 6G 网络速度极限：比 5G 快 9000 倍，130GB 的《黑神话：悟空》游戏下载仅需 1.1 秒

科技昨夜今晨 1019：6G 实验速度极限可达 5G 的 9000 倍；京东声明后续没有相关脱口秀演员合作计划；比亚迪海豹 06 GT 车型上市...

Waymo 利用谷歌 Gemini 大语言模型，开发端到端自动驾驶模型