arXiv:2508.04307v1 Announce Type: cross
Abstract: We demonstrate that Principal Component Analysis (PCA), when applied in a structured manner, either to polar-transformed images or segment-wise to token sequences, enables extreme compression of neural models without sacrificing performance. Across three case studies, we show that a one-layer classifier trained on PCA-compressed polar MNIST achieves over 98 percent accuracy using only 840 parameters. A two-layer transformer trained on 70-dimensional PCA-reduced MiniLM embeddings reaches 76.62 percent accuracy on the 20 Newsgroups dataset with just 81,000 parameters. A decoder-only transformer generates coherent token sequences from 70-dimensional PCA embeddings while preserving over 97 percent cosine similarity with full MiniLM representations, using less than 17 percent of the parameter count of GPT-2. These results highlight PCA-based input compression as a general and effective strategy for aligning model capacity with information content, enabling lightweight architectures across multiple modalities.
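For orientation, here is a minimal sketch of the first case study's pipeline (polar remap, then PCA, then a single linear classifier) as one might reproduce it with scikit-learn. The polar grid, the choice of 83 components (one plausible reading of the 840-parameter figure: 83 weights per class times 10 classes plus 10 biases), and the use of logistic regression as the "one-layer classifier" are illustrative assumptions, not the paper's confirmed configuration.

```python
# Hypothetical sketch: polar remap -> PCA -> single linear layer on MNIST.
# The polar grid, 83 PCA components, and logistic-regression head are assumptions;
# the paper's exact setup may differ.
import numpy as np
from scipy.ndimage import map_coordinates
from sklearn.datasets import fetch_openml
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression

def to_polar(img, n_r=28, n_theta=28):
    """Resample a 28x28 image onto an (r, theta) grid centred on the image."""
    cy, cx = 13.5, 13.5                      # image centre
    r = np.linspace(0, 13.5, n_r)
    t = np.linspace(0, 2 * np.pi, n_theta, endpoint=False)
    rr, tt = np.meshgrid(r, t, indexing="ij")
    ys, xs = cy + rr * np.sin(tt), cx + rr * np.cos(tt)
    return map_coordinates(img, [ys, xs], order=1, mode="constant")

# Load MNIST and warp every image into polar coordinates (slow but simple loop).
X, y = fetch_openml("mnist_784", version=1, return_X_y=True, as_frame=False)
X = X.reshape(-1, 28, 28) / 255.0
X_polar = np.stack([to_polar(im) for im in X]).reshape(len(X), -1)

# 83 components: 83 * 10 weights + 10 biases = 840 parameters (assumed breakdown).
pca = PCA(n_components=83)
Z = pca.fit_transform(X_polar)

# The "one-layer classifier": a single linear map over the PCA features.
clf = LogisticRegression(max_iter=1000)
clf.fit(Z[:60000], y[:60000])
print("test accuracy:", clf.score(Z[60000:], y[60000:]))
```

The same pattern carries over to the text case studies: replace the polar remap with MiniLM sentence embeddings, fit PCA to 70 dimensions, and feed the reduced vectors to a small transformer instead of a linear head.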