热点
关于我们
xx
xx
"
训练策略
" 相关文章
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
cs.AI updates on arXiv.org
2025-07-25T04:28:49.000000Z
In-Depth and In-Breadth: Pre-training Multimodal Language Models Customized for Comprehensive Chart Understanding
cs.AI updates on arXiv.org
2025-07-22T04:44:28.000000Z
Entropy Loss: An Interpretability Amplifier of 3D Object Detection Network for Intelligent Driving
cs.AI updates on arXiv.org
2025-07-21T04:06:48.000000Z
Sub-Scaling Laws: On the Role of Data Density and Training Strategies in LLMs
cs.AI updates on arXiv.org
2025-07-16T04:28:49.000000Z
Learning Diffusion Models with Flexible Representation Guidance
cs.AI updates on arXiv.org
2025-07-15T04:24:31.000000Z
Pre-Training LLMs on a budget: A comparison of three optimizers
cs.AI updates on arXiv.org
2025-07-14T04:08:38.000000Z
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
cs.AI updates on arXiv.org
2025-07-14T04:08:15.000000Z
阿里国际Ovis2系列模型开源:多模态大语言模型的新突破
阿里技术
2025-04-09T10:06:09.000000Z
LLM逻辑推演策略选择:推理时计算 vs 训练时计算
OneFlow
2025-04-09T10:05:57.000000Z
波士顿动力Atlas机器人超进化:会跳托马斯 动作完美复刻人类
快科技资讯
2025-03-20T05:16:41.000000Z
When is it Better to Train on the Alignment Proxy?
少点错误
2025-03-11T18:13:27.000000Z
阿里国际Ovis2系列模型开源:多模态大语言模型的新突破
阿里技术
2025-03-04T04:25:03.000000Z
AWS DeepRacer: Closing time at AWS re:Invent 2024 –How did that physical racing go?
AWS Machine Learning Blog
2025-02-27T17:35:14.000000Z
苹果自动驾驶新进展:36块钱训练百万公里数据,10天跑完16亿公里
36氪 - 科技频道
2025-02-24T12:28:20.000000Z
DeepSeek团队新作:把代码变成思维链,大模型推理各种能力全面提升
智源社区
2025-02-18T08:07:27.000000Z
完整解读:从DeepSeek Janus到Janus-Pro!
智源社区
2025-01-31T17:07:12.000000Z
AI 造梦师:香港大学携手快手科技推出 GameFactory 框架,突破游戏场景泛化难题
IT之家
2025-01-19T23:29:12.000000Z
VITA-1.5: 迈向GPT-4o级实时视频-语音交互
我爱计算机视觉
2025-01-12T13:24:03.000000Z
Meta斯坦福全新多模态Apollo,60分钟视频轻松理解!7B性能超越30B
新智元
2024-12-20T07:01:14.000000Z
无编码器多模态大模型EVE:原生多模态新方案
智源研究院
2024-10-24T17:00:57.000000Z