Training Strategies_Fishai

Articles related to "Training Strategies"

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

cs.AI updates on arXiv.org 2025-07-25T04:28:49.000000Z

In-Depth and In-Breadth: Pre-training Multimodal Language Models Customized for Comprehensive Chart Understanding

cs.AI updates on arXiv.org 2025-07-22T04:44:28.000000Z

Entropy Loss: An Interpretability Amplifier of 3D Object Detection Network for Intelligent Driving

cs.AI updates on arXiv.org 2025-07-21T04:06:48.000000Z

Sub-Scaling Laws: On the Role of Data Density and Training Strategies in LLMs

cs.AI updates on arXiv.org 2025-07-16T04:28:49.000000Z

Learning Diffusion Models with Flexible Representation Guidance

cs.AI updates on arXiv.org 2025-07-15T04:24:31.000000Z

Pre-Training LLMs on a budget: A comparison of three optimizers

cs.AI updates on arXiv.org 2025-07-14T04:08:38.000000Z

M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning

cs.AI updates on arXiv.org 2025-07-14T04:08:15.000000Z

Alibaba International Open-Sources the Ovis2 Model Series: A New Breakthrough in Multimodal Large Language Models

Alibaba Tech 2025-04-09T10:06:09.000000Z

Choosing an LLM Reasoning Strategy: Inference-Time Compute vs. Training-Time Compute

OneFlow 2025-04-09T10:05:57.000000Z

Boston Dynamics' Atlas Robot Levels Up: It Can Perform Thomas Flairs, Perfectly Replicating Human Movement

Fast Technology (快科技) News 2025-03-20T05:16:41.000000Z

When is it Better to Train on the Alignment Proxy?

LessWrong 2025-03-11T18:13:27.000000Z

Alibaba International Open-Sources the Ovis2 Model Series: A New Breakthrough in Multimodal Large Language Models

Alibaba Tech 2025-03-04T04:25:03.000000Z

AWS DeepRacer: Closing time at AWS re:Invent 2024 – How did that physical racing go?

AWS Machine Learning Blog 2025-02-27T17:35:14.000000Z

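Apple's Latest Progress in Autonomous Driving: Training on a Million Kilometers of Data for 36 Yuan, 1.6 Billion Kilometers Driven in 10 Days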

36Kr - Tech Channel 2025-02-24T12:28:20.000000Z

New Work from the DeepSeek Team: Turning Code into Chain-of-Thought, Improving LLM Reasoning Abilities Across the Board

BAAI Community 2025-02-18T08:07:27.000000Z

A Complete Walkthrough: From DeepSeek Janus to Janus-Pro!

BAAI Community 2025-01-31T17:07:12.000000Z

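AI Dream Maker: The University of Hong Kong and Kuaishou Technology Launch the GameFactory Framework, Tackling the Problem of Game Scene Generalization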

ITHome 2025-01-19T23:29:12.000000Z

VITA-1.5: Toward GPT-4o-Level Real-Time Video-Speech Interaction

I Love Computer Vision (我爱计算机视觉) 2025-01-12T13:24:03.000000Z

Meta and Stanford's New Multimodal Model Apollo Easily Understands 60-Minute Videos! The 7B Model Outperforms 30B Models

Xinzhiyuan (新智元) 2024-12-20T07:01:14.000000Z

EVE, an Encoder-Free Multimodal Large Model: A New Approach to Native Multimodality

BAAI (Beijing Academy of Artificial Intelligence) 2024-10-24T17:00:57.000000Z