热点
"语言模型" 相关文章
GPT-2:让语言模型一统多任务学习江湖
掘金 人工智能 2025-08-02T09:55:08.000000Z
Nat. Methods | SWING:一种通用于肽和蛋白质互作的滑动窗口交互语言模型
智源社区 2025-08-02T07:54:00.000000Z
AI拿下奥数IMO金牌,但数学界的AlphaGo时刻还没来
钛媒体:引领未来商业与生活新知 2025-08-01T08:51:28.000000Z
A Language Model-Driven Semi-Supervised Ensemble Framework for Illicit Market Detection Across Deep/Dark Web and Social Platforms
cs.AI updates on arXiv.org 2025-08-01T04:08:20.000000Z
Can LLM-Reasoning Models Replace Classical Planning? A Benchmark Study
cs.AI updates on arXiv.org 2025-08-01T04:08:18.000000Z
A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving
cs.AI updates on arXiv.org 2025-08-01T04:08:15.000000Z
Causal Reasoning in Pieces: Modular In-Context Learning for Causal Discovery
cs.AI updates on arXiv.org 2025-08-01T04:08:14.000000Z
Forcing Language Models to Be ‘Friendly’ Makes Them More Inaccurate and Unsafe
Unite.AI 2025-07-31T20:35:43.000000Z
Simulating large systems with Regression Language Models
智源社区 2025-07-31T17:53:07.000000Z
字节跳动Seed团队发布扩散语言模型,每秒推理速度2146 tokens
36氪 2025-07-31T14:55:18.000000Z
字节跳动 Seed 团队发布扩散语言模型 Diffusion Preview,每秒推理速度 2146 tokens
IT之家 2025-07-31T13:15:06.000000Z
字节跳动Seed团队发布扩散语言模型,每秒推理速度2146 tokens
界面快报 2025-07-31T12:56:56.000000Z
Reducing Hallucinations in Summarization via Reinforcement Learning with Entity Hallucination Index
cs.AI updates on arXiv.org 2025-07-31T04:48:16.000000Z
AI-generated stories favour stability over change: homogeneity and cultural stereotyping in narratives generated by gpt-4o-mini
cs.AI updates on arXiv.org 2025-07-31T04:48:07.000000Z
SPIRAL:零和游戏自对弈成为语言模型推理训练的「免费午餐」
机器之心 2025-07-30T10:20:58.000000Z
Rubrics as Rewards (RaR): A Reinforcement Learning Framework for Training Language Models with Structured, Multi-Criteria Evaluation Signals
MarkTechPost@AI 2025-07-30T04:29:46.000000Z
Strategist: Self-improvement of LLM Decision Making via Bi-Level Tree Search
cs.AI updates on arXiv.org 2025-07-30T04:12:16.000000Z
VN-MTEB: Vietnamese Massive Text Embedding Benchmark
cs.AI updates on arXiv.org 2025-07-30T04:11:57.000000Z
开源模型也能卷出SOTA!MiroMind-M1高效推理压缩token,训练数据与代码全透明
PaperWeekly 2025-07-30T03:06:45.000000Z
AQUA: A Large Language Model for Aquaculture & Fisheries
cs.AI updates on arXiv.org 2025-07-29T04:22:25.000000Z