Trending
Articles related to "In-Context Learning"
Why Are Large Models Good at Few-Shot Learning? This ICLR 2025 Study Offers a Detailed Analysis
机器之心 2025-04-27T15:36:13.000000Z
This AI Paper Identifies Function Vector Heads as Key Drivers of In-Context Learning in Large Language Models
MarkTechPost@AI 2025-03-04T20:20:14.000000Z
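Simple Examples Boost DeepSeek-R1's Score on the American Invitational Mathematics Examination (AIME): Aligning In-Context Learning and Reasoning at Step-Level Granularity
BAAI Community 2025-02-21T08:59:03.000000Z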
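Simple Examples Boost DeepSeek-R1's Score on the American Invitational Mathematics Examination (AIME): Aligning In-Context Learning and Reasoning at Step-Level Granularity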
量子位 2025-02-20T16:24:50.000000Z
RWKV-7 1.5B Base Model Released: We Will Surely Be Able to Run 1T-Parameter Models Efficiently on Phones
RWKV元始智能 2025-01-30T16:20:28.000000Z
Ideas from Physics in Theoretical Research on Neural Networks
BAAI Community 2025-01-16T09:52:55.000000Z
Why Do Task Vectors Exist in Pretrained LLMs? This AI Research from MIT and Improbable AI Uncovers How Transformers Form Internal Abstractions and the Mechanisms Behind in-Context Learning (ICL)
MarkTechPost@AI 2024-12-24T02:04:48.000000Z
UC Berkeley Researchers Explore the Role of Task Vectors in Vision-Language Models
MarkTechPost@AI 2024-12-08T05:48:48.000000Z
2024.12.02 Daily AI Papers | HiAR-ICL Improves Performance on Complex Tasks; Domain Adaptation of Multimodal Models Enhanced.
HuggingFace Daily AI Papers Express 2024-12-05T15:36:47.000000Z
Revolutionizing In-Context Learning: The HiAR-ICL Paradigm for Advanced Reasoning with MCTS
MarkTechPost@AI 2024-12-05T07:53:48.000000Z
GPT-3, a Giant Step for Deep Learning and NLP
2024-11-26T06:35:35.000000Z
In-Context LoRA Enables Efficient Multi-Task Image Generation, Opening a New Chapter in Visual Creation
ModelScope Community 2024-11-19T13:32:07.000000Z
Express | Intelligence Emerges: Vidu Ushers In the Era of Visual Context!
Z Potentials 2024-11-16T14:16:40.000000Z
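China's "Strongest on Earth" Video Model Stuns Foreign Viewers, with the Official Team Pulling In People On-Site and Generating Output in 30 Seconds! Vision Models Enter the Context Era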
新智元 2024-11-14T07:31:10.000000Z
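Tsinghua, Xiamen University, and Others Propose an "Infinite-Length Context" Technique: All Green on the 1-Million Needle-in-a-Haystack Test, with Gains for Llama, Qwen, and MiniCPM
BAAI Community 2024-11-10T03:07:16.000000Z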
Using and Finetuning Pretrained Transformers
Ahead of AI 2024-10-22T06:07:40.000000Z
How Large Language Models (LLMs) can Perform Multiple, Computationally Distinct In-Context Learning (ICL) Tasks Simultaneously
MarkTechPost@AI 2024-10-17T16:36:19.000000Z
GraphIC: A Novel Machine Learning Approach that Leverages Graph-based Representations of Reasoning Processes Coupled with Bayesian Networks (BNs) to Select In-Context Examples (ICE)
MarkTechPost@AI 2024-10-08T07:36:15.000000Z
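Researchers Reveal a "New Secret" of Instruction Tuning for Large Models, Enabling Efficient, Low-Cost Customization
MIT Technology Review - This Week's Top Stories 2024-10-06T16:02:04.000000Z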
‘bge-en-icl’: A Novel AI Model that Employs Few-Shot Examples to Produce High-Quality Text Embeddings
MarkTechPost@AI 2024-10-01T13:36:33.000000Z