热点
"实验研究" 相关文章
Alignment faking in large language models
Newsroom Anthropic 2025-02-26T06:17:45.000000Z
科学家发现野生鱼类能识别并追随特定人类
Cnbeta 2025-02-19T01:51:56.000000Z
苹果也在蒸馏大模型,给出了蒸馏Scaling Laws
机器之心 2025-02-14T08:23:37.000000Z
新研究发现:倭黑猩猩具备推断他人是否“知情”的能力
IT之家 2025-02-04T04:07:27.000000Z
Imaging reveals how microplastics may harm the brain
Physics World 2025-01-29T12:12:37.000000Z
Leveraging Hallucinations in Large Language Models to Enhance Drug Discovery
MarkTechPost@AI 2025-01-28T06:35:09.000000Z
Disproving the "People-Pleasing" Hypothesis for AI Self-Reports of Experience
少点错误 2025-01-26T15:55:31.000000Z
深度|Anthropic团队重磅发现:模型会假装迎合人类,以维护初始偏好
Z Potentials 2025-01-09T17:00:16.000000Z
Favorite colors of some LLMs.
少点错误 2024-12-31T21:24:32.000000Z
蚂蚁群组团“最强 AGI”:破解几何难题“完爆”人类,群体智能登顶 PNAS
IT之家 2024-12-26T07:07:30.000000Z
This AI Paper from Anthropic and Redwood Research Reveals the First Empirical Evidence of Alignment Faking in LLMs Without Explicit Training
MarkTechPost@AI 2024-12-22T03:49:50.000000Z
可以解答我们为何存在的DUNE实验正准备进行
Cnbeta 2024-12-15T12:22:05.000000Z
Unraveling Multimodal Dynamics: Insights into Cross-Modal Information Flow in Large Language Models
MarkTechPost@AI 2024-12-02T08:19:56.000000Z
扩散模型=进化算法!生物学大佬用数学揭示本质
新智元 2024-11-24T09:00:48.000000Z
LLM 比之前预想的更像人类,竟也能「三省吾身」
机器之心 2024-11-03T09:11:11.000000Z
Bursts of embers play outsized role in wildfire spread, say physicists
Physics World 2024-10-31T13:14:51.000000Z
Enhancing Task Planning in Language Agents: Leveraging Graph Neural Networks for Improved Task Decomposition and Decision-Making in Large Language Models
MarkTechPost@AI 2024-10-31T10:05:31.000000Z
Objects with embedded spins could test whether quantum measurement affects gravity
Physics World 2024-10-21T17:29:23.000000Z
重要的事情说两遍,Prompt“复读机”,显著提高LLM推理能力
36kr 2024-10-09T00:19:39.000000Z
[Paper] Can We Bias the MCQ Answers of Vision-Language Models with Visual Stimuli?
少点错误 2024-10-03T18:08:45.000000Z