Artificial Ignorance, November 15, 2024
AI Roundup 093: Diminishing returns

 

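Recently, AI giants such as OpenAI, Google, and Anthropic have hit a bottleneck in developing their latest large models: despite heavy spending on training, the models' performance has fallen short of expectations. The main causes are a shortage of high-quality training data and inflated expectations driven by industry hype. Meanwhile, AI companies are developing new internal benchmarks because existing public benchmarks have become obsolete. The article also covers the latest AI developments at OpenAI, Google, Apple, and other companies, along with topics such as AI art and AI misuse.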

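🤔 **AI model development hits a bottleneck:** Models such as OpenAI's Orion, Google's Gemini, and Anthropic's Claude 3.5 Opus have fallen short of expectations despite the substantial resources invested in training them, held back mainly by a shortage of high-quality training data and by the industry's inflated expectations for AI models.

📊 **Public benchmarks lose their usefulness:** Existing public benchmarks no longer distinguish well between new AI models: scores are uniformly high, so performance differences are hard to see. AI companies have therefore begun developing new internal benchmarks, which in turn raises concerns about transparency and comparability.

🚀 **AI use cases expand:** The shift from chatbots to intelligent agents calls for more sophisticated evaluation systems, such as sandbox environments or multi-stage problem-solving scenarios, in place of simple multiple-choice formats.

🎨 **AI art gains market acceptance:** A portrait of Alan Turing by the AI artist Ai-Da sold for $1.1 million, underscoring the growing market recognition of AI-generated art and prompting reflection on creativity, copyright, and artistic value.

⚠️ **Risk of AI misuse grows:** AI is being used to generate fake albums and spread disinformation, raising concerns about misuse. Examples include AI-generated fake music albums used to infringe copyright and AI-generated disinformation circulating on social media.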

Artwork created with Midjourney.

Diminishing returns: Training

OpenAI, Google, and Anthropic are discovering that bigger isn't always better, as their latest AI models fall short of expectations despite massive investments in training.

The big picture:


Elsewhere in OpenAI:

Artwork created with Midjourney.

Diminishing returns: Benchmarks

AI companies are also developing new internal benchmarks, as existing public benchmarks are quickly becoming obsolete.

Why it matters:

Elsewhere in the FAANG free-for-all:

Source: www.ai-darobot.com

Art-ificial intelligence

A portrait of Alan Turing created by Ai-Da, a humanoid robot artist, sold for $1.1 million at Sotheby's - nearly ten times its estimated value.

Between the lines:

Elsewhere in AI absurdity:

Artificial Ignorance is reader-supported. If you found this interesting or insightful, consider becoming a free or paid subscriber.

Things happen

François Chollet is leaving Google after nearly a decade. A Q&A with Gwern Branwen on anonymity, intelligence, and AGI timelines. High-end AI chips are creating a winner-takes-all trend in the chip sector. Elon Musk's supercomputer freaked out AI rivals. Tech companies are leasing more office space as AI demand grows. SoftBank plans to build an AI supercomputer in Japan. Ilya Sutskever says we're back in the age of wonder and discovery. Taiwan's chip production is on track to increase 22% YoY in 2024. US private data center construction spending has grown to nearly $30B/year. AI adoption rates have stalled in the US at ~33%. Baidu unveils smart glasses powered by its Ernie LLM. AI startup founders hope for lighter regulations under Trump. Alibaba claims its Qwen2.5-Coder-32B-Instruct matches GPT-4. Spotify's CTO on AI-generated music and recommendations. Nvidia B200 GPU and Google Trillium TPU debut on MLPerf Training benchmark. Ecosia and Qwant launch European Search Perspective. US orders TSMC to halt shipments of advanced chips to China. AI makes tech debt more expensive. Claude AI to process secret government data through Palantir deal. The barriers to AI engineering are crumbling fast. AI hype is cooling, according to a new survey. AlphaFold3 is now open source. You're probably not testing your AI well enough. A look at diagrams AI can and cannot generate. Facebook Research releases "Watermark Anything". Researchers detail RoboPAIR, an algorithm to bypass LLM safeguards. Concerns rise about AI-fabricated scientific data. AI disinfo has amplified satire and false narratives since August. Study finds 7% to 17% of CS peer review sentences were written by LLMs. Chegg has lost 500K+ subscribers since ChatGPT's launch. Greg Brockman has returned to OpenAI.


Fundraising

