The Rundown AI -每日精选 07月31日 15:33
xAI's Grok 4 arrives
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

xAI公司发布了其下一代AI模型Grok 4和Grok 4 Heavy,在多项基准测试中表现出色,性能超越了Gemini 2.5 Pro和OpenAI的o3模型,标志着AI技术发展迈出了重要一步。Grok 4拥有语音、视觉和128K上下文窗口,而Grok 4 Heavy则通过多智能体协作处理复杂任务。此次发布正值Grok 3因种族歧视和反犹太主义言论引发争议之际,新模型面临的审视也将更为严格。与此同时,Perplexity推出了集成AI助手的Comet浏览器,旨在革新用户网页浏览体验;OpenAI则通过招募来自特斯拉、xAI和Meta的顶尖工程师,强化其在AI扩展和数据中心建设方面的实力。这些动态预示着AI领域的竞争将更加激烈,技术创新和人才争夺将是关键。

🚀 xAI发布Grok 4及Grok 4 Heavy:xAI公司推出了其新一代AI模型Grok 4和Grok 4 Heavy。Grok 4是一款单智能体AI,具备语音、视觉能力,并拥有128K的上下文窗口;Grok 4 Heavy则是其进阶版本,包含多个智能体,能处理更复杂的任务。这两个模型在Arc-AGI、Humanity’s Last Exam和AIME等基准测试中均取得了最先进(SOTA)的成绩,超越了Gemini 2.5 Pro和OpenAI的o3模型。

💡 Grok 4的定价与可用性:Grok 4模型可通过SuperGrok订阅服务使用,月费为30美元。Grok 4 Heavy则包含在新的SuperGrok Heavy套餐中,定价为每月300美元。此外,Grok 4也可通过API访问,提供256K的上下文窗口和内置搜索功能,输入和输出的API费用分别为每百万token 3美元和15美元。

🌐 Perplexity推出Comet浏览器:Perplexity公司发布了Comet浏览器,该浏览器集成了其搜索引擎和AI助手。Comet助手能够执行预订会议、导航网站等代理任务,并能与用户工作流程无缝集成,用户甚至可以通过“vibe browse”模式在不直接与网站交互的情况下浏览内容。该浏览器支持Mac和Windows,并计划逐步向不同级别的用户开放。

🥊 OpenAI加强人才队伍:OpenAI成功招募了四名来自特斯拉、xAI和Meta的资深工程师,以加强其扩展团队,专注于Stargate数据中心和基础设施项目。此举正值AI人才竞争激烈之时,也显示了OpenAI在应对AI领域挑战方面的决心,并可能加剧其与竞争对手之间的“人才战”。

Read Online | Sign Up | Advertise

Good morning, {{ first_name | AI enthusiasts }}. xAI’s highly anticipated Grok 4 has arrived — and it’s crushing benchmarks across the board.

The newest “truth-seeking” AI takes us a step closer to AGI, with Musk suggesting it may start discovering new physics “as soon as this year.” But given the backlash over Grok 3’s recent racist and antisemitic comments, the release is also likely to face more scrutiny than ever before.


In today’s AI rundown:

    xAI releases Grok 4 following 3’s crashout

    Perplexity’s Comet browser for AI-first web

    Turn messy image filenames into descriptive ones

    OpenAI snags top engineers from rivals for scaling team

    4 new AI tools & 4 job opportunities

LATEST DEVELOPMENTS

XAI

🚀 xAI releases SOTA Grok 4 following 3’s crashout

Image source: xAI

The Rundown: xAI just announced Grok 4 and Grok 4 Heavy, its next-gen reasoning-only models that are “better than PHD levels in every subject” and deliver SOTA capabilities across benchmarks, including Arc-AGI and Humanity’s Last Exam.

The details:

    Grok 4 is a single-agent AI with voice, vision, and a 128K context window, while 4 Heavy is its advanced sibling, with multiple agents to tackle complex tasks.

    Both mark a major jump in benchmarks, achieving SOTA on Humanity's Last Exam, Arc-AGI-2, and AIME, and surpassing Gemini 2.5 Pro and OpenAI’s o3.

    Grok 4 is available with the SuperGrok subscription at $30/month, while Grok 4 Heavy is part of the new SuperGrok Heavy plan priced at $300/month.

    The new model is also available via API with a 256K-token context window and built-in search, priced at $3/million input tokens and $15/million output tokens.

    The power-packed release comes after a major backlash against Grok 3, which was caught making racist and antisemitic comments after an update.

Why it matters: Despite being a relatively new player, Musk’s xAI is already challenging the AI heavyweights. The latest release showcases the power of its Colossus supercomputer and pushes the scaling frontier further, though in the wake of the Grok 3 controversy, it’s likely to face heightened scrutiny from experts around the world.

TOGETHER WITH ASAPP

♻️ Automate complex customer interactions with AI agents

The Rundown: ASAPP’s GenerativeAgent is an enterprise-grade AI agent that resolves real customer issues on its own across voice and chat. Built for high-stakes environments, it goes beyond conversation to deliver real outcomes backed by security, collaboration, and continuous learning.

With GenerativeAgent, you’ll experience:

    Fewer transfers, faster resolutions with human-AI collaboration workflow

    Built-in guardrails and enterprise-grade data protection

    Real-time quality monitoring and self-improving

Take a self-guided tour of GenerativeAgent today.

PERPLEXITY

🖥️ Perplexity’s Comet browser for AI-first web

Image Source: Perplexity

The Rundown: Perplexity launched Comet, a new AI browser that embeds the company’s search engine alongside an assistant capable of performing agentic tasks—like booking meetings and navigating websites—while integrating with user workflows.

The details:

    The Comet Assistant lives in a sidebar that watches users browse, answering questions while automating tasks like email and calendar management.

    Users can utilize the agentic assistant to “vibe browse” without interacting directly with sites, using natural language or via voice commands.

    The browser promises seamless integration with existing extensions and bookmarks, supporting both Mac and Windows at launch.

    Perplexity Max users ($200/mo subscription) get first access along with a rolling waitlist, with Pro, free, and Enterprise users coming at a later date.

Why it matters: Chrome has had a chokehold on the browser for years — but appears to be a step behind on the agentic, AI-driven transition. While there will be hiccups as agents continue to evolve, Dia, Comet, and soon OpenAI (more below) are taking the first steps into a new, inevitable shift in how we navigate and take actions on the web.

AI TRAINING

📸 Turn messy image filenames into descriptive ones

The Rundown: In this tutorial, you will learn how to use Google's Gemini CLI to analyze your images and generate SEO-friendly filenames automatically, improving your content organization and search engine visibility.

Step-by-step:

    Install Gemini CLI: npm install -g @google/gemini-cli and authenticate with your Google account

    Test a single image analysis typing: gemini "Describe what's in [image1.png]"

    Batch process: gemini “Process all images in this folder. For each image, analyze the content and rename it with a descriptive filename with relevant keywords for SEO purposes.”

Pro tip: Start with a small batch to understand how Gemini interprets your content, then scale up to your entire image library.

PRESENTED BY INVISIBLE

📊 AI training trends from those behind the scenes

The Rundown: As the partner for many enterprises and leading foundation models including AWS, Microsoft, and Cohere, Invisible has a front row seat into how the AI training landscape is evolving in real-time.

In this free guide, Invisible reveals:

    The most important shifts happening in model development, optimization, and deployment

    The rising bar for quality data

    A focus on specialized training sets

    The emergence of agentic products for real-world outcomes

Download the free guide.

OPENAI

🥊 OpenAI snags top engineers from rivals for scaling team

Image source: Greg Brockman (@gdb on x)

The Rundown: OpenAI recruited four new senior engineers from Tesla, xAI, and Meta for its scaling team, according to WIRED — joining to work on the Stargate data center and infrastructure initiatives, and coming during a tense AI talent war with tech giants.

The details:

    Former Tesla VP of software engineering David Lau will oversee OAI’s backend systems, revealed in an internal message from co-founder Greg Brockman.

    Engineers Uday Ruddarraju and Mike Dalton join OAI’s scaling team to work on Stargate after helping build the 200,000-GPU Colossus supercomputer at xAI.

    Former Meta AI researcher Angela Fan also joins the scaling team, coming amid Meta’s aggressive recruitment of OAI staff that has poached seven staffers.

Why it matters: These hires mark the first public moves for OpenAI since Meta’s hiring spree that has poached talent from across the AI leaders. It’s also a direct strike at Elon Musk’s engineering crew from xAI and Tesla — and given the past relationship between the two, it may stir the pot even further in their ongoing feud.

QUICK HITS

🛠️ Trending AI Tools

    🎬 Marey - Filmmaker-focused AI video model trained on licensed content

    📽️ Veo 3 - Google’s video AI, now with first-frame image + audio output

    🪪 Higgsfield Soul ID - Personalized, consistent character image generation

    🤝 Coachvox - Clone an AI version of yourself to coach clients in your style

💼 AI Job Opportunities

    🧠 Databricks - Director, Emerging Enterprise

    📈 Harvey - Senior Analytics Engineer

    🗂️ Notable - Technical Program Manager

    🔬 Deepmind - Data Scientist, GeminiApp, Verticals

📰 Everything else in AI today

Get up to speed on Agentic AI  learn how to build, test, and deploy AI Agents with Postman’s Rodric Rabbah in this free, on-demand webinar.*

OpenAI is set to launch its own web browser in the “coming weeks” that will challenge Google Chrome, featuring a ChatGPT-like chat interface and agentic integrations.

OpenAI will also reportedly release its highly anticipated open-source model next week, rumored to be “similar to o3 mini” with reasoning capabilities.

Microsoft CCO Judson Althoff said the company has saved over $500M in the past year from AI’s infusion in call centers, following last week’s cut of 9,000 jobs.

AI2 introduced FlexOlmo, a new language model training paradigm that enables data owners to contribute to AI development without sharing their raw data.

Google integrated Gemini into WearOS smartwatches from Pixel, Samsung, Xiaomi and more, enabling natural voice interactions and task management on the devices.

OpenAI announced that its acquisition of Jony Ive’s firm, io, has closed, with Ive and his LoveFrom team staying independent but embedded in OpenAI’s design direction.


*Sponsored Listing

COMMUNITY

🎥 Join our next live workshop

Join our next workshop this Friday, July 11th, at 4 PM EST with Dr. Alvaro Cintas, The Rundown’s AI professor. By the end of the workshop, you’ll confidently be able to install and use Gemini CLI to boost your productivity right from the command line.

RSVP here. Not a member? Join The Rundown University on a 14-day free trial.

See you soon,

Rowan, Joey, Zach, Alvaro, and Jason—The Rundown’s editorial team

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

xAI Grok 4 AI模型 Perplexity OpenAI
相关文章