The Rundown AI -每日精选 07月31日 15:33
China's open-source AI surge continues
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本期AI资讯聚焦中国科技公司的最新进展。Z.ai发布了强大的开源模型GLM-4.5,在推理、编码和自主任务方面表现出色,价格更具竞争力。微软则在Edge浏览器中推出了“Copilot Mode”,将AI助手深度集成到浏览体验中。此外,还有关于如何替换视频角色声音的教程,以及Alibaba的Wan2.2在开源视频生成领域取得的突破,该模型在电影感和运动质量上超越了Sora等竞品。整体来看,中国在AI领域的开源生态和技术创新步伐正在加速,对全球AI格局产生重要影响。

🌟 Z.ai发布GLM-4.5开源模型,性能逼近顶尖模型,价格更优,并在自主任务上表现突出,成功率高达90%,同时开源了“slime”训练框架,推动了开源AI生态的发展。

💻 微软在Edge浏览器中推出“Copilot Mode”,将AI助手无缝集成到浏览体验中,支持跨标签页搜索和主动任务处理,标志着浏览器进入“智能代理”竞争新阶段。

🎥 Alibaba的Wan2.2开源视频模型在文本到视频和图像到视频生成方面取得了显著进步,通过“专家”系统优化效率,并在美学、文字渲染和镜头控制等方面超越了Sora等竞争对手,为视频生成领域注入新动力。

💡 本文还介绍了使用Google Veo和ElevenLabs替换视频角色声音的详细步骤,以及Prezi AI在快速创建演示文稿方面的优势,为AI工具的应用提供了更多实践指导。

🚀 中国AI实验室正以前所未有的速度推出高质量的开源模型,涵盖语言和视频领域,这不仅缩小了与前沿系统的差距,也对OpenAI等公司的发布构成了竞争压力,加速了全球AI技术的迭代。

Read Online | Sign Up | Advertise

Good morning, {{ first_name | AI enthusiasts }}. While the AI world awaits the imminent launch of OpenAI’s open-source model and GPT-5, Chinese labs continue to churn out the headlines.

New launches from Zai and Alibaba just raised the open-source bar again in both language and video models — continuing a relentless pace of development out East that is shifting the AI landscape faster than ever.

Note: We just opened multi-seats for our AI University. If you're looking to build your team's AI upskilling learning path, reach out here.


In today’s AI rundown:

    Z.ai’s new open-source powerhouse

    Microsoft’s ‘Copilot Mode’ for agentic browsing

    Replace any character voice in your videos

    Alibaba’s Wan2.2 pushes open-source video forward

    4 new AI tools & 4 job opportunities

LATEST DEVELOPMENTS

Z AI

🤖 Z.ai’s new open-source powerhouse

Image source: Zai

The Rundown: Chinese startup Z.ai (formerly Zhipu) just released GLM-4.5, an open-source agentic AI model family that undercuts DeepSeek's pricing while nearing the performance of leading models across reasoning, coding, and autonomous tasks.

The details:

    4.5 combines reasoning, coding, and agentic abilities into a single model with 355B parameters, with hybrid thinking for balancing speed vs. task difficulty.

    Z.ai claims 4.5 is now the top open-source model worldwide, and ranks just behind industry leaders o3 and Grok 4 in overall performance.

    The model excels in agentic tasks, beating out top models like o3, Gemini 2.5 Pro, and Grok 4 on benchmarks while hitting a 90% success rate in tool use.

    In addition to 4.5 and 4.5-Air launching with open weights, Z.ai also published and open-sourced their ‘slime’ training framework for others to build off of.

Why it matters: Qwen, Kimi, DeepSeek, MiniMax, Z.ai… The list goes on and on. Chinese labs are putting out better and better open models at an insane pace, continuing to both close the gap with frontier systems and put pressure on the likes of OpenAI’s upcoming releases to stay a step ahead of the field.

TOGETHER WITH GUIDDE

🎥 Create instant video guides with AI

The Rundown: Stop wasting time on repetitive explanations. Guidde’s AI helps you create stunning video guides in seconds, 11x faster.

Use Guidde to:

    Auto-generate step-by-step video guides with visuals, voiceovers, and a CTA

    Turn boring docs into visual masterpieces

    Save hours with AI-powered automation

    Share or embed your guide anywhere

Download the free extension.

MICROSOFT

🦄 Microsoft’s ‘Copilot Mode’ for agentic browsing

Image source: Microsoft

The Rundown: Microsoft just released ‘Copilot Mode’ in Edge, bringing the AI assistant directly into the browser to search across open tabs, handle tasks, and proactively suggest and take actions.

The details:

    Copilot Mode integrates AI directly into Edge's new tab page, integrating features like voice and multi-tab analysis directly into the browsing experience.

    The feature launches free for a limited time on Windows and Mac with opt-in activation, though Microsoft hinted at eventual subscription pricing.

    Copilot will eventually be able to access users’ browser history and credentials (with permission), allowing for actions like completing bookings or errands.

Why it matters: Microsoft Edge now enters into the agentic browser wars, with competitors like Perplexity’s Comet and TBC’s Dia also launching within the last few months. While agentic tasks are still rough around the edges across the industry, the incorporation of active AI involvement in the browsing experience is clearly here to stay.

AI TRAINING

🎤 Replace any character voice in your videos

The Rundown: In this tutorial, you will learn how to transform AI-generated videos by replacing their default voices with custom voices using Google Veo, audio conversion tools, and ElevenLabs’ voice cloning.

Step-by-step:

    Create your AI video using Google Veo and download the MP4 file

    Convert the video to MP3 using any audio extractor from a video tool

    Go to ElevenLabs’ Voice Changer, upload your MP3, and generate speech with your chosen voice.

    Import both the original video and new audio into CapCut, mute the original audio, and export your video with the custom voice.

Pro tip: Create voice clones in ElevenLabs to maintain consistent character voices across all your video projects.

PRESENTED BY PREZI

💡 Go from idea to “woah” presentation in seconds

The Rundown: Prezi AI doesn’t just make slides. It builds persuasive narratives with a dynamic format designed to hold attention and help your message land. Whether you're pitching or presenting to your team, Prezi transforms your ideas into presentations that actually perform.

With Prezi, you can:

    Go from rough ideas or PDFs to standout presentations in seconds

    Engage your audience with a format proven to be more effective than slides

    Get AI-powered suggestions for content, structure, and design

Try Prezi AI for free and beat boring slides.

ALIBABA

🎥 Alibaba’s Wan2.2 pushes open-source video forward

Image source: Alibaba

The Rundown: Alibaba's Tongyi Lab just launched Wan2.2, a new open-source video model that brings advanced cinematic capabilities and high-quality motion for both text-to-video and image-to-video generations.

The details:

    Wan2.2 uses two specialized "experts" — one creates the overall scene while the other adds fine details, keeping the system efficient.

    The model surpassed top rivals, including Seedance, Hailuo, Kling, and Sora, in aesthetics, text rendering, camera control, and more.

    It was trained on 66% more images and 83% more videos than Wan2.1, enabling it to better handle complex motion, scenes, and aesthetics.

    Users can also fine-tune video aspects like lighting, color, and camera angles, unlocking more cinematic control over the final output.

Why it matters: China’s open-source flurry doesn’t just apply to language models like GLM-4.5 above — it’s across the entire AI toolbox. While Western labs are debating closed versus open models, Chinese labs are building a parallel open AI ecosystem, with network effects that could determine which path developers worldwide adopt.

QUICK HITS

🛠️ Trending AI Tools

💼 AI Job Opportunities

    📱 Databricks - Senior Digital Media Manager

    🗂️ Parloa - Executive Assistant to the CRO

    🤝 UiPath - Partner Sales Executive

    🧑‍💻 xAI - Software Engineer, Developer Experience

📰 Everything else in AI today

Alibaba debuted Quark AI glasses, a new line of smart glasses launching by the end of the year, powered by the company’s Qwen model.

Anthropic announced weekly rate limits for Pro and Max users due to “unprecedented demand” from Claude Code, saying the move will impact under 5% of current users.

Tesla and Samsung signed a $16.5B deal for the manufacturing of Tesla’s next-gen AI6 chips, with Elon Musk saying the “strategic importance of this is hard to overstate.”

Runway signed a new partnership agreement with IMAX, bringing AI-generated shorts from the company’s 2025 AI Film Festival to big screens at ten U.S. locations in August.

Google DeepMind CEO Demis Hassabis revealed that Google processed 980 trillion (!) tokens across its AI products in June, an over 2x increase from May.

Anthropic published research on automated agents that audit models for alignment issues, using them to spot subtle risks and misbehaviors that humans might miss.

COMMUNITY

🎥 Join our next live workshop

Join our next workshop this Friday, August 1st, at 4 PM EST with Dr. Alvaro Cintas, The Rundown’s AI professor. By the end of this workshop, you’ll have practical strategies to get the AI to do exactly what you want.

RSVP here. Not a member? Join The Rundown University on a 14-day free trial.

See you soon,

Rowan, Joey, Zach, Alvaro, and Jason—The Rundown’s editorial team

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Z.ai GLM-4.5 Microsoft Copilot Alibaba Wan2.2 AI开源
相关文章