TechCrunch News 04月29日 05:41
Alibaba unveils Qwen 3, a family of ‘hybrid’ AI reasoning models
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

阿里巴巴发布了Qwen 3系列AI模型,声称在某些方面超越了谷歌和OpenAI的最佳模型。这些模型大小从0.6亿到2350亿参数不等,可在Hugging Face和GitHub上下载。Qwen 3是混合模型,可以快速处理简单请求,也能“思考”复杂问题。它支持119种语言,使用近36万亿tokens的数据集进行训练,包括教科书、问答对和代码片段。最大模型Qwen-3-235B-A22B在编程竞赛平台Codeforces和数学基准测试AIME上优于OpenAI的o3-mini。Qwen 3还擅长工具调用、遵循指令和复制特定数据格式。

🧠Qwen 3模型是混合模型,能够根据问题的复杂性,在快速响应和深度推理之间切换,从而实现更高效的问题解决。

🌐Qwen 3支持多达119种语言,并使用包含教科书、问答对以及代码片段等多样化内容、总计近36万亿tokens的数据集进行训练,这极大地提升了模型的性能。

🏆在性能测试中,Qwen-3-235B-A22B模型在Codeforces编程竞赛平台和AIME数学基准测试中,均超越了OpenAI的o3-mini模型,展现了强大的竞争力。

🧰Qwen 3在工具调用、遵循指令以及特定数据格式复制等方面表现出色,这使得它在实际应用中更具优势。

Chinese tech company Alibaba on Monday released Qwen 3, a family of AI models the company claims matches and in some cases outperforms the best models available from Google and OpenAI.

Most of the models are — or soon will be — available for download under an “open” license from AI dev platform Hugging Face and GitHub. They range in size from 0.6 billion parameters to 235 billion parameters. Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.

The rise of China-originated model series like Qwen have increased the pressure on American labs such as OpenAI to deliver more capable AI technologies. They’ve also led policymakers to implement restrictions aimed at limiting the ability of Chinese AI companies to obtain the chips necessary to train models.

According to Alibaba, Qwen 3 models are “hybrid” models in the sense that they can take time and “reason” through complex problems or answer simpler requests quickly. Reasoning enables the models to effectively fact-check themselves, similar to models like OpenAI’s o3, but at the cost of higher latency.

“We have seamlessly integrated thinking and non-thinking modes, offering users the flexibility to control the thinking budget,” wrote the Qwen team in a blog post.

The Qwen 3 models support 119 languages, Alibaba says, and were trained on a data set of nearly 36 trillion tokens. Tokens are the raw bits of data that the model processes; 1 million tokens is equivalent to about 750,000 words. Alibaba says Qwen 3 was trained on a combination of textbooks, “question-answer pairs,” code snippets, and more.

These improvements, along with others, greatly boosted Qwen 3’s performance compared to its predecessor, Qwen 2, says Alibaba. On Codeforces, a platform for programming contests, the largest Qwen 3 model — Qwen-3-235B-A22B — beats out OpenAI’s o3-mini. Qwen-3-235B-A22B also bests o3-mini on the latest version of AIME, a challenging math benchmark, and BFCL, a test for assessing a model’s ability to “reason” about problems.

But Qwen-3-235B-A22B isn’t publicly available — at least not yet.

The largest public Qwen 3 model, Qwen3-32B, is still competitive with a number of proprietary and open AI models, including Chinese AI lab DeepSeek’s R1. Qwen3-32B surpasses OpenAI’s o1 model on several tests, including an accuracy benchmark called LiveBench.

Alibaba says Qwen 3 “excels” in tool-calling capabilities as well as following instructions and copying specific data formats. In addition to releasing models for download, Qwen 3 is available from cloud providers including Fireworks AI and Hyperbolic.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Qwen 3 阿里巴巴 AI模型 OpenAI 深度学习
相关文章