钛媒体:引领未来商业与生活新知 前天 11:51
China's Gaokao Puts Domestic AI Models to the Test and Under Tight Control
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

在中国高考期间,国内大型语言模型(LLM)纷纷采取了自我审查措施,限制与考试相关内容的交互。尽管这些模型在技术上能够回答高考问题,但为了遵守法规、管理声誉风险,并避免任何可能损害考试公正性的行为,它们选择主动限制了对高考题目的解答。这种策略反映了中国对AI在教育领域应用的谨慎态度,以及对考试公平性的高度重视。然而,随着AI技术在教育领域的深入应用,未来可能会出现AI辅助学习的新模式。

🤔 中国的大型语言模型(LLM)在高考期间实施了严格的限制,禁止或限制了对与考试相关内容的访问,特别是数学问题,以避免潜在的风险。

⚠️ 这种自我审查是出于合规、安全和声誉风险管理的考虑。任何可能损害考试公正性的行为都可能引发政治危机,因此LLM平台选择规避风险。

📚 国内的LLM平台正在积极开发AI驱动的辅导工具,旨在构建自适应知识图谱、提供个性化指导,并培养批判性思维能力。未来的趋势是AI在教育领域的应用,而非仅仅充当考试解答者。

🎓 尽管在考试中禁止使用LLM,但在研究、写作和学习中,对LLM的使用态度各不相同。一些教育工作者鼓励负责任地使用,只要AI生成的内容得到适当引用和透明披露。


AI generated


AsianFin -- As millions of students across China sit for the annual Gaokao, the nation's most authoritative standardized exam, a surprising subplot is unfolding in the world of artificial intelligence: domestic large language models (LLMs) are refusing to take the test.

Widely regarded as one of the fairest and most rigorous large-scale selection systems in the world, the Gaokao is a formidable measure not just of academic knowledge, but of deeper abilities such as logical reasoning, information synthesis, mental agility, and written expression. These are precisely the kinds of capabilities that AI models—especially the latest general-purpose LLMs—are now being designed to emulate.

But this year, China's leading AI companies have drawn a line.

From June 7 to 10, during the official Gaokao period, mainstream Chinese LLM platforms have implemented sweeping restrictions on engaging with exam-related content—especially math questions, traditionally considered a benchmark of reasoning ability. Users attempting to upload math problems from the 2025 national exam paper were met with errors, blocked uploads, or blanket messages such as "feature not supported."

What's more, certain core capabilities, including image recognition of questions and even keyword responses involving "Gaokao" or specific exam subjects, have been disabled across major platforms. DeepSeek, one of China's most advanced LLMs, imposed the strictest limitations, even while providing relatively robust answers under more generalized prompts.

In contrast, foreign models like ChatGPT and Claude remain technically capable of answering Gaokao-style questions with advanced reasoning. But despite comparable or superior capabilities, Chinese LLM developers are opting for strategic self-censorship—a mix of compliance, safety, and reputational risk management.

"This is not a technical failure. It's a deliberate downgrade—a governance decision," said an industry insider familiar with platform content moderation mechanisms.

Although there is no publicly reported case of AI-enabled cheating during the Gaokao, the exam's intense security and national sensitivity leave no room for error. Any suggestion that AI tools might compromise test integrity—by solving questions or helping students mid-exam—could escalate into a political crisis.

Regulators are already watching closely. On May 30, China's Ministry of Education, Cyberspace Administration, and Ministry of Public Security jointly announced a crackdown on illegal activities surrounding the Gaokao. The targets: exaggerated "AI-assisted prediction" products, fake prep materials, and scams masquerading as AI-driven miracle tools.

Earlier this year, state broadcaster CCTV raised alarms over wearable AI-enabled gadgets, like smart glasses, that could be used for stealth cheating. Rokid CEO Zhu Mingming suggested "signal blocking or disabling functions" as the simplest countermeasure.

With such scrutiny, domestic LLM platforms have every incentive to sidestep potential risk—both legal and reputational. For now, rejecting Gaokao questions altogether may be the safest play.

China's top AI models are not backing away from the Gaokao because they can't handle it—they're doing so because engaging carries too much downside. In fact, many of these models now rival or exceed international peers in select performance benchmarks and specialized applications.

But the hallucination problem—inconsistent or inaccurate outputs, especially in subjects requiring precise calculations—remains a lingering weakness for all LLMs. And in a high-stakes test like the Gaokao, any mismatch between "AI-generated answers" and official ones could provoke public backlash.

Some model developers have previously marketed their ability to "solve Gaokao problems with high accuracy," but most are now choosing discretion over demonstration.

Still, these restrictions are unlikely to be permanent. Once the Gaokao concludes, partial support for K12-related content is expected to return, driven by ongoing market demand.

Interestingly, the ones complaining most during this AI blackout aren't high schoolers—they're university students in the middle of their own final exams. On Chinese social media, posts like "College students are the real victims of the Gaokao" and "Please let us use AI—help us survive finals" have gained traction, reflecting the extent to which LLMs have become embedded in students' academic routines.

While LLMs are banned in exams across most universities, attitudes toward their use in research, writing, and study vary. Some educators encourage responsible usage, so long as AI-generated material is properly cited and transparently disclosed. But in test scenarios—where fairness is paramount—AI assistance remains a clear red line.

The Road Ahead: AI as Tutor, Not Test-Taker

Looking forward, temporary restrictions during national exams are likely to become standard practice for domestic LLM platforms. But the broader trend—AI's integration into education—is far from stalling.

China's edtech giants are racing to develop AI-powered tutors, not to spoon-feed answers, but to build adaptive knowledge maps, provide personalized guidance, and foster critical thinking. The future of "AI + Education" lies in heuristic learning models, not in serving as glorified test solvers.

As China's educational system evolves and its LLM ecosystem matures, striking the right balance between compliance, innovation, and educational value will be a defining challenge—and opportunity.

 

更多精彩内容,关注钛媒体微信号(ID:taimeiti),或者下载钛媒体App

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

人工智能 高考 AI教育 LLM 自我审查
相关文章