LessWrong, 18 hours ago
Is there a looming Cultural Omnicide?

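The article examines the potential threat that the global rollout of consumer-entertainment technology poses to cultural diversity, especially for populations with low cognitive security. It notes that the spread of the internet confronts local cultures with unprecedented challenges, and that traditional knowledge systems are being replaced by a homogenized internet and AI. It stresses that a culture that is not digitized is effectively absent from the AI world, which reduces AI's cognitive diversity. It calls for attention to cultural diversity and proposes protecting and enriching different cultures through digitization in order to shape a more inclusive AI future.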

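🌍 **Accelerating cultural erosion**: The spread of the internet gives previously offline populations highly enticing content, which erodes local cultures. The erosion is not limited to entertainment; it also affects the transmission of traditional customs, languages, and knowledge. For example, after Starlink arrived among the Marubo tribe in Brazil, young people abandoned traditional cultural learning.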

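🤖 **The risk of AI cognitive homogenization**: Internet content is homogenizing, and AI systems learn from that content, so AI cognition converges as well. If a culture is not digitized, AI cannot learn or understand it, which limits AI's ways of thinking and its ability to solve problems. For example, the lack of Sanskrit OCR makes it hard for AI to access Vedic texts.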

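💡 **The value of cultural diversity**: Cultural diversity, like biodiversity, holds a rich store of solutions to problems. The article gives examples of knowledge embedded in traditional cultures contributing to science and medicine, such as the discovery of leukemia drugs and antimalarial drugs, and the role of traditional farming techniques in countering desertification.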

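🤝 **The importance of protecting and enriching cultures**: The article stresses that giving AI broader cognition and stronger problem-solving ability requires actively promoting cultural digitization. For example, Ayah Bdeir is working to digitize Arab culture, and Rohan Pandey is working on Sanskrit OCR. This is not only cultural preservation but also a way to prevent AI cognitive homogenization and build a more inclusive future.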

Published on June 25, 2025 6:18 PM GMT

The reality, risks, & consequences of scaling consumer-entertainment technologies to a global population with little cognitive security. [mirrored from Substack]

Western, American culture will be in frontier language models’ training data. 
Most other cultures will not.
If nobody steps in, ~90% of mankind's culture could die quite soon.

Globally, indigenous cultures face an unprecedented threat to their traditional heritage.

The Marubo, a 2,000-member tribe in the Brazilian Amazon, recently made headlines: they received Starlink in September 2023. Within 9 months, their youth stopped learning traditional body paint and jewelry making, young men began sharing pornographic videos in group chats (in a culture that frowns on public kissing), leaders observed "more aggressive sexual behavior," and children became addicted to short-form video content. One leader reported: "Everyone is so connected that sometimes they don't even talk to their own family."

Marubo people carrying a Starlink antenna stopped for a break to eat papaya.

Cultural diversity is already experiencing a mass extinction at rates comparable to those of the biological Holocene extinction: up to 10,000 times the natural background rate. Yet unlike species loss, which triggers conservation efforts, cultural extinction largely remains unnoticed.

Approximately 50% of the world's population was not online 10 years ago. In 2010, only 30% of the global population used the internet, while by 2020, that number had doubled to 60%.

The mechanism is straightforward: providing internet access to previously offline populations overwhelms users with enticing, addictive, and viral stimuli. The entertainment is far more compelling than anything indigenous, pre-digital lifestyles can offer.

This cultural erosion has profound implications beyond the communities themselves: the internet that's replacing indigenous knowledge systems is also feeding the artificial intelligence that will shape humanity's future. Large language models learn from text scraped across the web, and if a culture doesn't exist online, it effectively doesn't exist to AI. Every traditional practice abandoned for TikTok, every oral tradition left unrecorded, every indigenous language that goes unwritten represents not just a loss to those communities, but a permanent gap in the knowledge systems that will guide our AI-mediated world. The internet homogenizes human culture toward whatever content generates the most engagement, and AI systems trained on that same internet amplify those patterns further. The Marubo children who stopped learning traditional body painting aren't just losing their heritage; they're ensuring that AI systems will never learn to think in ways their ancestors did. What survives digitization becomes the foundation for machine intelligence; what doesn't survive gets erased from our computational future.

A North Korean soldier of the Ukrainian War (2025) lies in his barracks, scrolling.

The mechanism driving this convergence has deep historical precedent, and is an unsurprising consequence of how power operates in the digital age. The policing power of any central cultural authority weakens with distance. A Chinese proverb from the Yuan dynasty applies: "Heaven is high and the emperor is far away." Historically, in times of totalitarian oppression, this distance enabled local customs to persist. In the post-industrial age, that protection no longer exists. The proverbial emperor is no longer a despot who requires legislative & coercive mechanisms to homogenize culture. The emperor has been supplanted by market forces and economic competition, which carry out this responsibility instead.

Mature capitalist economies, such as the United States, require firms to pursue global user acquisition to survive: firms must constantly expand their customer base to satisfy growth expectations and outcompete rivals. This imperative drives firms to pursue a global audience while delivering widely appealing products that transcend cultural boundaries. Through algorithmically delivered content, engineers develop low-friction behavioral pathways that reshape cultural habits, social practices, and leisure activities towards addiction.

Cultural diversity, much like biological diversity, preserves an irreplaceable and robust repository of solutions to ancient problems. Darwinian evolution has operated as a massively parallel search algorithm for hundreds of millions of years, testing chemical and behavioral solutions to ecological problems. Claude Lévi-Strauss, the renowned anthropologist, held culture to be "a unique experiment in organizing human life," one that encodes generations of knowledge systems adapted to localized environments. As with biodiversity, such cultural solutions cannot be easily discovered, let alone recreated, once lost.

This embodied wisdom, refined through generations of trial and error, often reveals natural phenomena that laboratory science would never think to investigate. For every cultural extinction, we lose not just traditions, but entire systems of observation and experimentation that could unlock medical and scientific breakthroughs.

Tectitethya crypta

Recently, I had the privilege of speaking with Ayah Bdeir, a Lebanese entrepreneur and activist currently leading AI strategy at the Mozilla Foundation. She is reducing her Mozilla responsibilities to pursue what she sees as an existential challenge: making the large-scale digitization of Arab cultural media, knowledge, and lifestyles economical. Her rationale was straightforward: "The Arab world is not in the training data." She explained how this absence manifests at every level, from the technical (Arabic OCR remains largely unresolved, making centuries of texts inaccessible) to the geopolitical (the Gulf states pouring billions into sovereign AI). Without intervention, she argued, AI will learn to think exclusively through a Western lens, while a fifth of humanity's historical frameworks vanish from our computational future.

Similarly, Rohan Pandey, a machine learning scientist, recently departed from OpenAI to address Sanskrit OCR, recognizing that millennia of Vedic texts (containing alternative philosophical frameworks, governance models, and problem-solving methodologies) exist neither in digital format nor in current training corpora. These initiatives matter beyond preservation; they constitute strategic interventions to incorporate cognitive diversity into artificial intelligence systems before technological monoculture becomes irreversible.

See, large language models are prediction engines. LLMs learn implicit value functions from their training data. Cultural homogenization is as epistemological as it is behavioral: when AI systems train exclusively on internet-scale English text, they inherit a biased ontology. Claude Opus models “believe” that problems have solutions, that progress is linear, and that optimization is inherently good. The majority of human knowledge was developed under different assumptions. (For example, Islamic jurisprudence uses analogical reasoning, Qiyas, which treats precedent as living wisdom rather than fixed law.) Now, as far as we are aware, language models cannot hold conscious “beliefs” per se, but they do hold statistical patterns that function as such. In the training process, language models operate like anthropologists in reverse: they must reconstruct entire worldviews from textual fragments. When indigenous cultural frameworks have no representation in the training data, models literally cannot develop the neural infrastructure to reconstruct those ways of thinking.

Including Vedic literature in the training data (or Arabic philosophy, or indigenous texts) fundamentally changes what the model treats as valid reasoning. The Bhagavad Gita presents duty-based ethics where Krishna argues for necessary violence; the Arthashastra details statecraft through deception; Tantric texts describe transformation through transgression. When texts from cultures built on these ideas make up a significant share of the training data, language models develop different priors about acceptable solutions. They might propose strategies that Western-trained models would never generate, not because those strategies are "wrong" but because they violate implicit constraints learned from a narrow philosophical canon.

We're watching the construction of a new priesthood (machine learning engineers who shape how future systems will think) while the training data that could provide alternate reasoning patterns remains unlabeled, untranslated, or simply absent. Pandey's work on Sanskrit OCR isn't about preservation; it's about preventing an intellectual monoculture where every AI system inherits the same blind spots. The Marubo didn't just lose their customs to Starlink; they lost their children's ability to think in ways that modernity cannot replicate. As one Marubo leader stated: "We can't live without the internet." The dependency crystallizes before understanding of the danger arrives.

We're writing a self-fulfilling prophecy: by training AI only on the worldviews that survived digitization, we ensure those become the only worldviews that can exist in our AI-mediated future. The models will complete our civilizational story the only way they know how: by extrapolating from the fragments we've given them. In doing so, we're eliminating entire branches of human problem-solving that took millennia to develop and might be impossible to rediscover.


Thank you Dunya Baradari, Cassandra Melax, & Izzy for helpful insights & review.


