少点错误 2024年07月15日
Series on Artificial Wisdom
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文探讨了“人工智慧(AW)”的概念,并展示了多种设计人工智慧系统的方法。人工智慧指的是能够显著提升世界智慧的人工智能系统。智慧可以定义为“能够有效避免大规模错误的思考/计划”,或者“拥有良好的最终目标和子目标”。通过将智慧与人工智能结合,我们或许能够在人工智能快速发展的过程中生成大量的智慧,这将帮助我们明智地应对转型人工智能和“最重要的世纪”。

🤔 **人工智慧(AW)**是指能够显著提升世界智慧的人工智能系统。智慧可以定义为“能够有效避免大规模错误的思考/计划”,或者“拥有良好的最终目标和子目标”。

🤖 **人工智慧设计**:本文介绍了三种设计人工智慧系统的方法,包括: - **智慧型工作流程研究组织**:通过构建一个专注于人工智能安全、存在风险、长期主义和人工智慧的研究组织,并利用工作流程来提高该组织的效率,从而逐步提升人工智慧。 - **GitWise 和 AlphaWise**:创建类似维基百科或 GitHub 的去中心化系统,用于收集和分享提升智慧的用例,并利用这些用例训练人工智能模型,使其能够充当智慧教练,帮助用户做出明智的决策。 - **决策预测 AI 和 Futarchy**:利用先进的预测人工智能来帮助人类做出更好的决策,并能够准确预测行动带来灾难性风险的微小可能性,从而避免潜在的风险。

💡 **人工智慧的应用**:人工智慧系统可以通过帮助人类避免大规模错误、提升决策能力、预测未来等方式,为人类社会带来积极的改变。

🚀 **人工智慧的发展方向**:未来的研究方向包括: - 进一步提高人工智慧系统的效率和安全性; - 探索人工智慧在不同领域的应用,例如医疗保健、教育、金融等; - 关注人工智慧伦理问题,确保其发展符合人类利益。

Published on July 15, 2024 1:11 AM GMT

This series introduces Artificial Wisdom (AW) and illustrates several designs for AW systems. Artificial Wisdom refers to artificial intelligence systems which substantially increase wisdom in the world. Wisdom may be defined as "thinking/planning which is good at avoiding large-scale errors," or as “having good terminal goals and sub-goals.” By “strapping” wisdom to AI via AW as AI takes off, we may be able to generate enormous quantities of wisdom which could help us navigate Transformative AI and The Most Important Century wisely.

On Artificial Wisdom

In the first post in the series I introduce the term “Artificial Wisdom (AW),” which refers to artificial intelligence systems which substantially increase wisdom in the world. Wisdom may be defined as "thinking/planning which is good at avoiding large-scale errors," including both errors of commission and errors of omission; or as “having good goals” including terminal goals and sub-goals.

Due to orthogonality, it is possible we could keep AI under control and yet use it very unwisely. Four scenarios are discussed on how AI alignment interacts with artificial wisdom, with artificial wisdom being an improvement on any world, unless pursuit of AW significantly detracts from alignment, causing it to fail.

By “strapping” wisdom to AI via AW as AI takes off, we may be able to generate enormous quantities of wisdom in both humans an autonomous AI systems which could help us navigate Transformative AI and "The Most Important Century" wisely, in order to achieve existential security and navigate toward a positive long-term future.

Designing Artificial Wisdom: The Wise Workflow Research Organization

Even simple workflows can greatly enhance the performance of LLM’s, so artificially wise workflows seem like a promising candidate for greatly increasing AW. 

This piece outlines the idea of introducing workflows into a research organization which works on various topics related to AI Safety, existential risk & existential security, longtermism, and artificial wisdom. Such an organization could make progressing the field of artificial wisdom one of their primary goals, and as workflows become more powerful they could automate an increasing fraction of work within the organization. 

Essentially, the research organization, whose goal is to increase human wisdom around existential risk, acts as scaffolding on which to bootstrap artificial wisdom. 

Such a system would be unusually interpretable since all reasoning is done in natural language except that of the base model. When the organization develops improved ideas about existential security factors and projects to achieve these factors, they could themselves incubate these projects, or pass them on to incubators to make sure the wisdom does not go to waste.

Designing Artificial Wisdom: GitWise and AlphaWise

Artificially wise coaches that improve human wisdom seem like another promising path to AW. Such coaches could have negligible costs, be scalable & personalized, and soon perform at a superhuman level. Certain critical humans receiving wise coaching could be decisive in humans navigating transformative AI wisely.

One path to AW coaches is by creating a decentralized system like a wiki or GitHub for wisdom-enhancing use-cases. Users could build up a database of instructions for LLM’s to act as AW coaches to help users make difficult decisions, navigate difficult life and epistemic dilemmas, work through values conflicts, achieve career goals, improve relational/mental/physical/emotional well-being, and increase fulfillment/happiness.

One especially wise use-case could be a premortem/postmortem bot that helps people, organizations, and governments to avoid large-scale errors.

Another path to creating an AW coach is to build a new system trained on biographical data, which analyses and learns to predict which decision-making processes and strategies of humans with various traits in various environments are most effective for achieving certain goals.

Designing Artificial Wisdom: Decision Forecasting AI & Futarchy

A final AW design involves using advanced forecasting AI to help humans make better decisions. Such a decision forecasting system could help individuals, organizations, and governments achieve their values while maintaining important side constraints and minimizing negative side effects.

An important feature to include in such AW systems is the ability to accurately forecast even minuscule probabilities of actions increasing the likelihood of catastrophic risks. The system could refuse to answer, attempt to persuade the user against such actions, and the analyses of such queries could be used to better understand the risk humanity is facing, and to formulate counter-strategies and defensive capabilities.

In addition to helping users select good strategies to achieve values or terminal goals, it is possible such systems could also learn to predict and help users understand what values and terminal goals will be satisfying once achieved.

While such technologies seem likely to be developed, it is questionable whether this is a good thing due to potential dual-use applications, for example use by misaligned AI agents; therefore, while it is good to use such capabilities wisely if they arise, it is important to do more research on whether differential technological development of such systems is desirable.



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

人工智慧 智慧 人工智能安全 存在风险 长期主义
相关文章