Astral Codex Ten Podcast feed 2024年07月17日
Gwern's AI-Generated Poetry
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI发布的语言模型GPT-2经过Gwern的重新训练,学会了写诗!通过在117 MB的古腾堡诗歌语料库上进行训练,GPT-2能够准确地掌握诗歌的韵律和节奏,甚至能够写出完美的五步抑扬格诗。这说明了语言模型的学习能力,只要有足够的数据,就能掌握复杂的语言规律。

😄 GPT-2经过Gwern的重新训练,学会了写诗!Gwern使用117 MB的古腾堡诗歌语料库对GPT-2进行了专门的训练,使其能够理解诗歌的韵律和节奏。

😁 经过训练的GPT-2能够写出完美的五步抑扬格诗,这表明它已经掌握了诗歌的语言规律。它甚至能够使用一些罕见的词汇,例如“compeers”和“erst”,这些词在莎士比亚的时代很常见,但在现代英语中已经很少使用。

🤔 虽然GPT-2在押韵方面仍然存在一些问题,但它能够在一段较短的时间内保持押韵,这表明它已经开始理解押韵的规则。

😊 GPT-2的训练结果表明,语言模型的学习能力非常强大。只要有足够的数据,它就能掌握复杂的语言规律,并进行创造性的写作。

🤩 虽然GPT-2目前还无法完全替代人类诗人,但它的出现为诗歌创作带来了新的可能性,也为我们理解语言和创造力提供了新的视角。

Gwern has answered my prayers and taught GPT-2 poetry.

GPT-2 is the language processing system that OpenAI announced a few weeks ago. They are keeping the full version secret, but have released a smaller prototype version. Gwern retrained it on the Gutenberg Poetry Corpus, a 117 MB collection of pre-1923 English poetry, to create a specialized poetry AI.

I previously tested the out-of-the-box version of GPT-2 and couldn’t make it understand rhyme and meter. I wrongly assumed this was a fundamental limitation: “obviously something that has never heard sound can’t derive these complex rhythms just from meaningless strings of letters.” I was wrong; it just didn’t have enough training data. Gwern’s retrained version gets both of these right, and more too. For example:

Thou know’st how Menoetiades the swift Was dragged, of Hector and the fierce compeers And Phrygian warriors. So, we will dispatch Your bodies, then, yourselves to burn the ships In sacrifice; with torches and with bells To burn them, and with oxen to replace Your gallant friends for ever. But I wish That no man living has so long endured The onset of his foes, as I have power To burn or storm; for mighty Hector erst Was slain, and now returns his safe return

This is all perfect iambic pentameter. I know AP English students who can’t write iambic pentameter as competently as this.

(by the way, both “compeers” and “erst” are perfectly cromulent words from the period when people wrote poems like this; both show up in Shelley)

It has more trouble with rhymes – my guess is a lot of the poetry it was trained on was blank verse. But when it decides it should be rhyming, it can keep it up for a little while. From its Elegy Written in a Country Churchyardfanfic:

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

GPT-2 诗歌 人工智能 语言模型
相关文章