voicebot 2024年10月02日
Stability AI Shares Open-Source Generative AI Audio Model for Creative Sound Design
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Stability AI发布开源生成式AI音频模型Stable Audio Open,该模型可根据文本提示生成短音频样本,包括音效和制作元素,适用于音乐制作和声音设计等领域,具有广泛的应用潜力。

🎧Stable Audio Open是针对音乐家和声音工程师的开源生成式AI模型,能依据文本提示生成短音频,如音效、制作元素等,可生成高质量音频,最长达47秒。

🎵此模型在音乐制作和声音设计中作用显著,可创造如鼓点、乐器即兴重复段、环境音和拟音等各种音频元素,且用户可用自己的音频数据对模型进行微调。

💡Stable Audio Open应用广泛,在影视、音乐、游戏等领域都能发挥作用,如为特定场景快速生成定制音效,让音乐人尝试新声音等,还可通过在特定音频数据上训练来生成独特内容。

🌐该模型基于FreeSound和Free Music Archive的音频数据进行训练,尊重原作者权利,其模型权重可在Hugging Face上获取,且与Stability AI的商业产品Stable Audio有所区别。


Synthetic media startup Stability AI has unveiled an open-source generative AI model aimed at musicians and sound engineers. The new Stable Audio Open model can produce short audio samples based on text prompts, including sound effects and production elements.

Stable Audio Open

Stable Audio Open can generate up to 47 seconds of high-quality audio from a given text prompt. Stability pitches this capability as particularly useful for creating various audio elements such as drum beats, instrument riffs, ambient sounds, and foley recordings, which are essential in music production and sound design. Stable Audio Open has some flexibility in its final product, allowing users to fine-tune the model with their own audio data. The company suggested a drummer could train the model on samples of their drum recordings to generate new and unique beats. This adaptability opens up a myriad of possibilities for users to customize the audio output to better suit their specific needs and preferences.

The potential applications of Stable Audio Open are vast. In film and television, for instance, sound designers can quickly generate bespoke sound effects tailored to specific scenes. Musicians and producers can use the tool to experiment with new sounds and incorporate unique audio elements into their compositions. Even in gaming, developers can create dynamic audio effects that enhance the immersive experience for players.

“This release marks a key milestone as we further open portions of our generative audio capabilities to empower sound designers, musicians and creative communities,” Stability AI wrote in a blog post. “We encourage sound designers, musicians, developers and audio enthusiasts to download the model, explore its capabilities and provide feedback. While an exciting step forward, this is still just the beginning for open and responsible audio generation capabilities. We look forward to continuing research and prioritizing development hand-in-hand with creative communities.”

Stability AI trained Stable Audio Open on audio data from FreeSound and the Free Music Archive, which allowed the creation of an open audio model that respects the rights of original creators. The Stable Audio Open model weights are available on Hugging Face. By comparison, Stability AI’s commercial product, Stable Audio, can produce high-quality, coherent musical tracks up to three minutes long and includes other features like audio-to-audio generation. Stable Audio Open is tailored more for shorter audio clips and sound effects. While it is capable of generating brief musical segments, it is not optimized for full songs, melodies, or vocals. This distinction ensures that Stable Audio Open remains focused on providing tools for sound design rather than complete musical compositions.

You can hear a couple of samples below. The first clip was made from the prompt: Blackbird song, summer, dusk in the forest.

The model generated the second audio clip from the prompt: Rock beat played in a treated studio, session drumming on an acoustic kit.

Stability AI Releases Augmented Text-to-Music Engine Stable Audio 2 With Upload and Style Transfer Features

Generative AI Music Startup Suno Raises $125M

Microsoft Copilot Adds Generative AI Music Engine

The post Stability AI Shares Open-Source Generative AI Audio Model for Creative Sound Design appeared first on Voicebot.ai.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Stability AI Stable Audio Open 音频生成 声音设计
相关文章