voicebot October 2, 2024
ElevenLabs Launches Generative AI Text-to-Sound-Effects Tool

 

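ElevenLabs has released a new tool that turns text prompts into sound effects. Users describe the sound they want, and the tool adds to the set of audio tools that help creators produce high-quality content. It is open to all users; free users have a monthly character limit and must credit the source. The model was trained on Shutterstock's audio library and has been tested by professionals. The new feature may increase demand for ElevenLabs' services.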

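🎙 ElevenLabs' new tool turns text prompts into sound effects. By describing the sound they want to hear, users can evoke any effect that best fits a video or audio track, including short instrumental pieces of music.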

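💻 The tool augments ElevenLabs' existing toolkit, which is known mainly for realistic voice cloning used for both benign and deceptive purposes. It is designed to help creators, including film and television studios, video game developers, and social media content creators, generate rich, immersive soundscapes quickly, affordably, and at scale.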

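📏 ElevenLabs offers the new sound effect tool to all users, with a monthly limit of 10,000 characters for free users. The company's usage page says one second of a sound effect is about 40 characters and a clip of the default duration uses 200 characters; at that rate, a user can produce about 50 sound effects per month. Free users must credit elevenlabs.io in anything published using the tool.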

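🎛 ElevenLabs trained its model on Shutterstock's audio library, which contains licensed tracks. Professionals including video game developers, film producers, social media content creators, and marketers tested the tool in an alpha phase, and their feedback helped refine it before release.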

Generative AI audio startup ElevenLabs has released a new tool that turns text prompts into sound effects. By describing the sound they want to hear, users can evoke any effect that would work best with an accompanying video or audio track, including short instrumental pieces of music. The new feature augments ElevenLabs’ existing toolkit, which is known mainly for realistic voice cloning used for both benign and deceptive purposes.


“In the last year, we revolutionized AI Voices by producing the first truly emotive, human-like Text to Speech platform. Text to Sound Effects marks another major step forward as we equip creators with all of the audio tools they need to produce high quality content,” ElevenLabs head of growth Sam Sklar explained in a blog post. “The tool has been designed to help creators—including film and television studios, video game developers, and social media content creators—to generate rich and immersive soundscapes quickly, affordably and at scale.”

ElevenLabs is offering the new sound effect creation tool to all of its users, with a monthly limit of 10,000 characters for those at the free level. According to the company’s usage page, each second of a sound effect counts as about 40 characters, and a clip of the default duration uses 200 characters, which works out to about five seconds. At that rate, free users could produce about 50 sound effects per month. The other caveat is that free-tier users have to credit elevenlabs.io for the sound in anything published using it.
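
The free-tier arithmetic can be checked with a short sketch. It assumes that the character cost grows linearly with clip length at the quoted rate of about 40 characters per second, which the article implies but does not state. The constant and function names are illustrative and are not part of any ElevenLabs API.

```python
# Figures quoted in the article. All names here are illustrative assumptions.
FREE_TIER_MONTHLY_CHARACTERS = 10_000
CHARACTERS_PER_SECOND = 40      # "about 40 characters" per second of audio
DEFAULT_CLIP_CHARACTERS = 200   # cost of a clip of the default duration


def clip_cost(seconds: float) -> int:
    """Characters used by a clip of the given length, assuming linear cost."""
    return round(seconds * CHARACTERS_PER_SECOND)


def clips_per_month(characters_per_clip: int,
                    quota: int = FREE_TIER_MONTHLY_CHARACTERS) -> int:
    """Number of whole clips that fit within the monthly character quota."""
    return quota // characters_per_clip


# Length in seconds of a default clip at the quoted rate.
default_seconds = DEFAULT_CLIP_CHARACTERS / CHARACTERS_PER_SECOND

print(default_seconds)                           # 5.0
print(clips_per_month(DEFAULT_CLIP_CHARACTERS))  # 50
print(clips_per_month(clip_cost(10)))            # 25
```

Under the same assumption, doubling the clip length to ten seconds would halve the monthly total to 25 clips.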

ElevenLabs utilized Shutterstock’s audio library, which contains licensed tracks, to train its model for this sound generation tool. The tool has already been tested in an alpha phase by various professionals, including video game developers, film producers, social media content creators, and marketers. This group of early adopters has provided valuable feedback, helping refine the tool before its release. The startup made a point of emphasizing that sound effects cannot violate the content and uses policy, though any audio guardrails will likely need regular updates.

“We’re excited to be partnering with ElevenLabs to fuel yet another significant innovation in AI, Text to Sound Effects, with our ethically-sourced data,” Shutterstock chief enterprise officer Aimee Egan said. “The combined power of our rich and immersive library of tracks and this cutting-edge audio technology has enabled the creation of a true market first. We’re thrilled by the positive feedback from the early access community and look forward to seeing the wide array of projects they will create.”

The new feature will likely only increase the demand for ElevenLabs’ services, which led to an $80 million funding round at the beginning of the year. The startup is already famous for its fidelity to real voices. Pakistan’s former Prime Minister Imran Khan used ElevenLabs to deliver speeches from prison, both during the campaign and in a victory speech. Robocalls to New Hampshire voters earlier this year used ElevenLabs to make a deepfake version of President Biden in an attempt to suppress turnout in the state’s primary election, which is against ElevenLabs’ own rules. An investigation traced the calls back to a telecom provider, Lingo, which transmitted them on behalf of Life Corporation. The FCC issued a cease and desist over the calls, followed by an outright ban on deepfake robocalls. That has prompted a rush to build deepfake detectors at companies like Pindrop, as well as internal detectors from ElevenLabs and others.

ElevenLabs Raises $80M And Shares Generative AI Voice Models, Tools and Deepfake Voice Marketplace

Synthesia and ElevenLabs Team Up to Augment Deepfake Videos With Generative AI Voice Models

ElevenLabs Releases Generative AI Voice Translation and Dubbing Tool

The post ElevenLabs Launches Generative AI Text-to-Sound-Effects Tool appeared first on Voicebot.ai.
