TechCrunch News 03月04日
Podcasting platform Podcastle launches a text-to-speech model with more than 450 AI voices
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Podcastle发布了其AI语音模型Asyncflow v1.0,加入了AI语音合成的竞争。该模型提供超过450种AI声音,可以将文本转换为语音,并通过API提供给开发者集成到应用程序中。Podcastle声称其训练和推理成本较低,具有竞争优势。同时,Podcastle还升级了语音克隆功能,只需几秒钟的录音即可创建语音克隆。Podcastle将音频、视频、播客和AI语音合成工具整合到一个平台,旨在超越竞争对手。

🚀 Asyncflow v1.0模型发布:Podcastle推出了Asyncflow v1.0,一款AI驱动的文本转语音模型,标志着该公司正式进军AI语音合成领域,与ElevenLabs、Speechify等公司展开竞争。

🎤 450+ AI声音选择:Asyncflow v1.0提供超过450种AI声音,用户可以将文本转换为由AI朗读的语音片段,适用于营销、广告、内容创作、教育和企业培训等多种场景。

💰 成本优势显著:Podcastle声称其AI模型的训练和推理成本较低,使其在价格上具有优势。例如,Podcastle的文本转语音服务收费为每500分钟40美元,而ElevenLabs的收费为每500分钟99美元。

✨ 语音克隆功能升级:Podcastle改进了语音克隆功能,将训练过程从大约70个不同的句子缩短到只需几秒钟的录音,并利用Magic Dust AI技术来提高音频录制质量。

Podcast recording and editing platform odcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0. An API for developers will also be available, allowing them to directly integrate the text-to-speech model in their apps.

Thanks to the new model, the company is able to offer more than 450 AI voices that can narrate your text. The startup said that it developed the technology and model in such a way that its training and inference costs are low, giving it an advantage against competitors.

With the move, Podcastle joins a number of startups, including ElevenLabs, Speechify, and WellSaid, that have developed technology and AI models to convert any kind of text into a voice clip narrated by AI. This technology spans use cases like marketing, advertisement, content creation, education, and corporate training.

Podcastle’s founder, Arto Yeritsyan, told TechCrunch that the company had always wanted to build a text-to-speech model, but the cost of training and data requirements for that were very high.

“We wanted to build a robust text-to-speech model since our inception. However, the costs of development were very high. Thanks to recent large language model developments, we were able to reach a breakthrough last year to get to a place where we could build a high-quality voice model without needing a ton of data,” Yeritsyan said.

The company was also aided in its efforts by its $13.5 million Series A fundraise last year.

Yeritsyan said that while Podcastle charges around $40 per 500 minutes of text-to-speech conversion, ElevenLabs charges $99 for the same.

Podcastle’s voice cloning feature is getting an upgrade, as well, to create a quicker process for training.

Earlier, the training process involved reading roughly 70 different sentences. Now, it just needs a few seconds of recording from you to create a clone of your voice. The new process also used Podcastle’s Magic Dust AI, which was released last year, to improve audio recording quality.

Image Credits: Podcastle

In our testing, the voice created with the new process sounded a bit robotic, though it mimicked our tone. The company said that, over time, it will improve the feature. Plus, you can train different samples of your voice to get different results.

Podcastle said that apart from costs, having tools for audio, video, podcasts, and AI-powered narration under one redesigned site will give it an edge over competitors. Yeritsyan said that while the majority of the users use Podcastle to work on audio content, video is catching up to it as well.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Podcastle Asyncflow v1.0 AI语音合成 语音克隆
相关文章