TechCrunch News, November 19, 2024
ElevenLabs now offers ability to build conversational AI agents

 

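ElevenLabs on Monday launched the ability to build conversational AI bots. Users create them on the company's developer platform and can customize a range of variables. The company previously focused on text-to-speech services and built a full pipeline to meet customer demand. Users can log in to their accounts to start building, and they can add a knowledge base. The company still has to develop speech-to-text capabilities and faces competition from other voice AI providers.

🎤 ElevenLabs provides AI voice cloning and a text-to-speech API, and now supports building conversational AI bots.

💻 Users can customize variables such as tone of voice and response length on the developer platform.

📄 Users can choose a language model, adjust the response temperature, add a knowledge base, and integrate a custom LLM.

🚀 ElevenLabs needs to develop speech-to-text capabilities; it faces competitive pressure but believes it has an edge.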

ElevenLabs, a startup that provides AI voice cloning and a text-to-speech API, launched the ability to build conversational AI bots on Monday.

The company announced that users can now build complete conversational agents on ElevenLabs’ developer platform, with customizable variables such as tone of voice and response length.

ElevenLabs has mostly worked on providing different voices and AI tools for text-to-speech services. The company’s head of growth, Sam Sklar, told TechCrunch that many of its clients were already using these tools to create conversational AI agents. However, the toughest parts were integrating the knowledge base and handling interruptions from customers. That’s why the company decided to build a full pipeline for conversational bots.

Users can log into their ElevenLabs account and start building a conversational agent by selecting a template or creating a new project. They can choose the agent’s primary language, first message, and system prompt to determine the agent’s persona. Developers also have to select a large language model (Gemini, GPT, or Claude), the temperature of responses (to determine how creative the responses should be), and a token usage limit.

They can also tune other aspects like voice, latency, stability, authentication criteria, and maximum length of conversation with the AI agent.
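As a rough illustration of the settings described above, the following Python sketch models an agent configuration as a plain data class with basic validation. Every name in it (`AgentConfig`, its fields, the default values, and the assumed 0 to 1 temperature range) is hypothetical and chosen for illustration; none of it is taken from the ElevenLabs SDK.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical sketch: these names are NOT the ElevenLabs SDK.
# The three model families are the ones named in the article.
SUPPORTED_LLM_FAMILIES = ("gemini", "gpt", "claude")


@dataclass
class AgentConfig:
    primary_language: str                # e.g. "en"
    first_message: str                   # what the agent says first
    system_prompt: str                   # defines the agent's persona
    llm_family: str                      # one of SUPPORTED_LLM_FAMILIES
    temperature: float = 0.5             # assumed range 0..1; higher = more creative
    max_tokens: int = 512                # assumed token usage limit
    max_conversation_seconds: int = 600  # assumed cap on conversation length

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the config looks sane."""
        problems = []
        if self.llm_family not in SUPPORTED_LLM_FAMILIES:
            problems.append(f"unknown LLM family: {self.llm_family}")
        if not 0.0 <= self.temperature <= 1.0:
            problems.append("temperature must be between 0 and 1")
        if self.max_tokens <= 0:
            problems.append("max_tokens must be positive")
        if self.max_conversation_seconds <= 0:
            problems.append("max_conversation_seconds must be positive")
        return problems


config = AgentConfig(
    primary_language="en",
    first_message="Hi, how can I help you today?",
    system_prompt="You are a friendly support agent.",
    llm_family="claude",
    temperature=0.3,
)
print(config.validate())
```

Keeping validation separate from construction lets a caller collect every problem at once rather than stopping at the first one.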

Users can add their own knowledge base, like a file, URL, or text block, to power the conversational bot. They can also integrate their own custom LLM with the bot. ElevenLabs’ SDK is compatible with Python, JavaScript, React, and Swift. The company also offers a WebSocket API for more customization.

Companies can also define criteria to collect certain data items — for instance, name and email of customers speaking to the agent — along with evaluation criteria in natural language to define the success or failure of the call.
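One way to picture this setup is a small specification that lists the data items to collect and the natural-language evaluation criteria, plus a pluggable judge that decides whether each criterion was met. The sketch below is an assumption about the general shape only: `CallAnalysisSpec`, `evaluate_call`, and `contains_email_judge` are hypothetical names, and the article does not say how ElevenLabs actually evaluates calls.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


# Hypothetical sketch: these names are NOT the ElevenLabs API.
@dataclass
class CallAnalysisSpec:
    # Data items to collect, mapped to a plain-language description.
    data_items: Dict[str, str]
    # Natural-language statements that define a successful call.
    evaluation_criteria: List[str]


def evaluate_call(
    spec: CallAnalysisSpec,
    transcript: str,
    judge: Callable[[str, str], bool],
) -> dict:
    """Ask the judge about each criterion; the call succeeds only if all pass."""
    results = {c: judge(c, transcript) for c in spec.evaluation_criteria}
    return {"criteria": results, "success": all(results.values())}


def contains_email_judge(criterion: str, transcript: str) -> bool:
    # Stand-in judge for demonstration: it ignores the criterion's wording
    # and only checks for an "@" sign. A real system would presumably hand
    # the criterion and the transcript to a language model instead.
    return "@" in transcript


spec = CallAnalysisSpec(
    data_items={"name": "Customer's full name", "email": "Customer's email"},
    evaluation_criteria=["The customer shared an email address."],
)
result = evaluate_call(spec, "Sure, it is ana@example.com.", contains_email_judge)
print(result["success"])
```

Passing the judge in as a function keeps the specification independent of whichever model or rule set ends up scoring the call.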

ElevenLabs is leveraging its existing pipeline for the text-to-speech part. The company has to develop speech-to-text capabilities for the new conversational AI product. The company is not offering its speech-to-text API as a standalone product as of now, but it might do that in the future, making it a competitor to Google’s, Microsoft’s, and Amazon’s speech-to-text APIs, as well as specialized APIs, such as OpenAI’s Whisper, AssemblyAI, Deepgram, Speechmatics, and Gladia.

The company, which is aiming to raise new funding at a valuation north of $3 billion, also competes with other voice AI startups, such as Vapi and Retell, which are also building conversational agents. More notably, the company will also rival OpenAI’s real-time conversational API. However, ElevenLabs believes that its customizations and ability to switch models will give it an edge over OpenAI.
