TechCrunch News 04月09日
Google’s enterprise cloud gets a music-generating AI model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

谷歌在Vertex AI云平台上发布了多款生成式AI模型的更新,重点面向企业市场。Lyria模型推出预览版,可根据文本生成音乐;Veo 2视频创作模型新增编辑和视觉效果定制选项;Chirp 3语音克隆功能正式发布,支持多种语言;Imagen 3图像生成器性能显著提升。这些更新旨在增强谷歌在生成式AI领域的竞争力,与亚马逊等公司竞争。谷歌还强调了对生成内容的保护措施,并提供了版权相关的保障。

🎶 Lyria,谷歌的文本到音乐模型,现已向部分客户开放预览。该模型能够创作多种风格和流派的音乐,如爵士钢琴独奏和低保真音乐。

🗣️ Chirp 3,谷歌的音频理解模型,驱动了Instant Custom Voice功能,可以在10秒的音频中克隆语音。该模型现已全面推出,并支持约35种语言。谷歌对Instant Custom Voice的使用进行了“尽职调查”,以防止滥用。

🎬 Veo 2视频创作模型新增了编辑功能,例如移除背景、标志和物体,扩展视频帧,调整摄像机角度和节奏,以创建延时摄影和无人机风格的视频剪辑。

🖼️ Imagen 3图像生成器进行了升级,提高了移除对象和重建图像缺失或损坏部分的能力。所有由Imagen、Veo和Lyria生成的媒体都将使用谷歌的SynthID技术进行水印处理。

🛡️ 谷歌强调其生成式AI模型具有“内置保护措施”,以防止有害内容的产生。谷歌还提供了模型训练的退出机制和赔偿政策,以保护谷歌云和Vertex AI客户免受与AI相关的版权纠纷。

On Wednesday, Google rolled out updates to several of its first-party media-generating AI models available through its Vertex AI cloud platform.

Lyria, Google’s text-to-music model, is now available in preview for select customers, and the company’s Veo 2 video creation model has been enhanced with new editing and visual effects customization options. The company has also launched a voice-cloning feature powered by Chirp 3, Google’s audio understanding model, for “allow-listed” users. And the Imagen 3 image generator now delivers what the company describes as “significantly” better performance.

The updates, timed for Cloud Next, are Google’s latest push to corner the enterprise market for generative AI. The company competes perhaps most directly with Amazon, which offers a comparable cloud AI platform called Bedrock with its own set of proprietary generative AI models.

Google is pitching Lyria as an alternative to royalty-free music libraries. Using the model, customers can create songs in a range of styles and genres, from jazzy piano solos to lo-fi tracks, the company said.

Chirp 3, meanwhile, can synthesize speech in around 35 languages. First previewed earlier this year, Chirp 3 drives Instant Custom Voice, which can supposedly clone a voice with 10 seconds of audio. It’s now generally available. This model also underpins a new tool launching in preview, called Transcription with Diarization, which separates and identifies speakers in recordings with multiple participants.

To prevent abuse, Instant Custom Voice is subject to a “diligence” process to verify “proper voice usage permissions,” says Google.

As for Veo 2, the model can now remove background images, logos, and objects from existing videos, and extend the frame of video footage (to convert landscape video into portrait, for example). It can also now adjust the camera angles and pacing in AI-generated scenes to create timelapses, drone-style clips, and more, and it can interpolate between specified beginning and end frames.

These Veo features are available in preview for now.

As for the aforementioned Imagen 3 upgrades, Google said they improve the model’s ability to remove objects and reconstruct missing or damaged portions of images.

All media generated by Imagen, Veo, and Lyria (but not Chirp) are watermarked using Google’s SynthID technology. The company said all its generative AI models have “built-in safeguards” to protect against the creation of harmful content.

Google hasn’t historically indicated which specific data it uses to train its models, and the tech giant stuck with that precedent today. Training data tends to be a controversial subject for IP-related reasons. Some firms train their models on copyrighted works without first obtaining permission from rights holders. While these companies claim that U.S. fair use doctrine shields the practice, some creators understandably disagree. Many are battling vendors in court.

Google has previously told TechCrunch that it offers opt-out mechanisms for model training as well as an indemnity policy to shield Google Cloud and Vertex AI customers from AI-related copyright disputes.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

谷歌 AI 生成式AI Lyria Veo 2 Imagen 3
相关文章