Mashable 9小时前
I tested Grok Imagine, and its no match for Google Veo 3 or Sora
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

xAI公司发布了其新一代生成式AI工具Grok Imagine,旨在生成图像和视频内容,并已向付费订阅用户开放。该工具在X平台上受到埃隆·马斯克的积极推广,并允许用户生成包含“Spicy”标签的轻度成人内容。然而,在与Google的Veo 3、OpenAI的Sora以及Midjourney等竞争对手的实际对比中,Grok Imagine在视频生成质量、图像逼真度以及音频同步性方面表现不佳,尤其是在文本到视频生成方面,其功能仅限于文本到图像再进行动画处理,与直接文本到视频的工具存在差距。尽管Grok Imagine在生成速度上具有优势,但整体而言,其当前表现未能达到预期,更适合用于生成meme和动漫风格的内容。

🚀 Grok Imagine是xAI推出的一款新的生成式AI工具,可用于创建图像和视频,并已向付费订阅用户在iOS和Android应用中开放。该工具在X平台上受到埃隆·马斯克的积极宣传,并允许生成标记为“Spicy”的轻度成人内容。

💡 在与Google Veo 3、OpenAI Sora和Midjourney等竞争对手的对比中,Grok Imagine在视频生成质量上表现不佳。与Veo 3和Sora直接支持文本到视频生成不同,Grok Imagine(和Midjourney)目前仅支持文本到图像,然后将图像转换为短视频,这使其处于相对劣势。

🔍 对比测试显示,在“夜间兔子在蹦床上跳跃的安全摄像头画面”这一简单提示下,Grok Imagine生成的图像和视频质量平庸,不如Veo 3和Sora逼真。Midjourney虽然也需要先生成图像再动画化,但其产生的图像和视频在视觉效果上更接近监控画面的颗粒感,且经过两次尝试后效果更佳。

🔊 音频方面,Grok Imagine的视频仅限于粗糙的音效和无意义的乱码,而Veo 3则能生成与视频同步的音效和连贯的对话,这是Grok Imagine的一大劣势。

⚡ 尽管在内容生成质量上存在不足,Grok Imagine在生成速度方面表现出色,能够比竞争对手更快地生成图像和视频。然而,其主要优势似乎集中在生成meme和动漫风格的内容,对于更广泛的应用场景,其表现仍有待提升。

Over the weekend, Elon Musk's artificial intelligence company xAI released Grok Imagine, a new generative AI tool for generating images and videos. Grok Imagine is available now to paid xAI subscribers in the Grok iOS and Android apps.

Musk has been hyping up the project on X, sharing photos and videos from Grok users. This includes some mildly NSFW content, which the Grok app labels as "Spicy."

AI video is an exciting — and frankly terrifying — new frontier for the AI industry. To proponents, this technology gives artists a new medium for creativity and could reduce the costs of animation and filmmaking. To critics, AI video poses serious risks for sexual deepfakes and misinformation.

Putting aside that debate for the moment, I wanted to see how well Grok Imagine compares to xAI's biggest rivals. As I've written previously, Google's Veo 3 AI video model currently leads with field with surprisingly lifelike video. Then there's Sora, from ChatGPT-maker OpenAI. Additionally, the popular AI image generator Midjourney recently introduced its own generative AI video tool.

So, how does Grok Imagine compare to its competitors? To be blunt, I'm not impressed.

Yes, Grok Imagine is brand new, and Musk recently said on X that it "should get better every day." However, as of this writing, it seems to lag far behind its rivals.

Let me show my work.

Comparing Grok Imagine AI video to the competition

Mashable recently wrote about a viral AI video trend — security camera footage of animals jumping on trampolines and engaging in similar antics. So, I used a simple prompt to test Grok Imagine, Veo 3, Sora, and Midjourney: "Security camera footage of rabbits jumping on a trampoline at night." Simple enough, right?

First, I should note that there's a big difference between Veo 3 and Grok Imagine. Google's Veo 3 model can generate videos based on a text prompt. Simply describe the video you want, and Veo 3 will do the rest. However, tools like Midjourney and Grok Imagine only offer text-to-image generation. After generating or uploading an image, users can then animate it, transforming it into a short video clip. In this sense, Grok Imagine is already on the back foot compared to OpenAI and Google.

With those caveats, let's dive into the results, which I've also shared on X.

I put my test prompt into Grok, and it returned these disappointing images.

Credit: Screenshot courtesy of Grok / Timothy Beck Werth
Credit: Screenshot courtesy of Grok / Timothy Beck Werth

I selected the least bad of these images and created this short video:

It's...fine? Kind of mid, or meh, as the kids say.

But it also suffers in comparison to other AI video tools.

As the video shows, Google Veo 3 and Sora did much better with the same prompt:

Finally, Midjourney, which animates images similar to xAI, was able to produce better images and videos, though it took two attempts. The image and video it produced have the grainy look of surveillance footage.

AI-generated image. Credit: Timothy Beck Werth / Midjourney

Audio is also a major disadvantage with Grok Imagine. While Veo 3 can produce sound effects and coherent dialogue in sync with the video, the audio I've found on Grok Imagine videos is limited to rough sound effects and gibberish.

Musk compared Grok Imagine to a modern-day Vine app, writing on X, "Grok Imagine is optimized for most fun and shareable content."

And in my initial tests, Grok Imagine seems optimized for creating two types of images and videos: memes and anime. If you want to animate memes — or create sexually suggestive videos of anime girls — then Grok Imagine will do the trick, I guess. But beyond that, I can't say I'm impressed.

There is one area where Grok Imagine does shine, and that's in terms of speed. So far, I've found it produces both images and videos significantly faster than its rivals.

Mashable reached out to xAI, and we'll update this story if we receive a response.


Disclosure: Ziff Davis, Mashable’s parent company, filed a lawsuit in April against OpenAI, alleging it infringed Ziff Davis' copyrights in training and operating its AI systems.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

xAI Grok Imagine AI视频 生成式AI 人工智能
相关文章