Mashable 15小时前
ChatGPT vs Claude vs Grok: Best vibe-coding platform for beginners
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章探讨了“vibe coding”的兴起,即利用AI以自然语言生成软件的潜力。作者以初学者视角,测试了GPT-5、Claude 4.1 Opus和Grok 4(Expert)在创建网页游戏、效率工具和iOS应用方面的表现。测试结果显示,AI在生成可用软件方面已取得显著进展,但距离“按需软件”的终极目标仍有差距。对于初学者而言,网页应用和游戏是更现实的选择,而iOS等原生应用开发则对新手不够友好,且需要更多专业知识和调试能力。文章详细对比了各AI工具在不同类型软件生成中的优劣,并对未来AI编程的发展前景进行了展望。

✨ AI辅助编程(Vibe Coding)的潜力在于让非程序员通过自然语言描述需求来创建软件,但目前仍无法完全替代专业编程,尤其是在复杂功能和用户体验方面。

🎮 在网页游戏开发方面,Claude在视觉效果上表现最佳,而ChatGPT在功能性上略胜一筹,Grok则在设计和体验上表现较弱,整体AI生成游戏已具备基本可玩性。

📝 效率工具(待办事项应用)的生成中,ChatGPT在功能实现(如任务完成、编辑、删除)和用户界面上表现最出色,Claude次之,Grok的功能性和界面则相对基础且有待改进。

📱 iOS应用开发是目前AI辅助编程的难点,仅Claude成功生成了可运行的应用,但功能实现(如进度图表、目标设定)仍有较大提升空间,且原生应用部署对新手门槛较高。

🚀 总体而言,AI在辅助初学者进行网页应用和游戏开发方面已展现出巨大潜力,但用户仍需谨慎选择,并认识到当前AI工具在处理复杂项目和原生应用开发上的局限性。

The rise of vibe coding is based on the promise of services like GPT-5: that in the future, you won’t have to know how to program at all in order to “create” software — you’ll just need to know how to communicate your software ideas in plain old English. The AI will do all the programming and debugging for you.

The ultimate goal: “software on demand,” or any kind of app, website, or visualization you can dream up, whenever you want it. But how close are we? If you’re totally new to coding and don’t want to learn how to code yourself, should you expect to get something usable? I set out to try some of the bigger consumer AI services to see how they perform. In short, I'm looking for the best vibe coding tools for beginners, not more advanced tools like Cursor or Windsurf.

For these tests, I used GPT-5, Claude 4.1 Opus, and Grok 4 (Expert). 

How I tested vibe coding

To see how well each of the different AI services performed when it came to coding, I asked each service to create a web-based game, a web-based productivity service, and an iOS app.

Just like when I tested the best AI image generators, I used the same prompts to test each model. Here are the vibe coding prompts I used:

My prompts test different kinds of software and services, as well as different kinds of information. The AI will need to build some kind of physics framework for the game, for example, and either find a pre-set list of quotes, or call an API to find quotes, in the productivity app.

I’m certainly no expert coder. I do have some basic web-development knowledge and can find my way around HTML and CSS files just fine, but that’s about where my experience in programming ends. I certainly won’t be able to assess how these services perform when it comes to code efficiency. I’m judging the end product based on how well those services follow my prompts. 

I've included links to the game and webpages these vibe coding tools created, but I won't ask you to download the app. You'll just have to take my word for it.

Vibe-coding a Web-based game

The game created by Claude. Credit: Screenshot: Christian de Looper

See Play the results:

All three services were able to create a working web game, with slightly different mechanics and additional touches. ChatGPT and Grok offered a game that included asteroids flying in from all directions and the ability to rotate the ship. Claude went for a Space Invaders-type set-up, with asteroids only entering from the top of the screen, and the ship pointed at the top. 

Still, Claude’s approach was the best-looking. There were bright, retro colors, a background with stars, and a cool movement trail behind the user. ChatGPT came in close second, and while there weren’t stars in the background or a sense of movement, I liked its minimalistic approach, which highlighted explosions. Grok’s game looked the worst — and played the worst — the design was boring and the asteroids were just gray circles.

The game created by Grok 4. Credit: Screenshot: Christian de Looper
The game created by ChatGPT. Credit: Screenshot: Christian de Looper

In the end I preferred the overall look of Claude’s game, but the functionality of ChatGPT’s. 

Winners: ChatGPT, Claude

Vibe-coding a Web-based app

The app created by ChatGPT. Credit: Screenshot: Christian de Looper

See the results:

All three services were able to create a productivity app that allowed the user to see the tasks on a calendar, and to see inspirational quotes with the press of a button. The basic interface was similar across all three of the services, too. They all opened to a list view by default, offering fields to type in a task and an option to select a date and time for the task.

However, the apps differed greatly in appearance. Grok was the most basic, once again. Its app looked like a Word document, with a few buttons and fields to enter text. Once again, ChatGPT was a significant step up in look, but not as flashy as Claude. It gave the inspirational quote a different background color, while Claude gave the whole interface a different color and used emoji to highlight times and dates.

The Grok productivity app. Credit: Screenshot: Christian de Looper
The Claude productivity app. Yes, that's a quote from Nelson Mandela. Credit: Screenshot: Christian de Looper

Differences in functionality were even greater. Grok's app was basically unusable; once a task was created, there was no way to remove it or check it off. Claude included a “delete” button next to tasks in the list view; clicking on a task in the calendar view generated a prompt asking if you wanted to delete a task.

ChatGPT, however, was the most functional in key areas. In the list view, it included a checkbox to mark a task as completed, along with a separate button to delete a task, and even a button to edit a task. It also included a “Notes” field for each task, allowing users to add more than just the task name and its due date. All of which for a much better to-do list, even if it wasn’t the flashiest. 

Winner: ChatGPT

Vibe-coding an iOS app

This one was…a little more challenging.

Why? Well, Claude is the only service that actually generated a working iOS app. Both ChatGPT and Grok created apps that failed to run in Apple's Xcode. The app that Claude created ran fine.

That said, Claude didn’t really go the extra mile.

The iOS productivity app created by Claude. Credit: Screenshot: Christian de Looper
The iOS productivity app created by Claude. Credit: Screenshot: Christian de Looper

Claude's app allowed me to enter workouts, then displayed them in a list view, allowing me to filter based on workout type, time they were logged, and so on. But it didn’t calculate calories burned for me (I had to enter that information just like I entered workout types and how long I worked out for). And its “progress” chart was just a bar graph that showed how many calories I burned during different workout types. There was no way to create or view goals.

To actually run an app on your iPhone that isn’t from the App Store, you have to jump through some hoops. This isn’t something beginners can do in a minute or two; they'd be better off sticking to web-based AI coding for now. 

Winner: Claude

The state of vibe coding

Two things are clear. Firstly, AI services are getting very good at helping non-coders or beginner coders create apps, services, and games. Secondly, we cannot yet dream up any piece of software and expect an AI service to spit out a flawless, working product. I don’t think we’re very far off, however.

App creation takes a little more than just generating a downloadable file. if you’re not interested in troubleshooting or learning any basics, I’d steer clear from trying to create actual iOS or Android apps. Web-only apps are preferable.

So what should you use to code if you’re a beginner? Well, if all you want to do is experiment with web apps and little games, both ChatGPT and Claude work well. Excluding the app creation test, ChatGPT performed better. OpenAI’s GPT-5, being free to all, is easier to access than Claude Opus 4.1 — and for those who subscribe to each service’s respective $20-per-month plan, OpenAI imposes fewer limits. Still, don’t expect sub-par coding from Claude.

And to be fair to Grok, you’ll still likely get working websites if you use it — they’ll just be basic, boring, and may not stick as closely to your prompts. As with Grok Imagine, xAI has a long way to go to catch up with its rivals.


Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI编程 Vibe Coding ChatGPT Claude Grok 软件开发
相关文章