Fortune | FORTUNE 07月20日 19:40
Experienced software developers assumed AI would save them a chunk of time. But in one experiment, their tasks took 20% longer
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

一项研究发现,在软件开发任务中使用AI工具反而减缓了开发者的工作速度,与预期相反。16名平均拥有五年经验的软件开发者在执行246项任务时,使用AI工具(如Cursor Pro或Claude 3.5/3.7 Sonnet)的任务耗时比不使用AI的对照组平均增加了19%。开发者需要花费大量时间调整AI生成的代码以适应项目需求,并调试输出结果。此外,撰写AI提示和等待AI生成结果也占用了不少时间。研究作者认为,AI在实际应用中,尤其是在有经验的专业人士手中,其生产力提升效果可能不如预期,并强调在推广AI应用前需要更多高质量的数据和审慎的考量,以避免盲目自动化带来的时间和资源浪费。

📊 **AI工具的实际应用效果出乎意料**:一项针对16名有经验软件开发者的实验显示,使用AI工具(如Cursor Pro、Claude 3.5/3.7 Sonnet)完成任务,反而比独立完成任务的平均耗时增加了19%,颠覆了AI能显著提升效率的普遍认知。

🛠️ **代码适配与调试消耗大量时间**:开发者在使用AI生成的代码时,需要花费大量时间进行清理和调试,以确保其能够无缝集成到现有项目中,这抵消了AI可能带来的部分效率提升。

⏳ **其他因素也影响AI效率**:除了代码调整,撰写详细的AI提示以及等待AI生成结果也占用了开发者的宝贵时间,进一步降低了AI工具在实际工作流程中的效率。

💡 **AI对熟练劳动力的边际效益递减**:研究结果与AI能大幅提升经济和劳动生产力的宏大论调形成对比,尤其对于经验丰富的专业人士,AI工具的辅助作用可能不如预期,甚至可能降低其原有工作效率。

cautious **审慎评估AI的适用性**:研究作者和经济学家呼吁,在快速推广AI应用时应保持谨慎,强调AI的实际效益需要组织层面的调整、互补性投资以及对员工技能的提升来配合,而不能仅凭技术本身来驱动生产力增长。

It’s like a new telling of the “Tortoise and the Hare”: A group of experienced software engineers entered into an experiment where they were tasked with completing some of their work with the help of AI tools. Thinking like the speedy hare, the developers expected AI to expedite their work and increase productivity. Instead, the technology slowed them down more. The AI-free tortoise approach, in the context of the experiment, would have been faster. 

The results of this experiment, published in a study this month, came as a surprise to the software developers tasked with using AI—and to the study’s authors, Joel Becker and Nate Rush, technical staff members of nonprofit technology research organization Model Evaluation and Threat Research (METR).

The researchers enlisted 16 software developers, who had an average of five years of experience, to conduct 246 tasks, each one a part of projects on which they were already working. For half the tasks, the developers were allowed to use AI tools—most of them selected code editor Cursor Pro or Claude 3.5/3.7 Sonnet—and for the other half, the developers conducted the tasks on their own.

Believing the AI tools would make them more productive, the software developers predicted the technology would reduce their task completion time by an average of 24%. Instead, AI resulted in their task time ballooning to 19% greater than when they weren’t using the technology.

“While I like to believe that my productivity didn’t suffer while using AI for my tasks, it’s not unlikely that it might not have helped me as much as I anticipated or maybe even hampered my efforts,” Philipp Burckhardt, a participant in the study, wrote in a blog post about his experience.

Why AI is slowing some workers down

So where did the hares veer off the path? The experienced developers, in the midst of their own projects, likely approached their work with plenty of additional context their AI assistants did not have, meaning they had to retrofit their own agenda and problem-solving strategies into the AI’s outputs, which they also spent ample time debugging, according to the study. 

“The majority of developers who participated in the study noted that even when they get AI outputs that are generally useful to them—and speak to the fact that AI generally can often do bits of very impressive work, or sort of very impressive work—these developers have to spend a lot of time cleaning up the resulting code to make it actually fit for the project,” study author Rush told Fortune.

Other developers lost time writing prompts for the chatbots or waiting around for the AI to generate results.

The results of the study contradict lofty promises about AI’s ability to transform the economy and workforce, including a 15% boost to U.S. GDP by 2035 and eventually a 25% increase in productivity

But Rush and Becker have shied away from making sweeping claims about what the results of the study mean for the future of AI.

For one, the study’s sample was small and non-generalizable, including only a specialized group of people to whom these AI tools were brand new. The study also measures technology at a specific moment in time, the authors said, not ruling out the possibility that AI tools could be developed in the future that would indeed help developers enhance their workflow.

The purpose of the study was, broadly speaking, to pump the brakes on the torrid implementation of AI in the workplace and elsewhere, acknowledging more data about AI’s actual effects need to be made known and accessible before more decisions are made about its applications.

“Some of the decisions we’re making right now around development and deployment of these systems are potentially very high consequence,” Rush said. “If we’re going to do that, let’s not just take the obvious answer. Let’s make high-quality measurements.”

AI’s broader impact on productivity

Economists have already asserted that METR’s research aligns with broader narratives on AI and productivity. While AI is beginning to chip away at entry-level positions, according to LinkedIn chief economic opportunity officer Aneesh Raman, it may offer diminishing returns for skilled workers such as experienced software developers.

“For those people who have already had 20 years, or in this specific example, five years of experience, maybe it’s not their main task that we should look for and force them to start using these tools if they’re already well functioning in the job with their existing work methods,” Anders Humlum, an assistant professor of economics at the University of Chicago’s Booth School of Business, told Fortune.

Humlum has similarly conducted research on AI’s impact on productivity. He found in a working study from May that among 25,000 workers in 7,000 workplaces in Denmark—a country with similar AI uptake as the U.S.—productivity improved a modest 3% among employees using the tools. 

Humlum’s research supports MIT economist and Nobel laureate Daron Acemoglu’s assertion that markets have overestimated productivity gains from AI. Acemoglu argues only 4.6% of tasks within the U.S. economy will be made more efficient with AI.

“In a rush to automate everything, even the processes that shouldn’t be automated, businesses will waste time and energy and will not get any of the productivity benefits that are promised,” Acemoglu previously wrote for Fortune. “The hard truth is that getting productivity gains from any technology requires organizational adjustment, a range of complementary investments, and improvements in worker skills, via training and on-the-job learning.”

The case of the software developers’ hampered productivity points to this need for critical thought on when AI tools are implemented, Humlum said. While previous research on AI productivity has looked at self-reported data or specific and contained tasks, data on challenges from skilled workers using the technology complicate the picture.

“In the real world, many tasks are not as easy as just typing into ChatGPT,” Humlum said. “Many experts have a lot of experience [they’ve] accumulated that is highly beneficial, and we should not just ignore that and give up on that valuable expertise that has been accumulated.”

“I would just take this as a good reminder to be very cautious about when to use these tools,” he added.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI工具 软件开发 生产力 技术应用 开发者实验
相关文章