Society's Backend 2024年12月13日
A Fundamental Overview of Machine Learning Experimentation [Part 1]
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文深入探讨了机器学习实验的重要性,指出它是AI公司在竞争中脱颖而出的关键。与传统的软件开发不同,机器学习模型的改进依赖于实验而非设计。文章强调了机器学习实验的过程,包括提出假设、训练模型并验证结果。由于每个模型都需要多次训练和验证才能改进,因此机器学习成本高昂。文章还指出,大规模机器学习的重点在于实验,而不仅仅是训练。理解机器学习的实验过程对于在该领域工作至关重要,因为它揭示了机器学习系统与软件系统的差异,以及对工程师、科学家和用户的实际影响。

💡 机器学习模型的改进不是通过设计,而是通过实验过程实现的。研究人员提出关于如何改进模型的假设,然后通过训练和验证模型来测试这些假设。

🧪 机器学习实验过程类似于科学研究,需要反复进行训练和验证。每次实验都可能产生有利、部分有利或不利的结果,这需要进一步的假设和测试。

💸 机器学习的成本高昂,不仅因为训练模型的成本高,还因为迭代模型需要多次训练。在大规模应用中,实验而非仅仅是训练才是优化的重点。

I’ve wanted to write about machine learning experimentation velocity for a long time because it’s the key for AI companies to beat their competitors, but it’s rarely discussed. It’s what I work on at Google and it has very real implications for how impactful AI is in everyone’s life.

Before we can get into the building blocks of excellent machine learning experimentation, you need to understand how it fundamentally works. I’m going to write about velocity in two parts:

    This article will explain machine learning experimentation.

    The next article will explain the 3 key principles for successful machine learning experimentation including how AI companies outcompete at scale.

Make sure to subscribe to receive the second part of this article. You can also keep up with my work on X or LinkedIn if newsletters aren’t your thing.

Machine Learning Experimentation

If you’re reading this, I’m guessing you’re likely familiar with the process of software development. Software development requires three primary stages: design, review, and iteration. Those stages usually happen in that order, but iterating may require more designing and reviewing.

When designing software, there’s a good idea of how the system will behave and a straightforward process for ironing out kinks. Software can be made deterministic which makes this process easier. Engineers are familiar with this process and understand how it works and how to make it fast and effective.

This process is also important because the general public has become familiar with it. Their understanding dictates what they know is and isn’t possible to do to improve the software they use. This dictates their expectations for how software companies will improve services and a timeline for how long it will take them to do so.

Machine learning systems work differently. The usual iterative system for building software services breaks down when machine learning models are introduced. This means all the processes we use (and the public are familiar with) to build reliable software quickly are thrown out the window. I’ve written more in-depth about this in another article if this interests you.

The process for designing machine learning models is less like traditional software development and more akin to research science. There’s a reason those developing machine learning models are called machine learning research scientists—it’s because their job is less about engineering and more about experimenting.

Machine learning models don’t improve via a design process, they improve through an experimentation process. Those with a background in machine learning use their knowledge to propose hypotheses about how a model can be improved. They then run an experiment to validate their hypothesis.

This experiment includes training a model with new parameters/data/architecture (whatever hypothesis is being tested) and validating the outcome. This outcome can be favorable and show improvement, it can be partially favorable and require more testing, or it can be unfavorable in which case further hypotheses need to be tested.

At companies with many AI models, each model is improved via this experimentation process. That means for each model, many training runs are done and validated to improve model performance. Many hypotheses are tested requiring the ability to train each model many times.

This is why machine learning modeling is so expensive. It isn’t only because training a model is expensive, it’s because iterating upon that model requires training it many, many times. At scale, experimentation is the focus, not just training. I rarely see this discussed—it seems the focus of better machine learning is having more training chips when in actuality that’s just the beginning of optimizing machine learning at scale.

If you want to work in machine learning at the scale most companies do, it’s key to understand the experimentation process because it:

    Includes more than just training.

    Shows how machine learning systems differ from software systems and the very real impact that has on engineers, scientists, and users.

    Shows that machine learning is expensive not just because training is expensive but because training a single production model requires training it multiple times over.

Thanks for reading! Don’t forget to subscribe to get the next article in your inbox. If you liked this article and know someone else that would enjoy it, please share it with them!

Always be (machine) learning,

Logan

Share

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

机器学习 实验速度 AI竞争 模型训练 假设验证
相关文章