Published on June 3, 2025 5:38 PM GMT

I've recently completed the in-person ARENA program, which is a 5-week bootcamp teaching the basics of safety research engineering (with the 5th week being a capstone project). Sometimes, I talk to people who want to work through the program independently and who ask for advice. Even though I didn't attempt this, I think doing the program in-person gives me some insight into how to get most out of the program when doing it independently, so here are my thoughts and tips:

On working speed

Day 0.0 (prerequisites)

mostly

If you don't find a working partner, then working deeply for 6-7 hours per day might be infeasible depending on your dispositions. In-person it gets feasible since you alternate with your pair-programming partner, which reduces the overall load on your attention.Often when you struggle in-person, your working partner knows how to move on. If both partners don't know what to do, you can ask a teaching assistant (TA). So you should expect to struggle more often, and for longer, if you're alone.

substantially

week 1 on transformer interpretability

day on SAEs alone

Week 1 on transformer interpretability

Should you do all the material?

If you're working independently, the answer is probably no. Probably not all material is equally valuable to everyone. In an in-person program it makes sense to have everyone work through the same material at the same pace since otherwise it becomes difficult to pair people up for pair-programming and for TAs to prepare for questions. But if you're alone, you probably want to skip more material that isn't as interesting to you personally.

How should you approach each day?

I wouldn't recommend spending much time on actually reading

^[1]

precisely that piece

nothing conceptually interesting about doing it

mostly ignore the difficulty ratings

Which days/sections are valuable to do?

This is very subjective, but here I'll give you my assessment of which days or sections in days are valuable to do. Probably other people's opinions will differ. Also, note that the program keeps evolving, so it's possible that my opinions and advice are out-of-date once you read this post.

Week 0: Fundamentals:

[0.0] Prerequisites

The VSCode section contains lots of things you could try to learn. I think you can mostly move on once you manage to run codes as cells.Numpy, einops, and einsum are all important, but in total, there is much more material linked and provided than you need for starting the program. It's not necessary to know every detail before moving on.

[0.1] Ray Tracing

[0.2] CNNs & ResNets

^[2]

[0.3] Optimization

The optimizers section is interesting, and I found it valuable to see that e.g. the Adam optimizer can be implemented in just a few lines of code. It's not strictly necessary to do it since one can largely take optimizers as blackboxes, so you could also skip this. Weights and biases (wandb): Do it, but simply look at the solutions of the exercises if you don't know how to do it. Distributed training: I recommend skipping this section, it's not very well-explained, and the rest of the material can be done on just one GPU.

[0.4] Backprop:

[0.5] VAEs and GANs

Week 1: Transformer Interpretability

[1.1] Transformers from scratch

[1.2] Intro to MechInterp

Toy models of superposition

OthelloGPT

Week 2: Reinforcement Learning

[2.1] Intro to RL:

[2.2] Q-Learning and DQN

[2.3] PPO

[2.4] RLHF

Week 3: LLM Evaluations

[3.1] Intro to Evals

[3.2] Dataset Generation

[3.3] Running Evals with Inspect

[3.4] LLM Agents

What is missing?

There is some material that would be useful to learn but which is missing from the ARENA material. I hope some of this material will be added in the future.

Most importantly and maybe surprisingly, while the program is called "Alignment Research Engineering Accelerator", there is actually almost no content on how to align AI systems (except the policy-optimization stage of RLHF). There is no content on scalable oversight.

Should you work through the material alone?

I don't know, it depends on your ability to motivate yourself to work through a large amount of material on your own, and also on your prior skills. Here is a somewhat cheap test:

Work through enough of the prerequisites (day [0.0]) to be able to move on.Do day [0.2] on CNNs and ResNets. If you can do this day, you probably can work through the material on your own. To be sure:Do day [1.1] on transformers.

This should be a reasonably cheap test. Notice how long you need to complete each of the days (remembering that [0.0] likely takes more than a day) and use this to assess whether it's worth it for you to work through a larger chunk of the material alone. If you want to do the program in-person instead, express your interest for the next iteration.

^{^}
One caveat here is that I was already somewhat familiar with much of the content on a conceptual/theoretical level. It's possible that other people gain more from actually reading the provided material.
^{^}
When I say "completely" here or later, I mean everything except the bonus material.

Discuss

On working speed

Should you do all the material?

How should you approach each day?

Which days/sections are valuable to do?

What is missing?

Should you work through the material alone?

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签