Published on July 23, 2024 1:45 AM GMT
This is my advice for careers in empirical ML research that might help AI safety (ML Safety). Other ways to improve AI safety, such as through AI governance and strategy, might be more impactful than ML safety research (I generally think they are). Skills can be complementary, so this advice might also help AI governance professionals build technical ML skills.
1. Career Advice
1.1 General Career Guides
- Preventing an AI-related catastrophe - 80,000 Hours
- A Survival Guide to a PhD (Andrej Karpathy)
- How to pursue a career in technical AI alignment — EA Forum
- AI safety technical research - Career review - 80,000 Hours
- Beneficial AI Research Career Advice
2. Upskilling
2.1 Fundamental AI Safety Knowledge
- AI Safety Fundamentals – BlueDot Impact
- AI Safety, Ethics, and Society Textbook
- Forming solid AI safety threat models helps you select impactful research ideas.
2.2 Speedrunning Technical Knowledge in 12 Hours
- Requires some basic coding, calculus, and linear algebra knowledge
- Build Intuition for ML (5h)
- Backpropagation, the foundation of deep learning (3h)
- Transformers and LLMs (4h)
2.3 How to Build Technical Skills
- Traditionally, people take a couple of deep learning classes.
- Stanford CS 224N | Natural Language Processing with Deep Learning (lecture videos)
- Practical Deep Learning for Coders - Practical Deep Learning (fast.ai)
- Syllabus | Intro to ML Safety
- Levelling Up in AI Safety Research Engineering [Public]
- ARENA
- Maybe also check out recent topical classes like this with public lecture recordings: CS 194/294-267 Understanding Large Language Models: Foundations and Safety
- You should aim to understand the fundamentals of ML through 1 or 2 classes and then practice doing many manageable research projects with talented collaborators or a good mentor who can make time to meet.
- It’s easy to keep taking classes, but you learn far more practical ML skills by doing real research projects.
- You can also replicate papers to build experience. Be sure to focus on key results rather than wasting time replicating many experiments.
- “One learns from books and reels only that certain things can be done. Actual learning requires that you do those things.” –Frank Herbert
- A friend didn’t study computer science but got into MATS 2023 with good AI risk takes. Then, they had GPT-4 write most of their code for experiments and did very well in their stream.
- Personally, GitHub Copilot and language model apps with code interpreters/artifacts write a significant fraction of my code.
- However, fundamental deep learning knowledge is still useful for making sound decisions about what experiments to run.
2.4 Math
- You don’t need much of it to do empirical ML research.
- Someone once told me, “You need the first chapter of a calculus textbook and the first 5 pages of a linear algebra textbook” to understand deep learning.
- You need more math for ML theory research, but theoretical research is not as popular right now.
- Beware mathification: authors often add unnecessary math to appease (or sometimes confuse) conference reviewers.
- If you don’t understand some mathematical notation in an empirical paper, you can often send a screenshot to an LLM chatbot for an explanation.
- Basic probability
- Very basics of multivariable calculus, like partial derivatives and the chain rule
- Matrix multiplication, matrix inverses, eigenvectors/eigenvalues, maybe a couple of decompositions
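To make the list above concrete, here is a minimal NumPy sketch (the setup and names are my own illustration, not from any resource linked above) showing where each piece shows up in practice: the chain rule gives the gradient of a tiny least-squares model, a matrix inverse solves the same problem in closed form, and the eigenvalues of X^T X describe the curvature of the loss.

```python
import numpy as np

# Chain rule / partial derivatives: gradient of a tiny linear model's loss.
# loss = mean((X @ w - y)**2), so d(loss)/dw = 2/n * X.T @ (X @ w - y).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w  # noiseless targets, so the exact solution is true_w

w = np.zeros(3)
for _ in range(500):
    grad = 2 / len(X) * X.T @ (X @ w - y)  # chain rule, written out by hand
    w -= 0.1 * grad                        # gradient descent step

# Matrix inverse: the same problem solved in closed form (normal equations).
w_closed = np.linalg.inv(X.T @ X) @ X.T @ y

# Eigenvalues: the loss curvature is governed by the eigenvalues of X.T @ X.
eigvals = np.linalg.eigvalsh(X.T @ X)

print(np.allclose(w, true_w, atol=1e-3))  # True: gradient descent converged
print(np.allclose(w_closed, true_w))      # True: matches the closed form
print((eigvals > 0).all())                # True: positive-definite curvature
```

This is roughly the full extent of the math most empirical deep learning work leans on day to day; autodiff frameworks handle the chain rule for you, but knowing it is what the framework is doing helps when debugging.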
3. Grad School
3.1 Why to Do It or Not
- Only do it if you have a good career growth reason (including credentials), an advisor you get along well with, and a solid idea of what research you’ll work on.
- Anything else, and you’ll likely waste a lot of time compared to alternative jobs you could get if you are at the level where you can get into ML grad school.
- Some schools do online CS M.S. degrees
3.2 How to Get In
- Beneficial AI Research Career Advice
- Machine Learning PhD Applications — Everything You Need to Know — Tim Dettmers
3.3 How to Do It Well
4. The ML Researcher Life
4.1 Striving for Greatness as a Researcher
- Hamming, "You and Your Research" (June 6, 1995)
- It contains a lot of mundane-sounding advice that many people just don’t have the discipline to follow.
- “It’s not hard to do; you just do it!”
- I listen to this every few months for inspiration and focus.
4.2 Research Skills
- General advice
- Tips for Empirical Alignment Research — AI Alignment Forum
- Dear future undergraduate researcher (Rose Wang)
- The cheaper you can make it to validate or invalidate a possibly good research idea, the more ideas you can test until you find something that works well.
- See Research as a Stochastic Decision Process for tips on prioritizing which ideas to try.
- See Touch reality as soon as possible (when doing machine learning research) — AI Alignment Forum for more motivation.
- So, just imitate what others have succeeded with in similar problems or subdomains.
- It forces you to have a coherent and concise story for your paper and makes paper writing more structured.
- You can draw fake plots as previsualization of experimental results to help communicate the point of an experiment, sync on presentation, and form hypotheses.
- You can share it with potential collaborators to quickly communicate the project.
- You get a jump start on crafting talks for your paper.
- Observation, Question, Hypothesis, Methods, Experiment, Analysis, Conclusions, Iterate
- Scientists don't use it enough. Be better.
- Don’t just run a bunch of experiments because you can. Iteratively ask pointed research questions and design experiments to best answer them to save time and write more meaningful papers.
- Preregistration should be much more normalized, and researchers should start practicing it early in their careers.
- Especially with the iterative nature of empirical ML research, it’s useful to write down
- Your priorities for the day
- Your hypotheses
- What you did
- Why you decided to do those particular things, especially why you decided to run certain experiments or test a specific change
- What results you got
- What those results mean
4.3 Research Taste
- Think ahead about the new AI shifts you expect to be coming and aim to work on research that will be relevant to the future.
- If you work on what’s hot now, you’re too late.
- I'm not sure what the right lead time to aim for is: too late and you’ll be chasing trends or working on soon-irrelevant topics; too early and you’ll be too ahead of your time.
- I’d guess aiming 6-12 months ahead is a good balance.
- If you have a good idea, it might be unpopular or go against existing precedent.
- If you listen too much to older researchers who don't like your new idea, you won't pursue new and original ideas.
- Also, don't over-update on them liking it, as their enthusiasm could reflect hype or a crowded area.
- Read a paper without seeing the reviews, try to predict what the ML community would have to say about it (ideally, write it down), then look at the reviews and see what you got right or missed.
- It is the qualitative natural-language feedback, not the quantitative review score, that you want to predict.
- Beware of the high variability among ML reviewers these days: they’ll miss some things, and many of their critiques will be made in bad faith.
- To tailor your research to increase acceptance odds.
- To model what research problems the ML community will likely work on or not.
- To dig further into the assumptions and sketchy parts of papers that you might not find, but the community does.
- The actual logical reasons are often secondary to the emotional reasons, e.g., hype or reviews.
- But if you have a good model of the ML community's emotions, you can adversarially train yourself to filter out the hype, trends, and bad motivations.
- Then, you can form a better model of actually good research: research taste.
- Ask "why" questions to go up in abstraction about motivations. E.g., "Why do you care about Bayesian methods?"
- Ask "how" questions to go down in abstraction about concrete choices. E.g., "How are Bayesian methods better at X than Y?"
- Forces you to have good knowledge of classic research that can quickly indicate if someone's work is irrelevant or redundant.
4.4 Academic Collaborations
- Professors don’t do research (in terms of the actual work)—their grad students do.
- Often, academics are more willing than expected to talk about their work or consider follow-up collaboration if it’s evident you’ve read and understood their research.
- Be wary of having too many opinionated collaborators on a paper.
- Despite being somewhat common in ML, having too many collaborators is usually a good way for a paper to die in Idea Hell or otherwise take a lot of time due to conflicting ideas.
- More engineers without opinions can often help accelerate research. Still, too many engineers on a project is definitely a thing and can lead to over-engineering, fractured codebase understanding, and high management costs.
- Several people have recommended to me leading 1-2 projects at a time, plus at most a couple more you collaborate on.
- It’s not uncommon to bring in specialized people later to provide critical feedback on certain topics in exchange for authorship.
- You can also do this if you are specialized enough.
- The mentor usually ended up as the first author on these papers since they came up with the idea and did the initial work, and then they managed collaborators with less effort.
4.5 Writing Papers
- LEADERSHIP LAB: The Craft of Writing Effectively
- Super important. People don’t communicate the value of their work enough.
- Provides a decent structure you can default to for paper organization.
- Issues due to perverse incentives in the field you should avoid.
- Most PhD programs don't prioritize teaching communication skills, but individual researchers can greatly differentiate themselves and their work by developing them.
- Resources that colleagues have recommended or that I like
- Get feedback early and often from researchers you trust about the clarity and organization of your writing.
4.6 Publishing
- Nowadays, most of the impact comes from an arXiv preprint + Twitter thread + sending the preprint to relevant researchers.
- Know the top ML conference cycle:
- They’re big social events for meeting collaborators and finding job opportunities.
- This is partly due to the modern preprint+Twitter ecosystem, where everyone has already read the papers that interest them months before a big conference with those papers occurs.
- Usually, they’re due only a couple of months before the real conference event.
- Much chiller review processes than conferences.
- Usually non-archival, so you are allowed to submit the same paper to many to increase your feedback and odds of acceptance.
- Good for getting decent feedback, and technically a publication for preliminary work that you can expand into a full conference paper later.
- AAAI
- ACL (NLP)
- NAACL: North American Chapter of the ACL (NLP)
- EMNLP (NLP)
- COLM (LLMs)
- ACM FAccT (FATE)
- IJCAI
- But you could consider submitting to Transactions on Machine Learning Research or Journal of Machine Learning Research if a conference deadline doesn’t line up
- It’s ideal to have your paper “done” by submission time.
- But it’s also fine, and sometimes optimal, to submit a rushed paper, keep improving it before the first-round reviews come back, and then share the much-improved version with reviewers while addressing their remaining complaints during rebuttals.
- Some people just optimize for publications, submitting shoddy papers to many places.
- Citations and conference acceptances are not the same as impact.
- It probably only makes sense to play the game now if you instrumentally need a few publications to get into grad school or some other credentialist role.
4.7 Publicizing
- Most of the Shapley value of a paper’s impact hinges on how well you publicize it after releasing a preprint. Most papers only get a couple of citations.
- Definitely post a Twitter thread and engage with commenters and retweeters.
- Aim to give some talks. Study and practice how to give good research talks.
- Send your paper with some nice context directly to a few researchers who would most like to read it.
5. Staying Frosty
5.1 ML Newsletters I Like
- AI News • Buttondown
- I usually just read the small summary at the top, but they also publish daily summaries of the top AI Discord, Reddit, and Twitter discussions.
5.2 Keeping up with ML Research
- Get exposure to the latest papers
- Follow a bunch of researchers you like and some of the researchers they retweet on Twitter.
- Join AI safety Slack workspaces for organic paper-sharing. If you can't access these, you can ask Aaron Scher to join his Slack Connect paper channel.
- Subscribe to the newsletters above.
- There’s a lot of junk out there. Most papers (>99%) won't stand the test of time and won't matter in a few months.
- Focus on papers with good engagement or intriguing titles/diagrams. Don’t waste time on papers that don’t put in the effort to communicate their messages well.
- Filter aggressively based on your specific research interests.
- Don't read ML papers like books, academic papers from other disciplines, or otherwise front-to-back/word-for-word.
- Read in several passes of increasing depth: title, abstract, first figure, all figures, intro/conclusion, selected sections.
- Stop between passes to evaluate understanding and implications.
- Do I understand the claims this paper is making?
- Do I think this paper establishes sufficient evidence for these claims?
- What are the implications of these claims?
- Is it valuable to keep reading?
- "Oh, that might be a cool paper on Twitter" -> open link -> look at title -> skim abstract -> look at 1-3 figures -> "Ahh, that's probably what that's about" -> decide whether to remember it, forget about it, or, rarely, read more
- Sometimes, it is useful to contextualize how a non-groundbreaking paper fits into the existing literature, which can help you decide whether to read more.
- Skim at least 1 new paper per day.
- A lot of the burden of understanding modern ML lies in knowing the vast context in which papers are situated.
- Over time, you'll not only get faster at skimming, you'll also build more context, so you have to look fewer things up.
- E.g., "this paper studies [adversarial prompt attacks] on [transformer]-based [sentiment classification] models" is a lot easier to understand if you know what each of those [things] is.
- Discussing papers with others is super important and a great way to amplify your learning without costing mentorship time!
- Understand arXiv ID information: arxiv.org/abs/2302.08582 means it's the 8582nd paper (08582) pre-printed in February (02) 2023 (23).
- https://alphaxiv.org/ lets people publicly comment on arXiv papers.
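The ID scheme above can be decoded mechanically. Here is a small sketch (the function name is my own, and this covers only modern post-2007 identifiers of the form YYMM.NNNNN, not old-style IDs like cs/0112017):

```python
import re

def parse_arxiv_id(url_or_id: str) -> dict:
    """Decode a modern (post-2007) arXiv ID of the form YYMM.NNNNN."""
    m = re.search(r"(\d{2})(\d{2})\.(\d{4,5})", url_or_id)
    if not m:
        raise ValueError(f"no arXiv ID found in {url_or_id!r}")
    yy, mm, seq = m.groups()
    return {"year": 2000 + int(yy), "month": int(mm), "sequence": int(seq)}

print(parse_arxiv_id("arxiv.org/abs/2302.08582"))
# → {'year': 2023, 'month': 2, 'sequence': 8582}
```

Knowing this lets you date a paper at a glance from its URL, which is handy when skimming for how recent a result is.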
6. Hiring ML Talent
6.1 Finding ML Researchers
- Just do actual recruitment like others in the tech industry.
- Talent sourcing is work, and you need to allocate time and other resources if you want it to happen.
- Ideally, hire someone whose main job is recruiting and who won’t seem totally lost when talking to ML researchers.
- Organizations can pay tech recruiting firms or contractors to help them with this without hiring a full recruiter.
- The MVP is to ask for recommendations for people, peruse LinkedIn, and actively DM many candidates, asking them to apply.
- You can also look for relevant research papers and contact the people listed in the first half and at the very end of the author list.
- They’re big social events to meet collaborators and find job opportunities.
- Even if an organization doesn’t have a paid booth or hosted party at a conference, representatives often attend to recruit researchers.
- If you know someone well in a large organization of ML researchers—such as an AGI lab or prominent academic department—consider asking if they’ve heard of anyone considering a career transition.
- Academic researchers may especially be open to work but have yet to actively seek it out due to the pernicious comfort of academic roles.
- Recruiting talent from AGI scaling labs may be good in multiple ways.
- Many professors might be willing to help governments but would rather avoid signing up for full-time work (due to other commitments) or long-term work (because they want to return to academia).
- It can be much more attractive to clearly offer these people part-time ("work X days per week with us") and/or time-bounded ("it’s only 2/3/4 years") work.
- IPAs (Intergovernmental Personnel Act assignments) and similar contracts can be great mechanisms for this.
6.2 Finding ML Safety-Focused Candidates
- Talk to the admin teams of AI safety research organizations for graduates and promising candidates who didn’t end up in the program
- MATS
- FAR AI for Alignment Workshop attendees
- ERA/KASL
- SPAR for their mentors
- CLR
- 80,000 Hours for their advisees
- Airtable - Potential PhD Supervisors, AI Alignment / Safety
- Signatories of the CAIS Extinction Statement and maybe the FLI Pause Letter
- Constellation or other local AI safety communities
- Some AI safety university groups
- Directly asking some trusted people to refer people
6.3 Incentives
- Academic ML researchers tend to be driven by one or a few of a weird set of incentives:
- Novelty: they want to work on intellectually interesting problems.
- Progress: they want to advance the ML field.
- Prestige: they want recognition for the perception of advancing the ML field or clout from collaborating with cool researchers, often to land an industry or an academic job.
- Citations: they have Goal Mis-Generalized the above into just wanting their Google Scholar numbers to go up.
- Playing the Game: they have Goal Mis-Generalized and like the thrill of submitting to conferences and battling reviewers.
- Societal Impact: unfortunately rare, they want to make the world better.
- Money
- Credentials
- Interdisciplinary work
- You can figure out an ML researcher's incentives pretty quickly by talking to them.
- Sometimes, you can just directly ask what motivates them to do research, and they may be forthcoming.
Acknowledgments
Many thanks to Karson Elmgren and Ella Guest for helpful feedback and to several other ML safety researchers for past discussions that informed this piece!