Published on January 20, 2025 10:31 PM GMT

Our work centers on empirical research with LLMs. If you are conducting similar research, these tips and tools may help streamline your workflow and increase experiment velocity. We are also releasing two repositories to promote sharing more tooling within the AI safety community.

John Hughes is an independent alignment researcher working with Ethan Perez and was a MATS mentee in the Summer of 2023. In Ethan's previous writeup on research tips, he explains the criteria that strong collaborators often have, and he puts 70% weight on "getting ideas to work quickly." Part of being able to do this is knowing what tools there are at your disposal.

This post, written primarily by John, shares the tools and principles we both use to increase our experimental velocity. Many readers will already know much of this, but we wanted to be comprehensive, so it is a good resource for new researchers (e.g., those starting MATS). If you are a well-versed experimentalist, we recommend checking out the tools in Part 2—you might find some new ones to add to your toolkit. We're also excited to learn from the community, so please feel free to share what works for you in the comments!

Quick Summary

Part 1: Workflow Tips

Part 2: Useful Tools

uv

Part 3: Experiment Tips

Part 4: Shared AI Safety Tooling Repositories

shared tooling

examples

Part 1: Workflow Tips

Terminal

Efficient terminal navigation is essential for productivity, especially when working on tasks like running API inference jobs or GPU fine-tuning on remote machines. Managing directories, editing files, or handling your Git repository can feel tedious when relying solely on bash commands in a standard terminal. Here are some ways to make working in the terminal more intuitive and efficient.

iTerm2

Natural Text Editing Preset

increased keyboard key repeat rate

zsh-autosuggestions

zsh-syntax-highlighting

zsh-completions

zsh-history-substring-search

https://dotfiles.github.io/

unlimited history length


~/.zshrc

~/.tmux.conf

gc

git commit -m

file

rl

here

ls

cd

Note: there are many recommendations here, which can be overwhelming, but all of this is automated in John's dotfiles (including installing zsh and tmux, changing key repeat speeds on Mac and setting up aliases). So, if you'd like to get going quickly, we recommend following the README to install and deploy this configuration.

Integrated Development Environment (IDE)

Choosing the right Integrated Development Environment (IDE) can enhance your productivity, especially when using LLM coding assistants. A good IDE simplifies code navigation, debugging, and version control.

Cursor

.cursorrules

Syncing code by pushing to GitHub and then pulling it onto your remote machine is inefficient, as you need to repeat the process every time you test a new fix for a bug. A more effective approach is to edit the code directly on the remote machine, test it there, and push only the finalized bug fix. You can also edit remote files outside of the code repository from within VSCode/Cursor, which is very helpful. Terminal-based text editors such as Vim and Nano are an alternative, but they have much steeper learning curves.

VSCode debugger

breakpoint()

here

Autosave (very useful, so you never have to worry about accidentally running a script without pressing save)

Jupyter run startup commands (e.g. autoreload)

Jupyter notebook file root (setting to the root of the repo can be helpful)

VSCode Debugger remote attach settings (allow you to debug code running on a remote machine from your local VSCode instance)

Linting & code formatting extension configuration

File watcher excludes (so VSCode doesn’t slow down by tracking changes in virtual environments or other folders that contain many files)

GitLens

Jupyter

Ruff

Black

Nvidia-smi+

JSON Lines Viewer

jsonl

LaTeX Workshop

devServer

website

Inspect AI

Karabiner

Vim keybindings

Jump to the next open tab: ⌘⌥ left/right

Search all tabs: ⌘⇧A

Git, GitHub and Pre-Commit Hooks

Mastering Git, GitHub, and pre-commit hooks is key to maintaining a smooth and reliable workflow. These tools help you manage version control, collaborate effectively, and automate code quality checks to prevent errors before they happen.

GitHub

.pre-commit-config.yaml

here

pyproject.toml

here

make hooks

Ruff

Black

nbstripout

ReviewNB

Part 2: Useful Tools

Not all of these recommendations are directly related to research (e.g., time-tracking apps), but they are excellent productivity tools worth knowing about. The goal of this list is to make you aware of what’s available—not to encourage you to adopt all of these tools at once, but to provide options you can explore and incorporate as needed.

Software/Subscriptions

Cursor

Zed

GitHub Copilot

ChatGPT+

Claude Pro

Tuple

pair with guests

Google One

Grammarly

Perplexity

TimingApp

Wakatime

ReadAI

Otter

Granola

Rectangle

Copy Clip

Raycast

Alfred

Context

Karabiner

Zotero


Homerow

Dash

Speechify

BetterTouchTool

LiquidText

ice

LLM Tools

Weights & Biases

Inspect

Aider

Devin

openweights

LiteLLM

repo2txt

langchain

vLLM

PromptFoo

Langfuse

Ollama

exa.ai

unsloth

axolotl

Prismatic VLMs

open-clio

LLM Providers

RunPod

TogetherAI

OpenRouter

HuggingFace Dedicated Inference Endpoints

Command Line and Python Packages

pip

pyenv

virtualenv

pip

scalene

py-spy

asyncio

shell-ask

ask.sh

jless

ncdu: an interactive recursive filesize explorer

htop

nvtop

ripgrep

Dust (better du)

duf

bat, exa

code2prompt

opencommit

magic-wormhole: copy files between machines

Part 3: Experiment Tips

De-risk and extended project mode

First, we'd like to explain that there are usually two modes that a research project is in: namely, de-risk mode and extended project mode. These modes significantly change how you should approach experiments, coding style, and project management.

De-risk mode

Quick experimentation using Python notebooks that minimize time-to-insight.

Minimal investment to avoid effort in engineering practices, like extensive documentation, strict coding standards, or generalized pipelines.

Extended project mode

Transitioning from notebooks to structured scripts, modules, or pipelines.

Applying code reviews, testing, and version control.

Using tools like pre-commit hooks and CI/CD workflows to enforce quality.

The workflow should always be conditioned on the situation:

Start in de-risk mode

Switch to extended project mode

Note: sometimes projects start here if significant infrastructure is needed, if it suits the collaborators' workflow better, or if the project is already de-risked before starting.

Ethan tends to be in de-risk mode for 75% of his work, and he uses Python notebooks to explore ideas (for example, many-shot jailbreaking was derisked in a notebook with ~50 lines of code). The Alignment Science team at Anthropic is also primarily in "de-risk mode" for initial alignment experiments and sometimes switches to "Extended project mode" for larger, sustained efforts.

Note: Apollo defines these modes similarly as "individual sprint mode" and "standard mode" in their Engineering Guide. We opt for different names since lots of the research we are involved with can primarily be in de-risk mode for a long period of time.

Tips for both modes

Have a clear project plan that includes motivation and research goals, and list all the experiments you can possibly think of running. Get feedback from peers and iterate.

Think about milestones for the project and what you want to deliver. This will help to keep yourself accountable. Don't underestimate how long it takes to write a paper.

If you'd like to open-source code, it is worth investing time at the start thinking about how you will design the repo so it is easy to use.

Know the LLM tools that are out there (e.g. list in Part 2). It might be a good idea to build off an existing framework like Inspect or use tools like LiteLLM to make sure you have the flexibility down the line to run more models easily.

As you build experience knowing how you best run experiments, start to build your own reusable tooling (and perhaps contribute it to our safety-tooling repo; see Part 4).

One of the important ways to move quickly with research is to choose the right next experiments to run. Therefore, it is important to communicate plans regularly with the team so you can get feedback.

We use a Slack channel for daily updates within the cohort. This is helpful for structuring your own next steps, keeping yourself accountable, and also providing your mentor/team with good context.

Projects we run often have a daily standup with close collaborators. We find this essential for staying aligned on research goals, prioritising the right work and delivering on time.

tips

What is the motivation for this experiment? Does it fit in with the research question I want to answer?

Have I derisked this enough already, and are there other more interesting things to run instead? Is this definitely the highest priority?

What result do I expect to get? Is learning that useful?

Should I explore one model and one dataset first before expanding to more?

Will running this extra experiment add significant value to our paper? (especially relevant when close to a deadline)

Am I changing too many variables at once? Can I simplify my setup to draw better conclusions?

./experiments/<name>/250109_jailbreaking_technique_v1

1_run_harmbench.sh

2_run_classifier.sh

3_analyse_attack_success_rate.ipynb
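The date-prefixed folder layout above can be created with a small helper. This is only a minimal sketch under our own assumptions: the function name make_experiment_dir is hypothetical, we read the "250109" prefix as a YYMMDD date, and we treat <name> as a per-person folder.

```python
from datetime import date
from pathlib import Path
from typing import Optional


def make_experiment_dir(
    root: Path,
    owner: str,
    title: str,
    version: int = 1,
    today: Optional[date] = None,
) -> Path:
    """Create root/<owner>/<YYMMDD>_<title>_v<version> and return its path.

    The naming scheme is an assumption inferred from the example folder name.
    """
    today = today or date.today()
    # %y%m%d gives a two-digit year, month and day, so folders sort by date.
    folder = f"{today:%y%m%d}_{title}_v{version}"
    path = root / owner / folder
    path.mkdir(parents=True, exist_ok=True)
    return path
```

Numbered scripts and notebooks (1_..., 2_..., 3_...) can then live inside the returned folder so the order in which they were run stays obvious.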

Tips for extended project mode

When operating in de-risk mode, it's important not to overdo it. For early-stage, low-compute experiments or projects managed by a single person, working directly in notebooks is often the most efficient and practical approach.

We encourage the teams we work with to review PRs and work to merge them fast as a number one priority. This helps ensure everyone runs the latest and greatest code and avoids difficult merge conflicts down the line.

Caching of LLM responses and other algorithm state is important for this. For example, this allows you to tweak concurrency settings and restart a run without losing progress.

This may also involve saving intermediate outputs/checkpoints, which has the added benefit that you can check progress or potential bugs as your experiment is running.
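To make the caching idea concrete, here is a minimal sketch of a disk cache for LLM responses. It is not the implementation in safety-tooling; the class name ResponseCache and the call_fn parameter are hypothetical, and call_fn stands in for whatever function actually calls your model.

```python
import hashlib
import json
from pathlib import Path
from typing import Callable


class ResponseCache:
    """Disk cache keyed by a hash of the model name, prompt and sampling params."""

    def __init__(self, cache_dir: Path) -> None:
        self.cache_dir = cache_dir
        self.cache_dir.mkdir(parents=True, exist_ok=True)

    def _path(self, model: str, prompt: str, params: dict) -> Path:
        # sort_keys makes the key independent of dict insertion order.
        key = json.dumps(
            {"model": model, "prompt": prompt, "params": params}, sort_keys=True
        )
        digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
        return self.cache_dir / f"{digest}.json"

    def get_or_call(
        self,
        model: str,
        prompt: str,
        params: dict,
        call_fn: Callable[[str, str, dict], str],
    ) -> str:
        """Return the cached response if present, otherwise call and store it."""
        path = self._path(model, prompt, params)
        if path.exists():
            return json.loads(path.read_text())["response"]
        response = call_fn(model, prompt, params)
        path.write_text(json.dumps({"prompt": prompt, "response": response}))
        return response
```

Because every response is written to its own file as soon as it arrives, an interrupted run can be restarted and will only pay for the prompts it has not yet completed.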

If the same script is run, the result of the experiment should be as close to the original as possible. This isn’t always possible due to nondeterministic LLM APIs (even at temperature 0), but everything else should be the same (e.g., data splits, hyperparameters, etc).

Setting this up correctly is important to make caching work, too; otherwise, the prompts will be different, and everything will start from scratch.

It can be useful to save the git commit hash in your experiment directory or even make a copy of the entire codebase (just in case you need to debug in the future).
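One way to record the git commit hash alongside a run is sketched below. The helper names and the run_metadata.json filename are our own hypothetical choices, and the sketch only seeds Python's random module; NumPy, PyTorch or other libraries would need their own seeding.

```python
import json
import random
import subprocess
from pathlib import Path
from typing import Optional


def current_git_commit() -> Optional[str]:
    """Return the current commit hash, or None if git or a repo is unavailable."""
    try:
        result = subprocess.run(
            ["git", "rev-parse", "HEAD"], capture_output=True, text=True, check=True
        )
    except (subprocess.CalledProcessError, FileNotFoundError):
        return None
    return result.stdout.strip()


def save_run_metadata(exp_dir: Path, config: dict, seed: int) -> Path:
    """Seed the RNG and write the commit hash, seed and config to exp_dir."""
    random.seed(seed)  # fixes data splits or sampling that rely on `random`
    exp_dir.mkdir(parents=True, exist_ok=True)
    metadata = {"git_commit": current_git_commit(), "seed": seed, "config": config}
    out = exp_dir / "run_metadata.json"
    out.write_text(json.dumps(metadata, indent=2, sort_keys=True))
    return out
```

Calling save_run_metadata at the top of an experiment script means the hyperparameters and code version that produced a result can be looked up later from the experiment directory.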

jsonl

.describe()

example

fire

hydra

simple_parsing

example

simple-gpu-scheduler

openweights

Part 4: Shared AI Safety Tooling Repositories

For many early-career researchers, there's an unnecessarily steep learning curve for even figuring out what good norms for their research code should look like in the first place. We're all for people learning and trying things for themselves, but we think it would be great to have the option to do that on top of a solid foundation that has been proven to work for others. That's why resources such as the ARENA curriculum are so valuable.

However, there aren't standardised templates/repos for most of the work in empirical alignment research. We think this probably slows down new researchers a lot, requiring them to unnecessarily duplicate work and make decisions that they might not notice are slowing them down. ML research, in general, involves so much tinkering and figuring things out that building from a strong template can be a meaningful speedup and provide a helpful initial learning experience.

For the MATS 7 scholars mentored by Ethan, Jan, Fabien, Mrinank, and others from the Anthropic Alignment Science team, we have created a GitHub organization called safety-research to allow everyone to easily discover and benefit from each other's code. We are piloting two repositories: 1) one for shared tooling such as inference and fine-tuning tools and 2) a template repo to clone at the start of a project, with examples of using the shared tooling. We are open-sourcing these two repositories and would love for others to join us!

Repo 1: safety-tooling

Share Great Tooling

Upskill Collaborators

Submodule Design

Repo 2: safety-examples

Share Examples

Onboard Researchers Quickly

Note: We are very excited about UK AISI's Inspect framework, which also implements lots of what is in safety-tooling and much more (such as tool usage and extensive model graded evaluations). We love the VSCode extension for inspecting log files and the terminal viewer for experiment progress across models and tasks. We aim to build a bigger portfolio of research projects that use Inspect within safety-examples and build more useful research tools that Inspect doesn't support in safety-tooling.

Acknowledgements

We'd like to thank Jack Youstra and Daniel Paleka, as many of the useful tool suggestions stem from conversations with them. For more of their recommendations, check out their blogs here and here. John would like to thank Ed Rees and others at Speechmatics, from whom he's borrowed and adapted dotfiles functionality over the years. Thanks to Sara Price, James Chua, Henry Sleight and Dan Valentine for providing feedback on this post.
