    Repositories list

    • bergson

      Public
      Mapping out the "memory" of neural nets with data attribution
      Python
      103247 Updated Nov 12, 2025
    • lm-evaluation-harness

      Public
      A framework for few-shot evaluation of language models.
      Python
      2.8k11k510164 Updated Nov 11, 2025
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      5122333 Updated Nov 10, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      322121510 Updated Nov 10, 2025
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      8665352 Updated Nov 10, 2025
    • Tools for understanding how transformer predictions are built layer-by-layer
      Python
      59200 Updated Nov 10, 2025
    • Python
      21020 Updated Nov 6, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k000 Updated Nov 3, 2025
    • gamescope

      Public
      Can interpretability methods confer an advantage in competitive games?
      Python
      0200 Updated Oct 31, 2025
    • djinn

      Public
      Generating, validating and running exploitable verifiable coding problems
      Python
      0700 Updated Oct 31, 2025
    • Problems generated by djinn (exploitably verifiable coding problems)
      0000 Updated Oct 31, 2025
    • Simplified library for mapping out the "memory" of neural nets with data attribution
      Python
      10000 Updated Oct 26, 2025
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      7612 Updated Oct 14, 2025
    • Jupyter Notebook
      71100 Updated Oct 9, 2025
    • aria

      Public
      Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
      Python
      138200 Updated Oct 8, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      1.1k7.3k6126 Updated Sep 26, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      4.6k17001 Updated Sep 26, 2025
    • Sparsify transformers with cross-layer transcoders
      Python
      861602 Updated Aug 12, 2025
    • attribute

      Public
      Python
      61201 Updated Aug 6, 2025
    • attention-probes

      Public
      Linear probes with attention weighting
      Python
      1700 Updated Aug 2, 2025
    • verifiers

      Public
      Verifiers for LLM Reinforcement Learning
      Python
      430000 Updated Jul 31, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      4282283 Updated Jul 29, 2025
    • Python
      0100 Updated Jul 22, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      3531 Updated Jul 21, 2025
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      96000 Updated Jul 9, 2025
    • nanoGPT-mup

      Public
      The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      8.3k17220 Updated Jun 27, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      1952.7k153 Updated Jun 9, 2025
    • Investigating goal instability in RL
      Python
      0100 Updated Jun 2, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      2.4k400 Updated May 21, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      4200 Updated May 21, 2025