Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python · 6.2k stars · 690 forks

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python · 1.4k stars · 162 forks

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# · 1.6k stars · 265 forks

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python · 16.3k stars · 1.3k forks

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook · 931 stars · 87 forks

Repositories

Showing 10 of 540 repositories
  • asta-extension
    JavaScript · 1 star · Apache-2.0 · 0 forks · 0 issues · 2 pull requests · Updated Dec 22, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python · 3,468 stars · Apache-2.0 · 477 forks · 17 issues (1 issue needs help) · 38 pull requests · Updated Dec 22, 2025
  • bolmo-core Public

    Code for Bolmo: Byteifying the Next Generation of Language Models

    Python · 103 stars · Apache-2.0 · 11 forks · 1 issue · 5 pull requests · Updated Dec 22, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python · 612 stars · Apache-2.0 · 109 forks · 6 issues · 45 pull requests · Updated Dec 22, 2025
  • asta-bench Public
    Python · 57 stars · Apache-2.0 · 11 forks · 1 issue · 13 pull requests · Updated Dec 22, 2025
  • DrawEduMath Public

    Can VLMs understand students' hand-drawn math work?

    Python · 14 stars · Apache-2.0 · 1 fork · 0 issues · 18 pull requests · Updated Dec 22, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python · 59 stars · Apache-2.0 · 10 forks · 20 issues · 7 pull requests · Updated Dec 21, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python · 62 stars · Apache-2.0 · 11 forks · 1 issue · 31 pull requests · Updated Dec 21, 2025
  • olmoearth_projects Public

    OlmoEarth projects

    Python · 52 stars · 7 forks · 1 issue · 4 pull requests · Updated Dec 20, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python · 16,295 stars · Apache-2.0 · 1,259 forks · 32 issues · 14 pull requests · Updated Dec 20, 2025