Search - Eval-Framework v0.5.2

Skip to content

Eval-Framework v0.5.2

Eval-Framework v0.5.2

Getting Started

Installation
Using the CLI

User Guides

Creating Completion Tasks
How to Add a New Benchmark to Eval Framework
Included Benchmark Tasks
Controlling HuggingFace Upload Results Guide
Docker Guide
How to Evaluate HuggingFace Models with Eval Framework
Creating Loglikelihood Tasks
Model Arguments
Overview Dataloading
Understanding Evaluation Results Guide
Using Determined
Utils in eval-framework
Weights and Biases Integration with Eval-Framework

Contributing Guidelines

Contributing to Eval Framework
Testing

API Reference

API Reference

Copyright © 2025, Aleph Alpha Research

Made with Sphinx and @pradyunsg's Furo