eval_frameworkΒΆ
- eval_framework package
- Subpackages
- eval_framework.context package
- eval_framework.llm package
- eval_framework.metrics package
- eval_framework.result_processors package
- eval_framework.tasks package
- Subpackages
- Submodules
- eval_framework.tasks.base module
- eval_framework.tasks.eval_config module
- eval_framework.tasks.perturbation module
- eval_framework.tasks.registry module
- eval_framework.tasks.task_loader module
- eval_framework.tasks.task_names module
- eval_framework.tasks.task_style module
- eval_framework.tasks.utils module
- Module contents
- Submodules
- eval_framework.base_config module
- eval_framework.evaluation_generator module
- eval_framework.exceptions module
- eval_framework.logger module
- eval_framework.main module
- eval_framework.response_generator module
- eval_framework.run module
- eval_framework.run_direct module
- eval_framework.suite module
MetricSourceSuiteAggregateSuiteResultTaskSuiteTaskSuite.aggregatesTaskSuite.batch_sizeTaskSuite.extra_llm_argsTaskSuite.get_hyperparam_overrides()TaskSuite.hf_revisionTaskSuite.is_leafTaskSuite.load()TaskSuite.load_from_py()TaskSuite.load_from_yaml()TaskSuite.max_tokensTaskSuite.model_configTaskSuite.nameTaskSuite.num_fewshotTaskSuite.num_samplesTaskSuite.repeatsTaskSuite.task_nameTaskSuite.task_subjectsTaskSuite.tasksTaskSuite.temperatureTaskSuite.top_kTaskSuite.top_pTaskSuite.validate_suite()
compute_aggregates()parse_strings_to_task_or_suite()resolve_to_evalconfig_kwargs()run_suite()save_suite_results()
- Module contents
- Subpackages