Pinned Loading
Repositories
Showing 10 of 103 repositories
- inspect_ai Public Forked from UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
VectorInstitute/inspect_ai’s past year of commit activity - inspect_evals Public Forked from UKGovernmentBEIS/inspect_evals
Collection of evals for the Inspect evaluation framework
VectorInstitute/inspect_evals’s past year of commit activity - adrenaline Public
A pipeline for creating clinical reasoning benchmarks from electronic health records
VectorInstitute/adrenaline’s past year of commit activity
Top languages
Loading…