JusticeBench

LegalBench is an open benchmark suite spanning six categories of legal reasoning (e.g., issue spotting, rule application, interpretation), with tasks and datasets contributed by lawyers and researchers.

It covers multiple task formats (classification, extraction, generation) and document types (opinions, statutes, contracts), making it a practical “starter kit” for comparing models on law-like skills before you run local legal evaluations (e.g., eviction answers, fee-waiver motions, reasonable accommodation letters).

It includes some justice-related tasks, such as spotting people's legal issues from short (1-3 paragraph) descriptions of their situations. These tasks draw on data from Learned Hands, a labeling platform from Stanford and the Suffolk LIT Lab.

Use LegalBench to shortlist models, then layer on your own rubrics (actionability, guardrails, language parity) and jurisdiction-specific materials.
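
The shortlist-then-layer workflow can be sketched roughly as follows. This is a minimal illustration, not LegalBench's own tooling: the model names, task names, result rows, rubric scale, and weights are all hypothetical placeholders, and macro-averaged accuracy is just one reasonable way to summarize benchmark results.

```python
from collections import defaultdict

# Hypothetical benchmark results: (model, task, predicted label, gold label).
RESULTS = [
    ("model_a", "issue_spotting", "housing", "housing"),
    ("model_a", "issue_spotting", "family", "housing"),
    ("model_a", "rule_application", "yes", "yes"),
    ("model_b", "issue_spotting", "housing", "housing"),
    ("model_b", "issue_spotting", "housing", "housing"),
    ("model_b", "rule_application", "no", "yes"),
    ("model_c", "issue_spotting", "housing", "housing"),
    ("model_c", "issue_spotting", "housing", "housing"),
    ("model_c", "rule_application", "yes", "yes"),
]


def accuracy_by_model(results):
    """Macro-average accuracy: the mean of per-task accuracies,
    so that tasks with many examples do not dominate the score."""
    counts = defaultdict(lambda: defaultdict(lambda: [0, 0]))
    for model, task, predicted, gold in results:
        cell = counts[model][task]
        cell[0] += int(predicted == gold)  # correct answers
        cell[1] += 1                       # total answers
    return {
        model: sum(correct / total for correct, total in tasks.values()) / len(tasks)
        for model, tasks in counts.items()
    }


def shortlist(scores, top_k=2):
    """Keep the top_k models by benchmark score."""
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


def local_score(rubric_scores, weights):
    """Weighted average of locally assigned rubric scores
    (assumed here to be reviewer ratings on a 1-5 scale)."""
    total_weight = sum(weights.values())
    return sum(rubric_scores[name] * w for name, w in weights.items()) / total_weight


scores = accuracy_by_model(RESULTS)
finalists = shortlist(scores, top_k=2)

# Hypothetical weights for the local rubrics applied to the finalists.
weights = {"actionability": 2, "guardrails": 1, "language_parity": 1}
example_review = {"actionability": 4, "guardrails": 2, "language_parity": 3}
print(finalists, local_score(example_review, weights))
```

Only the shortlisted models then go through the slower, human-reviewed local evaluation, which is where jurisdiction-specific materials come in.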

LegalBench benchmark tasks

Description

Access the Dataset