Abstract
MLEB is the largest open-source benchmark for legal information retrieval, encompassing multiple jurisdictions, document types, and task types.
We present the Massive Legal Embedding Benchmark (MLEB), the largest, most diverse, and most comprehensive open-source benchmark for legal information retrieval to date. MLEB consists of ten expert-annotated datasets spanning multiple jurisdictions (the US, UK, EU, Australia, Ireland, and Singapore), document types (cases, legislation, regulatory guidance, contracts, and literature), and task types (search, zero-shot classification, and question answering). Seven of the datasets in MLEB were newly constructed in order to fill domain and jurisdictional gaps in the open-source legal information retrieval landscape. We document our methodology in building MLEB and creating the new constituent datasets, and release our code, results, and data openly to assist with reproducible evaluations.
Community
Hey all,
This is my first-ever paper! In it, we present the Massive Legal Embedding Benchmark (MLEB), a new open-source benchmark for legal information retrieval. It consists of ten expert-annotated datasets spanning multiple jurisdictions (the US, UK, EU, Australia, Ireland, and Singapore), document types (cases, legislation, regulatory guidance, contracts, and literature), and task types (search, zero-shot classification, and question answering).
This paper documents our methodology in creating MLEB as well as our findings. We've openly released our datasets here on Hugging Face and have made our code available on GitHub: https://github.com/isaacus-dev/mleb
Amazing Work
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Large Language Models Meet Legal Artificial Intelligence: A Survey (2025)
- Scaling Legal AI: Benchmarking Mamba and Transformers for Statutory Classification and Case Law Retrieval (2025)
- Towards Reliable Retrieval in RAG Systems for Large Legal Datasets (2025)
- KoBLEX: Open Legal Question Answering with Multi-hop Reasoning (2025)
- LLMs for LLMs: A Structured Prompting Methodology for Long Legal Documents (2025)
- ALARB: An Arabic Legal Argument Reasoning Benchmark (2025)
- AI for Statutory Simplification: A Comprehensive State Legal Corpus and Labor Benchmark (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 10
Browse 10 datasets citing this paperSpaces citing this paper 0
No Space linking this paper