The paper focuses on the task of extracting relevant paragraphs from legal judgments based on a given query. The authors construct a specialized dataset for this task from the European Court of Human Rights (ECtHR) using case law guides. They assess the performance of current retrieval models in a zero-shot way and establish fine-tuning benchmarks using various models. The results highlight the significant gap between fine-tuned and zero-shot performance, emphasizing the challenge of handling distribution shift in the legal domain. The authors notice that legal pre-training handles distribution shift on the corpus side but still struggles on query-side distribution shift, with unseen legal queries. They also explore various Parameter Efficient Fine-Tuning (PEFT) methods to evaluate their practicality within the context of information retrieval, shedding light on the effectiveness of different PEFT methods across diverse configurations with pre-training and model architectures influencing the choice of PEFT method.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by T.Y.S.S Sant... lúc arxiv.org 04-02-2024
https://arxiv.org/pdf/2404.00595.pdfYêu cầu sâu hơn