The paper focuses on the task of extracting relevant paragraphs from legal judgments based on a given query. The authors construct a specialized dataset for this task from the European Court of Human Rights (ECtHR) using case law guides. They assess the performance of current retrieval models in a zero-shot way and establish fine-tuning benchmarks using various models. The results highlight the significant gap between fine-tuned and zero-shot performance, emphasizing the challenge of handling distribution shift in the legal domain. The authors notice that legal pre-training handles distribution shift on the corpus side but still struggles on query-side distribution shift, with unseen legal queries. They also explore various Parameter Efficient Fine-Tuning (PEFT) methods to evaluate their practicality within the context of information retrieval, shedding light on the effectiveness of different PEFT methods across diverse configurations with pre-training and model architectures influencing the choice of PEFT method.
Til et andet sprog
fra kildeindhold
arxiv.org
Vigtigste indsigter udtrukket fra
by T.Y.S.S Sant... kl. arxiv.org 04-02-2024
https://arxiv.org/pdf/2404.00595.pdfDybere Forespørgsler