Towards Hybrid Evaluation Methodologies for Large Language Models in the Legal Domain

Sanchi, Marco; Novotná, Tereza

doi:10.3233/FAIA241279

IOS Press Ebooks

Authors

Marco Sanchi, Tereza Novotná

Pages

389 - 392

DOI

10.3233/FAIA241279

Category

Research Article

Series

Frontiers in Artificial Intelligence and Applications

Ebook

Volume 395: Legal Knowledge and Information Systems

Abstract

This paper analyses automated and human-driven approaches to evaluating the performance of Large Language Models (LLMs) in the legal domain, stressing the need to combine both into hybrid evaluation frameworks. This conclusion is reinforced by a qualitative case study that uncovers the assessment factors lawyers consider when using LLMs. Because these factors are diverse and require distinct evaluation approaches, they underscore the need for a hybrid methodology.
