NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering

Abstract

The application of large language models (LLMs) in the legal domain holds significant potential for information retrieval and question answering, yet Thai legal QA systems face challenges due to the lack of standardized evaluation benchmarks and the complexity of Thai legal structures. This paper introduces NitiBench, a benchmark comprising two datasets: NitiBench-CCL, covering general Thai financial law, and NitiBench-Tax, which includes real-world tax law cases requiring advanced legal reasoning. We evaluate retrieval-augmented generation (RAG) and long-context LLM-based approaches to address three key research questions: the impact of domain-specific components such as section-based chunking and cross-referencing, the comparative performance of different retrievers and LLMs, and the viability of long-context LLMs as an alternative to RAG. Our results show that section-based chunking significantly improves retrieval and end-to-end performance, that current retrievers struggle with complex queries, and that long-context LLMs still underperform RAG-based systems in Thai legal QA. To support fair evaluation, we propose tailored multi-label retrieval metrics and the use of an LLM-as-judge for coverage and contradiction detection. These findings highlight the limitations of current Thai legal NLP solutions and provide a foundation for future research in the field. We have also made our code and dataset publicly available as open source.

Author and article information

Journal

Publication date (created): 15 February 2025

Article

arXiv ID: 2502.10868

SO-VID: ae7792dd-daeb-4aab-845d-8eef098617b7

License:

http://creativecommons.org/licenses/by/4.0/

History

Custom metadata

Categories: cs.CL

ScienceOpen disciplines: Theoretical computer science
