Mitigating Hallucinations in Semantic Annotation of Administrative Documents for Digital Twin Government

Jeongsu Kim, Wonseok Son, Jungkyu Han, Sejin Chun (2025). NOLTA 2025

Keywords Semantic Annotation, Hallucination Mitigation, Proposition Verification, Digital Twin Government, Administrative Documents
International Conference

Abstract

With the growing adoption of the digtial twin for data-driven decision-making, the accurate interpretation of massive public administrative documents has become increasingly critical. Semantic annotation, a key technique for enhancing the value of structured data, faces limitations such as ambiguity and lack of context—issues that are especially critical in the administrative domain. Recent studies on data augmentation using large language models (LLMs) have improved the performance of semantic annotation. However, the phenomenon of hallucination—where LLMs generate incorrect or fabricated content—poses a significant barrier to sensitive application domains like government. This paper proposes FACT-DT, a framework for Hallucination Control in Semantic Annotation of administrative tables. our framework leverages both tabular data and tabular context (e.g., topics, column headers, and inter-column relationships) to minimize context loss and control hallucinations during generation. It achieves this by decomposing the generated outputs into verifiable atomic propositions and extracting inter-sentential relationships to verify factuality. The effectiveness of the proposed framework in reducing hallucinations and enhancing faithfulness is quantitatively validated using LLM-based evaluation methods, such as G-Eval.