Building a Comprehensive Large Arabic Fact Checking Dataset Using Large Language Models
| dc.contributor.advisor | Elbassuoni, Shady | |
| dc.contributor.author | Khalil, Christophe | |
| dc.contributor.commembers | Assaf, Rida | |
| dc.contributor.commembers | Mouawad, Amer | |
| dc.contributor.degree | MS | |
| dc.contributor.department | Department of Computer Science | |
| dc.contributor.faculty | Faculty of Arts and Sciences | |
| dc.contributor.institution | American University of Beirut | |
| dc.date | 2025 | |
| dc.date.accessioned | 2025-02-18T11:17:58Z | |
| dc.date.available | 2025-02-18T11:17:58Z | |
| dc.date.issued | 2025-02-17T22:00:00Z | |
| dc.date.submitted | 2025-02-12T22:00:00Z | |
| dc.description.abstract | Large-scale fact verification poses a significant challenge in Arabic natural language processing due to limited datasets and resources. This work introduces a new large- scale dataset for fact-checking in Modern Standard Arabic, constructed through an automated framework leveraging large language models (LLMs). We propose a three-step pipeline: (1) claim generation from Arabic Wikipedia articles with sup- porting evidence, (2) systematic claim mutation to create challenging counterfactual statements, and (3) rigorous verification and labeling. The resulting dataset com- prises 180,000 claim-evidence pairs labeled as Supported, Refuted, or Not Enough Info. Human evaluation demonstrates strong inter-annotator agreement (κ= 0.89) in Cohen’s Kappa for the Generation Task and (κ= 0.94) for the Refutation Task on our testing sample, while our baseline models achieve 87% accuracy on the verifi- cation task with respect to the expert annotator. Our approach employs specialized prompt engineering and grammatical rules to address Arabic-specific linguistic fea- tures. This provides the first large-scale benchmark for Arabic fact verification.Our methodology presents a scalable approach for developing similar resources for other low-resource languages. Through this work, we aim to advance the state of auto- mated fact verification in Arabic and provide a foundation for future research in multilingual fact-checking. | |
| dc.identifier.uri | http://hdl.handle.net/10938/34784 | |
| dc.language.iso | en | |
| dc.subject.keywords | Large Language Models (LLMs) | |
| dc.subject.lcsh | Deep learning (Machine learning) | |
| dc.subject.lcsh | Arabic language--Data processing | |
| dc.subject.lcsh | Data sets | |
| dc.subject.lcsh | Natural language processing (Computer science) | |
| dc.subject.lcsh | Computational linguistics | |
| dc.title | Building a Comprehensive Large Arabic Fact Checking Dataset Using Large Language Models | |
| dc.type | Thesis | |
| local.AUBID | 202371962 |