On the Trustworthiness of Large Language Models
| dc.contributor.advisor | Tajeddine, Razane | |
| dc.contributor.author | Al Sahili, Ali El Akbar | |
| dc.contributor.commembers | Chehab, Ali | |
| dc.contributor.commembers | Issa, Ibrahim | |
| dc.contributor.degree | ME | |
| dc.contributor.department | Department of Electrical and Computer Engineering | |
| dc.contributor.faculty | Maroun Semaan Faculty of Engineering and Architecture | |
| dc.contributor.institution | American University of Beirut | |
| dc.date | 2026 | |
| dc.date.accessioned | 2026-06-02T12:09:06Z | |
| dc.date.submitted | 2026-05-13 | |
| dc.description | Release date : 2027-05-13. | |
| dc.description.abstract | Large Language Models (LLMs) have demonstrated extraordinary capabilities across a wide array of natural language processing tasks, yet their widespread deployment introduces critical privacy and safety risks. Specifically, these models are prone to memorizing sensitive training data, which can be subsequently extracted by malicious actors, and they are also susceptible to adversarial manipulations that bypass safety alignments to generate harmful content. This thesis aims to address gaps in the evaluation of such risks. On the privacy side, it evaluates the role of Membership Inference Attacks (MIAs) within targeted data extraction pipelines, showing that the performance of MIAs is highly dependent on the threat model and attack setup. On the safety side, it provides a systematic study of jailbreak attacks in multilingual settings, analyzing their transferability across languages and highlighting weaknesses in alignment for non-standard scripts and low-resource languages. | |
| dc.identifier.uri | https://hdl.handle.net/10938/35392 | |
| dc.language.iso | en | |
| dc.title | On the Trustworthiness of Large Language Models | |
| dc.type | Thesis | |
| local.AUBID | 202578327 |
Files
Original bundle
1 - 3 of 3
Loading...
- Name:
- AlSahiliAliElAkbar_2026.pdf
- Size:
- 1.58 MB
- Format:
- Adobe Portable Document Format
- Description:
- Main Thesis
Loading...
- Name:
- AlSahiliAliElAkbar_ReleaseForm_2026.pdf
- Size:
- 601.72 KB
- Format:
- Adobe Portable Document Format
- Description:
- Release Form
Loading...
- Name:
- AlSahiliAliElAkbar_ApprovalForm_2026.pdf
- Size:
- 122.43 KB
- Format:
- Adobe Portable Document Format
- Description:
- Approval Form
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.65 KB
- Format:
- Item-specific license agreed upon to submission
- Description: