On the Trustworthiness of Large Language Models

dc.contributor.advisorTajeddine, Razane
dc.contributor.authorAl Sahili, Ali El Akbar
dc.contributor.commembersChehab, Ali
dc.contributor.commembersIssa, Ibrahim
dc.contributor.degreeME
dc.contributor.departmentDepartment of Electrical and Computer Engineering
dc.contributor.facultyMaroun Semaan Faculty of Engineering and Architecture
dc.contributor.institutionAmerican University of Beirut
dc.date2026
dc.date.accessioned2026-06-02T12:09:06Z
dc.date.submitted2026-05-13
dc.descriptionRelease date : 2027-05-13.
dc.description.abstractLarge Language Models (LLMs) have demonstrated extraordinary capabilities across a wide array of natural language processing tasks, yet their widespread deployment introduces critical privacy and safety risks. Specifically, these models are prone to memorizing sensitive training data, which can be subsequently extracted by malicious actors, and they are also susceptible to adversarial manipulations that bypass safety alignments to generate harmful content. This thesis aims to address gaps in the evaluation of such risks. On the privacy side, it evaluates the role of Membership Inference Attacks (MIAs) within targeted data extraction pipelines, showing that the performance of MIAs is highly dependent on the threat model and attack setup. On the safety side, it provides a systematic study of jailbreak attacks in multilingual settings, analyzing their transferability across languages and highlighting weaknesses in alignment for non-standard scripts and low-resource languages.
dc.identifier.urihttps://hdl.handle.net/10938/35392
dc.language.isoen
dc.titleOn the Trustworthiness of Large Language Models
dc.typeThesis
local.AUBID202578327

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
AlSahiliAliElAkbar_2026.pdf
Size:
1.58 MB
Format:
Adobe Portable Document Format
Description:
Main Thesis
Loading...
Thumbnail Image
Name:
AlSahiliAliElAkbar_ReleaseForm_2026.pdf
Size:
601.72 KB
Format:
Adobe Portable Document Format
Description:
Release Form
Loading...
Thumbnail Image
Name:
AlSahiliAliElAkbar_ApprovalForm_2026.pdf
Size:
122.43 KB
Format:
Adobe Portable Document Format
Description:
Approval Form

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: