Abstract:
In this rapidly evolving digital era, textual content production relies heavily on Large Language Models (LLMs). These models are prone to inheriting, and thus propagating, various forms of stereotypes and gender bias from their training corpora, with harmful consequences for populations worldwide, such as loss of human potential, aggressive behavior, biased mental imagery, and unequal labor force participation. This thesis therefore evaluated gender bias in the responses of one of the most recent and popular LLMs, ChatGPT. We examined occupational and semantic bias in three common ChatGPT tasks as well as in the embedding task of the Ada-V2 model. We then fine-tuned ChatGPT on bias detection for three types of bias: sexism, dehumanization, and generic bias. The fine-tuned versions outperformed the original model, as well as other popular LLMs, in bias detection. We also highlighted two major weaknesses in ChatGPT’s learning capabilities and reduced the gender gaps in the model’s responses. This research builds a strong basis for future work to ensure the safe and valuable use of recent AI tools such as ChatGPT.