AUB ScholarWorks

MinSVM for imbalanced datasets with a case study on Arabic comics classification -

Show simple item record

dc.contributor.author Nayal, Ammar,
dc.date.accessioned 2017-08-30T14:12:36Z
dc.date.available 2017-08-30T14:12:36Z
dc.date.issued 2015
dc.date.submitted 2015
dc.identifier.other b18328994
dc.identifier.uri http://hdl.handle.net/10938/10830
dc.description Thesis. M.E. American University of Beirut. Department of Electrical and Computer Engineering, 2014. ET:6154
dc.description Advisor : Dr. Mariette Awad, Assistant Professor, Electrical and Computer Engineering ; Committee Members: Dr. Mohamad Adnan Al-Alaoui, Professor, Electrical and Computer Engineering ; Dr. Fadi Zaraket, Assistant Professor, Electrical and Computer Engineering.
dc.description Includes bibliographical references (leaves 49-53)
dc.description.abstract Class imbalance occurs when the different classification categories, or samples, are not equally represented in the training dataset. Class imbalance is frequent in many real life applications and particularly in Arabic short text classification. Classifying an imbalanced dataset is problematic because most traditional clas-sifiers achieve a high accuracy for the majority class, but a consistently low accuracy on the minority class. The many studies developed to classify standard Arabic text docu-ments do not perform well on Arabic short text due to the sparsity of the feature vector. This study proposes the Minority Support Vector Machines (MinSVM) classi-fier, a novel classifier based on Support Vector Machine for binary classification, a Root based Feature Reduction (RFR) scheme for short Arabic text. To validate the performance of our research, MinSVM was tested on some benchmark imbalanced datasets and on a Arabic comics datasets that was manually con-structed. In all our experiments, MinSVM results outperformed some of the main meth-ods suggested in literature for imbalance datasets
dc.format.extent 1 online resource (x, 53 leaves) : illustrations ; 30cm
dc.language.iso eng
dc.relation.ispartof Theses, Dissertations, and Projects
dc.subject.classification ET:006154
dc.subject.lcsh Support vector machines.
dc.subject.lcsh Machine learning.
dc.subject.lcsh Pattern recognition systems.
dc.subject.lcsh Artificial intelligence.
dc.subject.lcsh Comic books, strips, etc.. -- Case studies.
dc.subject.lcsh Text processing (Computer science)
dc.subject.lcsh Arabic language -- Data processing.
dc.subject.lcsh Data mining.
dc.title MinSVM for imbalanced datasets with a case study on Arabic comics classification -
dc.type Thesis
dc.contributor.department Faculty of Engineering and Architecture.
dc.contributor.department Department of Electrical and Computer Engineering,
dc.contributor.institution American University of Beirut.


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AUB ScholarWorks


Browse

My Account