dc.contributor.author |
Nayal, Ammar, |
dc.date.accessioned |
2017-08-30T14:12:36Z |
dc.date.available |
2017-08-30T14:12:36Z |
dc.date.issued |
2015 |
dc.date.submitted |
2015 |
dc.identifier.other |
b18328994 |
dc.identifier.uri |
http://hdl.handle.net/10938/10830 |
dc.description |
Thesis. M.E. American University of Beirut. Department of Electrical and Computer Engineering, 2014. ET:6154 |
dc.description |
Advisor : Dr. Mariette Awad, Assistant Professor, Electrical and Computer Engineering ; Committee Members: Dr. Mohamad Adnan Al-Alaoui, Professor, Electrical and Computer Engineering ; Dr. Fadi Zaraket, Assistant Professor, Electrical and Computer Engineering. |
dc.description |
Includes bibliographical references (leaves 49-53) |
dc.description.abstract |
Class imbalance occurs when the different classification categories, or samples, are not equally represented in the training dataset. Class imbalance is frequent in many real life applications and particularly in Arabic short text classification. Classifying an imbalanced dataset is problematic because most traditional clas-sifiers achieve a high accuracy for the majority class, but a consistently low accuracy on the minority class. The many studies developed to classify standard Arabic text docu-ments do not perform well on Arabic short text due to the sparsity of the feature vector. This study proposes the Minority Support Vector Machines (MinSVM) classi-fier, a novel classifier based on Support Vector Machine for binary classification, a Root based Feature Reduction (RFR) scheme for short Arabic text. To validate the performance of our research, MinSVM was tested on some benchmark imbalanced datasets and on a Arabic comics datasets that was manually con-structed. In all our experiments, MinSVM results outperformed some of the main meth-ods suggested in literature for imbalance datasets |
dc.format.extent |
1 online resource (x, 53 leaves) : illustrations ; 30cm |
dc.language.iso |
eng |
dc.relation.ispartof |
Theses, Dissertations, and Projects |
dc.subject.classification |
ET:006154 |
dc.subject.lcsh |
Support vector machines. |
dc.subject.lcsh |
Machine learning. |
dc.subject.lcsh |
Pattern recognition systems. |
dc.subject.lcsh |
Artificial intelligence. |
dc.subject.lcsh |
Comic books, strips, etc.. -- Case studies. |
dc.subject.lcsh |
Text processing (Computer science) |
dc.subject.lcsh |
Arabic language -- Data processing. |
dc.subject.lcsh |
Data mining. |
dc.title |
MinSVM for imbalanced datasets with a case study on Arabic comics classification - |
dc.type |
Thesis |
dc.contributor.department |
Faculty of Engineering and Architecture. |
dc.contributor.department |
Department of Electrical and Computer Engineering, |
dc.contributor.institution |
American University of Beirut. |