AUB ScholarWorks

Data collection from the web and social media sites

dc.contributor.author Eido, Farah Malih.
dc.date.accessioned 2013-10-02T09:24:07Z
dc.date.available 2013-10-02T09:24:07Z
dc.date.issued 2013
dc.identifier.uri http://hdl.handle.net/10938/9681
dc.description Project (M.S.)--American University of Beirut, Department of Computer Science, 2013.
dc.description First Reader : Dr. Wassim El Hajj, Assistant Professor, Computer Science--Second Reader : Dr. Haidar Safa, Associate Professor, Computer Science.
dc.description Includes bibliographical references (leaves 23-24)
dc.description.abstract As social media is becoming the trend now where more than half the world's population is on social networking sites, and given the wealth of information that can be found on these sites as well as the web, a crucial need has emerged for businesses to have insights into these data sources. To answer this need, we propose an automated information gathering web application that accepts a query and gathers the relevant information from Twitter tweets, YouTube comments, Facebook wall posts, and web pages returned by the Google search engine. The gathered data is then displayed and stored in a structured format in a database. One edge of this data collection project is that it is tailored to the Arabic language, where both the query and the returned data are in Arabic. However, the system works for English queries as well. The other edge in the project is the use of Vaadin, a Java framework for building modern web applications that look great and perform well. The main benefit of this application is that it constitutes an integral module for any project that requires data gathering from the web and social media sites. To name a few, such projects include product and market analysis and opinion mining.
dc.format.extent vii, 24 leaves : ill. (some col.) ; 30 cm.
dc.language.iso eng
dc.relation.ispartof Theses, Dissertations, and Projects
dc.subject.classification Pj:001737 AUBNO
dc.subject.lcsh Web sites.
dc.subject.lcsh Social media.
dc.subject.lcsh Java (Computer program language)
dc.subject.lcsh Computer software.
dc.subject.lcsh Databases.
dc.title Data collection from the web and social media sites
dc.type Project
dc.contributor.department American University of Beirut. Faculty of Arts and Sciences. Department of Computer Science.

