AUB ScholarWorks

Hadoop extensions for distributed computing on reconfigurable active SSD -

Show simple item record Kaitoua, Abdulrahman Abdulmajid, 2013 2015-02-03T09:49:31Z 2015-02-03T09:49:31Z 2013 2013
dc.identifier.other b17910456
dc.description Thesis (M.E.)-- American University of Beirut, Department of Electrical and Computer Engineeering, 2013.
dc.description Advisor : Dr. Hazem Hajj, Associate Professor, Electrical and Computer Engineering--Committee Members : Dr. Hassan Ali Artail, Professor, Electrical and Computer Engineering ; Dr. Mazen Saghir, Associate Professor, Electrical and Computer Engineering.
dc.description Includes bibliographical references (leaves 52-55)
dc.description.abstract In recent years, there has been an expanded usage and proliferation of cloud computing services. Along with the usage expansion, there has been extensive research on extending cloud computing capabilities to be more computationally efficient. One such example is the recent work that proposed a novel architecture for a distributed reconfigurable processing platform with SSD storage, called RASSD. The system is targeted for processing data-intensive applications at the storage node itself, without having to move data over slow networks. In this thesis, we extend the Hadoop cloud computing framework to serve as cloud computing platform for integrating hardware accelerating capabilities with cloud computing environments. In particular, we show how to extend Hadoop for the distributed RASSD environment. Our contributions include: 1. Additional components for extending Hadoop and the Map-Reduce model, 2. The changes needed to configure the usage of Hadoop for the RASSD environment, 3. Demonstrating how applications are mapped to the extended MapReduce model, and 4. evaluating the benefits and impact of hardware acceleration on overall application performance in the cloud environment. Our RASSD Hadoop work suggests a two-level implementation of MapReduce to support the capability of accelerating map functions on the active storage units. To support the evaluation of the distributed hardware acceleration system under different conditions, we built a Hadoop-based simulator to evaluate the RASSD environment. Additionally, network simulations were used to simulate the performance benefit with a larger number of remote RASSD nodes. The system was evaluated using applications provided with Stanford’s Phoenix MapReduce platform. Our implementations demonstrated the ease of programming in the extended Hadoop environment, while the experimental results showed significant performance gains with map acceleration on the remote RASSD processing nodes. With an assumption of 20X map acceleration on RASSD nodes, the overall p
dc.format.extent x, 55 leaves : colored illustrations ; 30 cm
dc.language.iso eng
dc.relation.ispartof Theses, Dissertations, and Projects
dc.subject.classification ET:005911 AUBNO
dc.subject.lcsh Adaptive computing systems.
dc.subject.lcsh Software engineering.
dc.subject.lcsh Electronic data processing -- Distributed processing.
dc.subject.lcsh Parallel programming (Computer science)
dc.subject.lcsh Computer programming.
dc.subject.lcsh Solid state electronics.
dc.subject.lcsh High performance computing.
dc.title Hadoop extensions for distributed computing on reconfigurable active SSD -
dc.type Thesis
dc.contributor.department American University of Beirut. Faculty of Engineering and Architecture. Department of Electrical and Computer Engineeering.

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AUB ScholarWorks


My Account