Scalable distributed lifelong multi-task reinforcement learning

El Zini, Julia

dc.contributor.author	El Zini, Julia
dc.date.accessioned	2018-10-11T11:43:20Z
dc.date.available	2018-10-11T11:43:20Z
dc.date.copyright	2020-02
dc.date.issued	2017
dc.date.submitted	2017
dc.identifier.other	b21055762
dc.identifier.uri	http://hdl.handle.net/10938/21495
dc.description	Thesis. M.S. American University of Beirut. Department of Computer Science, 2017. T:6736. Advisor: Dr. Mohamad I. Jaber, Assistant Professor, Computer Science; Members of Committee: Dr. Mariette Awad, Associate Professor, Electrical and Computer Engineering; Dr. Wassim El Hajj, Associate Professor, Computer Science.
dc.description	Includes bibliographical references (leaves 69-77)
dc.description.abstract	Multi-task reinforcement learning (MTRL) suffers from scalability issues when the number of tasks or trajectories per task grows large. One of the main reasons behind this limitation is the reliance on centralized solutions. Recent methods exploited the connection between MTRL and general consensus to propose scalable solutions with linear convergence guarantees. In this work, we improve over the state of the art by presenting a distributed solver for MTRL with quadratic convergence guarantees. Our algorithm exploits a novel connection between MTRL and Laplacian-based general consensus that leads to an efficient solver. We further extend our work to the lifelong setting, where we propose the first distributed lifelong MTRL solver that exhibits vanishing regret. We analyze both the theoretical and empirical properties of our method. In a set of extensive experiments, we also show that the novel algorithm outperforms the state of the art on a variety of dynamical systems, including a simulated humanoid robot.
dc.format.extent	1 online resource (ix, 77 leaves) ; illustrations
dc.language.iso	eng
dc.subject.classification	T:006736
dc.subject.lcsh	Reinforcement learning; Mathematical optimization; Distributed artificial intelligence; Multitasking (Computer science)
dc.title	Scalable distributed lifelong multi-task reinforcement learning
dc.type	Thesis
dc.contributor.department	Department of Computer Science
dc.contributor.faculty	Faculty of Arts and Sciences
dc.contributor.institution	American University of Beirut

Files in this item

Name: t-6736.pdf

Size: 3.563Mb

Format: PDF

This item appears in the following Collection(s)

AUB Students' Theses, Dissertations, and Projects [12709]
