Second Order Trust Region Optimization Methods for Training Neural Networks: Beyond Inexact Newton

dc.contributor.advisorTurkiyyah, George
dc.contributor.authorLomer, Kyle
dc.contributor.departmentDepartment of Computer Science
dc.contributor.facultyFaculty of Arts and Sciences
dc.contributor.institutionAmerican University of Beirut
dc.date2020
dc.date.accessioned2020-09-22T14:26:59Z
dc.date.available2020-09-22T14:26:59Z
dc.date.issued9/22/2020
dc.descriptionDr Shady Elbassuoni Dr Izzat El Hajj
dc.description.abstractSecond order optimization methods have always been less widely used for training neural networks than first order methods such as Stochastic Gradient Descent. This is mainly due to the complexity and high costs in terms of both processor and memory resources of second order methods. In recent years more work has been done to adapt these methods to make them more suitable for training neural networks. In this paper we demonstrate how trust region methods can be used to improve the convergence and cost-effectiveness of second order optimization. This is achieved by only using cheap first order information when it is an appropriate approximation for the expensive second order information, based on the relative size of the trust region. We also present techniques to automatically tune the hyperparameters these methods introduce; including a novel approach to adaptive regularization. These methods are demonstrated on autoencoders and image classifiers in comparison to first order methods.
dc.identifier.urihttp://hdl.handle.net/10938/21953
dc.language.isoen
dc.subjectComputer Science
dc.subjectOptimization
dc.subjectNumerical Methods
dc.subjectMachine Learning
dc.subjectNeural Networks
dc.titleSecond Order Trust Region Optimization Methods for Training Neural Networks: Beyond Inexact Newton
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ThesisKyleLomerSigned.pdf
Size:
3.48 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: