Abstract:
Second-order optimization methods have long been less widely used for training neural networks than first-order methods such as Stochastic Gradient Descent. This is mainly due to the complexity of second-order methods and their high cost in both compute and memory. In recent years, more work has been done to adapt these methods and make them better suited to training neural networks. In this paper we demonstrate how trust region methods can be used to improve the convergence and cost-effectiveness of second-order optimization. This is achieved by using cheap first-order information only when, judged by the relative size of the trust region, it is an adequate approximation to the expensive second-order information. We also present techniques to automatically tune the hyperparameters these methods introduce, including a novel approach to adaptive regularization. We demonstrate these methods on autoencoders and image classifiers, comparing them against first-order methods.