Backtracking Gradient Descent Method and some Applications in Large Scale Optimisation Part 1: Theory

Authors

  • Tuyen Trung Truong, Hang-Tuan Nguyen

Keywords:

Backtracking, deep learning, global convergence, gradient descent, line search method, optimisation, random dynamical systems.

Abstract

Deep Neural Networks (DNN) are essential in many realistic applications, including Data Science. At the core of DNN is numerical optimisation, in particular gradient descent methods (GD). The purpose of this paper is twofold. First, we prove some new results on the backtracking variant of GD under very general situations. Second, we present a comprehensive comparison of our new results to the previously known results in the literature, discussing the pros and cons of these methods. To illustrate the efficiency of Backtracking line search, we present some experimental results (on validation accuracy, training time and so on) on CIFAR10, based on implementations developed in another paper by the authors. Source codes for the experiments are available on GitHub.
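The backtracking variant of GD referred to in the abstract is, in its classical form, gradient descent with an Armijo-type line search: the step size is shrunk geometrically until a sufficient-decrease condition holds. The sketch below is a minimal illustration of that standard scheme, not the authors' exact method; the function names and the parameters delta0, alpha, beta are illustrative choices.

```python
import numpy as np

def backtracking_gd(f, grad_f, x0, delta0=1.0, alpha=0.5, beta=0.7,
                    tol=1e-8, max_iter=10000):
    """Gradient descent with classical Armijo backtracking line search.

    This is a generic textbook sketch (hypothetical helper, not the
    paper's implementation): at each iterate, the trial step delta is
    multiplied by beta until the Armijo sufficient-decrease condition
        f(x - delta * g) <= f(x) - alpha * delta * ||g||^2
    is satisfied, then the gradient step is taken.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:
            break  # gradient small enough: treat as converged
        delta = delta0
        # Shrink the step until the Armijo condition holds.
        while f(x - delta * g) > f(x) - alpha * delta * np.dot(g, g):
            delta *= beta
        x = x - delta * g
    return x

# Example: minimise f(x) = (x - 3)^2, whose unique minimiser is x = 3.
xmin = backtracking_gd(lambda x: float((x[0] - 3.0) ** 2),
                       lambda x: np.array([2.0 * (x[0] - 3.0)]),
                       np.array([10.0]))
```

Unlike fixed-step GD, no prior knowledge of a Lipschitz constant for the gradient is needed here: the line search adapts the step size automatically, which is one motivation for studying backtracking schemes in very general situations.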

Downloads

Download data is not yet available.

Published

2022-01-05

How to Cite

Backtracking Gradient Descent Method and some Applications in Large Scale Optimisation Part 1: Theory. (2022). Minimax Theory and Its Applications, 7(1), 79–108. https://journalmta.com/index.php/jmta/article/view/175