hIPPYLearn : an inexact Newton-CG method for training neural networks with analysis of the Hessian
dc.contributor.advisor | Ghattas, Omar N. | |
dc.contributor.committeeMember | Dawson, Clint | |
dc.creator | Gao, Ge, 1993- | |
dc.creator.orcid | 0000-0001-5033-1279 | |
dc.date.accessioned | 2017-11-02T13:30:03Z | |
dc.date.available | 2017-11-02T13:30:03Z | |
dc.date.created | 2017-05 | |
dc.date.issued | 2017-05 | |
dc.date.submitted | May 2017 | |
dc.date.updated | 2017-11-02T13:30:03Z | |
dc.description.abstract | Neural networks, as part of deep learning, have become extremely popular due to their ability to extract information from data and to generalize it to new, unseen inputs. Neural networks have contributed to progress on many classic problems. For example, in natural language processing, the use of neural networks has significantly improved the accuracy of parsing natural language sentences [11]. However, training complex neural networks is expensive and time-consuming. In this paper, we introduce more efficient methods for training neural networks using Newton-type optimization algorithms. Specifically, we use TensorFlow, the machine learning package developed by Google [2], to define the structure of the neural network and the loss function that we want to optimize. TensorFlow's automatic differentiation capabilities allow us to efficiently compute the gradient and the Hessian of the loss function, which are needed by the scalable numerical optimization algorithms implemented in hIPPYlib [12]. Numerical examples demonstrate that the Newton method outperforms the steepest descent method, both in number of iterations and in computational time. Another important contribution of this work is the study of the spectral properties of the Hessian of the loss function. The distribution of the eigenvalues of the Hessian provides valuable information about which directions in parameter space are well informed by the data. | |
dc.description.department | Computational Science, Engineering, and Mathematics | |
dc.format.mimetype | application/pdf | |
dc.identifier | doi:10.15781/T2599ZH9S | |
dc.identifier.uri | http://hdl.handle.net/2152/62383 | |
dc.language.iso | en | |
dc.subject | Machine learning | |
dc.subject | Neural networks | |
dc.subject | Optimization problem | |
dc.subject | MNIST | |
dc.title | hIPPYLearn : an inexact Newton-CG method for training neural networks with analysis of the Hessian | |
dc.type | Thesis | |
dc.type.material | text | |
thesis.degree.department | Computational Science, Engineering, and Mathematics | |
thesis.degree.discipline | Computational Science, Engineering, and Mathematics | |
thesis.degree.grantor | The University of Texas at Austin | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Computational Science, Engineering, and Mathematics |
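
The inexact Newton-CG idea named in the title and abstract can be illustrated with a minimal sketch. This is not the thesis code and it uses neither TensorFlow nor hIPPYlib: it is plain Python, with an L2-regularized logistic regression on a tiny made-up dataset standing in for a neural network loss. The forcing term `eta`, the Armijo constant, the tolerances, and every function name are assumptions chosen for illustration. The point it shows is that the Newton system is solved only approximately by conjugate gradients, using Hessian-vector products instead of an assembled Hessian.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def axpy(alpha, x, y):
    """Return alpha * x + y for plain Python lists."""
    return [alpha * a + b for a, b in zip(x, y)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss(w, X, y, lam):
    """Mean logistic loss plus (lam / 2) * ||w||^2."""
    total = 0.0
    for xi, yi in zip(X, y):
        z = dot(xi, w)
        # log(1 + exp(z)) - y * z, written in a numerically stable form
        total += max(z, 0.0) + math.log1p(math.exp(-abs(z))) - yi * z
    return total / len(X) + 0.5 * lam * dot(w, w)

def gradient(w, X, y, lam):
    g = [lam * wj for wj in w]
    for xi, yi in zip(X, y):
        g = axpy((sigmoid(dot(xi, w)) - yi) / len(X), xi, g)
    return g

def hessian_vector(w, v, X, lam):
    """Hessian-vector product H(w) v, computed without forming H."""
    hv = [lam * vj for vj in v]
    for xi in X:
        p = sigmoid(dot(xi, w))
        hv = axpy(p * (1.0 - p) * dot(xi, v) / len(X), xi, hv)
    return hv

def cg_solve(hess_vec, g, tol):
    """Approximately solve H d = -g; stop once ||H d + g|| <= tol."""
    d = [0.0] * len(g)
    r = [-gj for gj in g]          # residual of H d = -g at d = 0
    p = list(r)
    rr = dot(r, r)
    for _ in range(10 * len(g)):
        if math.sqrt(rr) <= tol:
            break
        hp = hess_vec(p)
        curv = dot(p, hp)
        if curv <= 0.0:
            # Non-positive curvature (possible for non-convex losses):
            # keep the current iterate, or fall back to -g on the first step.
            return d if any(d) else p
        alpha = rr / curv
        d = axpy(alpha, p, d)
        r = axpy(-alpha, hp, r)
        rr_new = dot(r, r)
        p = axpy(rr_new / rr, p, r)
        rr = rr_new
    return d

def newton_cg(w, X, y, lam, gtol=1e-6, max_iter=50):
    for it in range(max_iter):
        g = gradient(w, X, y, lam)
        gnorm = math.sqrt(dot(g, g))
        if gnorm <= gtol:
            return w, it
        # Assumed forcing term: loose CG solves far from the minimum,
        # tighter solves as the gradient norm shrinks.
        eta = min(0.5, math.sqrt(gnorm))
        d = cg_solve(lambda v: hessian_vector(w, v, X, lam), g, eta * gnorm)
        # Armijo backtracking line search along the Newton direction.
        f0, slope, step = loss(w, X, y, lam), dot(g, d), 1.0
        while (loss(axpy(step, d, w), X, y, lam) > f0 + 1e-4 * step * slope
               and step > 1e-10):
            step *= 0.5
        w = axpy(step, d, w)
    return w, max_iter

# Tiny illustrative dataset (made up); the first column is a bias term.
X = [[1.0, 0.5, 1.2], [1.0, -1.5, 0.3], [1.0, 2.0, -0.7],
     [1.0, -0.3, -1.1], [1.0, 1.1, 0.9], [1.0, -2.2, 0.4]]
y = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0]
lam = 0.1

w_opt, iters = newton_cg([0.0, 0.0, 0.0], X, y, lam)
print("iterations:", iters, "loss:", loss(w_opt, X, y, lam))
```

In a setting like the one the abstract describes, the hand-written `gradient` and `hessian_vector` would presumably be replaced by quantities obtained through automatic differentiation; the outer loop structure would stay the same.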