We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Mutual information for fitting deep nonlinear models

Authors: Jacob S. Hunter (1), Nathan O. Hodas (1) ((1) Pacific Northwest National Laboratory)
Abstract: Deep nonlinear models pose a challenge for fitting parameters due to lack of knowledge of the hidden layer and the potentially non-affine relation of the initial and observed layers. In the present work we investigate the use of information theoretic measures such as mutual information and Kullback-Leibler (KL) divergence as objective functions for fitting such models without knowledge of the hidden layer. We investigate one model as a proof of concept and one application of cogntive performance. We further investigate the use of optimizers with these methods. Mutual information is largely successful as an objective, depending on the parameters. KL divergence is found to be similarly succesful, given some knowledge of the statistics of the hidden layer.
Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Report number: PNNL-SA-121434
Cite as: arXiv:1612.05708 [math.OC]
  (or arXiv:1612.05708v1 [math.OC] for this version)

Submission history

From: Jacob Hunter [view email]
[v1] Sat, 17 Dec 2016 05:26:46 GMT (18609kb,D)

Link back to: arXiv, form interface, contact.