Towards Size-Independent Generalization Bounds for Deep Operator Nets

Gopalani, Pulkit; Karmakar, Sayar; Kumar, Dibyakanti; Mukherjee, Anirbit

(Submitted on 23 May 2022 (v1), last revised 22 Jan 2024 (this version, v2))

Abstract: Machine learning methods have recently made significant advances as tools for analyzing physical systems. A particularly active area within this theme is "physics-informed machine learning", which focuses on using neural nets to numerically solve differential equations. In this work, we aim to advance the theory of measuring out-of-sample error while training DeepONets, which are among the most versatile ways to solve PDE systems in one shot.
First, for a class of DeepONets, we prove a bound on their Rademacher complexity that does not explicitly scale with the width of the nets involved. Second, we use this bound to show how the Huber loss can be chosen so that, for these DeepONet classes, generalization error bounds can be obtained that have no explicit dependence on the size of the nets. We note that our theoretical results apply to any PDE targeted for solution by DeepONets.
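
For readers unfamiliar with the Huber loss mentioned above, the following is a minimal sketch of its standard textbook definition: quadratic for small residuals and linear for large ones. The function names and the default threshold `delta=1.0` are illustrative assumptions; the abstract does not state how the paper chooses this parameter, and this sketch is not the authors' implementation.

```python
def huber_loss(residual: float, delta: float = 1.0) -> float:
    """Standard Huber loss on a single residual.

    Quadratic (0.5 * r^2) when |r| <= delta, linear beyond that.
    delta=1.0 is a placeholder, not a value taken from the paper.
    """
    a = abs(residual)
    if a <= delta:
        return 0.5 * a * a
    return delta * (a - 0.5 * delta)


def mean_huber(predictions, targets, delta: float = 1.0) -> float:
    """Average Huber loss over paired predictions and targets (hypothetical helper)."""
    if len(predictions) != len(targets) or len(predictions) == 0:
        raise ValueError("predictions and targets must be non-empty and equal in length")
    total = sum(huber_loss(p - t, delta) for p, t in zip(predictions, targets))
    return total / len(predictions)
```

Because the loss grows only linearly for large residuals, its slope is bounded by `delta`, which is the kind of Lipschitz property that Rademacher-complexity arguments typically rely on.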

Comments:	27 pages, 5 figures; Added theorem on generalization error indicating benefits of training DeepONets on the Huber loss and corresponding experiments
Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as:	arXiv:2205.11359 [cs.LG]
	(or arXiv:2205.11359v2 [cs.LG] for this version)

Submission history

From: Pulkit Gopalani
[v1] Mon, 23 May 2022 14:45:34 GMT (76kb,D)
[v2] Mon, 22 Jan 2024 18:01:37 GMT (933kb,D)
