Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC

Sell, Torben; Singh, Sumeetpal S.

Statistics > Methodology

arXiv:2012.10943 (stat)

[Submitted on 20 Dec 2020 (v1), last revised 8 Sep 2022 (this version, v3)]

Title:Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC

Authors:Torben Sell, Sumeetpal S. Singh

View PDF

Abstract:This paper introduces a new neural network based prior for real valued functions on $\mathbb R^d$ which, by construction, is more easily and cheaply scaled up in the domain dimension $d$ compared to the usual Karhunen-Loève function space prior. The new prior is a Gaussian neural network prior, where each weight and bias has an independent Gaussian prior, but with the key difference that the variances decrease in the width of the network in such a way that the resulting function is \emph{almost surely} well defined in the limit of an infinite width network. We show that in a Bayesian treatment of inferring unknown functions, the induced posterior over functions is amenable to Monte Carlo sampling using Hilbert space Markov chain Monte Carlo (MCMC) methods. This type of MCMC is popular, e.g. in the Bayesian Inverse Problems literature, because it is stable under \emph{mesh refinement}, i.e. the acceptance probability does not shrink to $0$ as more parameters of the function's prior are introduced, even \emph{ad infinitum}. In numerical examples we demonstrate these stated competitive advantages over other function space priors. We also implement examples in Bayesian Reinforcement Learning to automate tasks from data and demonstrate, for the first time, stability of MCMC to mesh refinement for these type of problems.

Comments:	24 pages, 21 figures
Subjects:	Methodology (stat.ME); Computation (stat.CO); Machine Learning (stat.ML)
Cite as:	arXiv:2012.10943 [stat.ME]
	(or arXiv:2012.10943v3 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2012.10943

Submission history

From: Torben Sell [view email]
[v1] Sun, 20 Dec 2020 14:52:57 UTC (6,314 KB)
[v2] Sun, 31 Oct 2021 11:28:09 UTC (7,381 KB)
[v3] Thu, 8 Sep 2022 11:13:15 UTC (10,247 KB)

Statistics > Methodology

Title:Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators