Apr 7–8, 2025
Perimeter Institute for Theoretical Physics
America/Toronto timezone

Colloquium: Boltzmann Machines

Apr 7, 2025, 3:30 p.m.
1h
PI/1-100 - Theatre (Perimeter Institute for Theoretical Physics)

Speaker

Geoffrey Hinton (University of Toronto)

Description

The standard way to train a neural network is to use the chain rule to backpropagate error gradients through its layers of neurons. I shall briefly review a few of the engineering successes of backpropagation and then describe a very different way of getting the gradients that, for a while, seemed a lot more plausible as a model of how the brain gets them.
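In symbols, a standard textbook statement of the rule the talk refers to (not quoted from the abstract): for a layered network with pre-activations $z^l = W^l a^{l-1}$, activations $a^l = f(z^l)$, and cost $C$, the chain rule propagates error signals backwards:

\[
\delta^L = \nabla_{a^L} C \odot f'(z^L), \qquad
\delta^l = \big( (W^{l+1})^{\top} \delta^{l+1} \big) \odot f'(z^l), \qquad
\frac{\partial C}{\partial W^l} = \delta^l \, (a^{l-1})^{\top}.
\]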

Consider a system of binary neurons that can be active or inactive, with weighted pairwise couplings between the neurons, including long-range couplings. If the neurons represent pixels in a binary image, we can store a set of binary training images by adjusting the coupling weights so that the images are local minima of a Hopfield energy function, which is minus the sum, over all pairs of active neurons, of their coupling weights. But this energy function can only capture pairwise correlations; it cannot represent the kinds of complicated higher-order correlations that occur in images.

Now suppose that in addition to the "visible" neurons that represent the pixel intensities, we also have a large set of hidden neurons that have weighted couplings with each other and with the visible neurons. Suppose also that all of the neurons are asynchronous and stochastic: each neuron adopts the active state with log odds equal to the difference in the energy function between the neuron being inactive and being active. Given a set of training images, is there a simple way to set the weights on all of the couplings so that the training images are local minima of the free energy function obtained by integrating out the states of the hidden neurons?

The Boltzmann machine learning algorithm solved this problem in an elegant way. It was a proof of principle that learning in neural networks with hidden neurons was possible using only locally available information, contrary to what was generally believed at the time.
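In symbols, a standard formulation consistent with the description above (not quoted from the abstract): for binary states $s_i \in \{0, 1\}$ and symmetric weights $w_{ij}$,

\[
E(\mathbf{s}) = -\sum_{i<j} w_{ij}\, s_i s_j, \qquad
p(s_i = 1) = \sigma(\Delta E_i), \quad
\Delta E_i = E(s_i{=}0) - E(s_i{=}1) = \sum_j w_{ij} s_j,
\]

where $\sigma(x) = 1/(1 + e^{-x})$. The free energy of a visible vector $\mathbf{v}$ integrates out the hidden states $\mathbf{h}$,

\[
F(\mathbf{v}) = -\log \sum_{\mathbf{h}} e^{-E(\mathbf{v}, \mathbf{h})},
\]

and the Boltzmann machine learning rule follows the gradient of the log likelihood using only local pairwise statistics:

\[
\Delta w_{ij} \propto \langle s_i s_j \rangle_{\text{data}} - \langle s_i s_j \rangle_{\text{model}}.
\]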

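As a concrete illustration, here is a minimal Python sketch of the two-phase learning procedure. Everything specific in it is an assumption made for illustration: the 3x3 toy images, the number of hidden units, the chain lengths, and the learning rate. A faithful implementation of the original algorithm would also anneal the sampling temperature and run much longer chains.

```python
import numpy as np

rng = np.random.default_rng(0)

n_vis, n_hid = 9, 4          # visible (pixel) and hidden unit counts (illustrative)
n = n_vis + n_hid
W = 0.01 * rng.standard_normal((n, n))
W = (W + W.T) / 2.0          # symmetric couplings
np.fill_diagonal(W, 0.0)     # no self-couplings

def gibbs_sweep(s, clamp_visible):
    # One asynchronous sweep: each unit turns on with probability sigmoid(gap),
    # where gap_i = E(s_i=0) - E(s_i=1) = sum_j w_ij s_j.
    for i in rng.permutation(n):
        if clamp_visible and i < n_vis:
            continue         # visible units are held at the data in the positive phase
        gap = W[i] @ s
        s[i] = float(rng.random() < 1.0 / (1.0 + np.exp(-gap)))
    return s

def pairwise_stats(data=None, n_chains=10, sweeps=20):
    # Estimate <s_i s_j> with visibles clamped to data (positive phase)
    # or with the network running freely (negative phase).
    stats = np.zeros((n, n))
    cases = len(data) if data is not None else n_chains
    for c in range(cases):
        s = np.zeros(n)
        if data is not None:
            s[:n_vis] = data[c]
        s[n_vis:] = (rng.random(n_hid) < 0.5).astype(float)
        for _ in range(sweeps):
            s = gibbs_sweep(s, clamp_visible=data is not None)
        stats += np.outer(s, s)
    return stats / cases

# Two toy 3x3 "training images": a horizontal bar and a vertical bar.
data = np.array([[1, 1, 1, 0, 0, 0, 0, 0, 0],
                 [1, 0, 0, 1, 0, 0, 1, 0, 0]], dtype=float)

lr = 0.05
for epoch in range(200):
    positive = pairwise_stats(data=data)   # hidden units settle around clamped data
    negative = pairwise_stats(data=None)   # free-running "model" statistics
    W += lr * (positive - negative)        # learning from locally available statistics
    W = (W + W.T) / 2.0                    # keep couplings symmetric
    np.fill_diagonal(W, 0.0)               # and free of self-loops
```
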
Presentation materials

There are no materials yet.
