The Gradient: Perspectives on AI cover image

Kyunghyun Cho: Neural Machine Translation, Language, and Doing Good Science

The Gradient: Perspectives on AI

CHAPTER

How to Train Restrictive Boltzmann Machines on M-Nist

As I told you earlier, I was more into software engineering as well as a network engineering. My tools were largely C, C++ and when it comes to linear G-Brah metal lab. So I started to implement these Boltzmann machines, very basic version in metal lab. It took me about seven or eight months to realize that what everyone was doing is to use zero and one to represent a binary variable value. On the other hand, I was using minus one and one, which introduces a weird symmetry that as stochastic gradient descent has very, very difficult time break to break. And then that's how we got into this business of training with strict rules for machines and

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner