(Training will download a ~100MB file.) (This only uses the first 50,000 images in MNIST, due to the GitHub file size limit.) (To train on all 60,000 images, run this locally and unzip mnist_train.zip which contains the full dataset.)