The dataset is part of MNIST from kaggle Digit Recognizer competition. "train.format" is the train set, which has been binarized. "test.format" is the test set, which has been binarized.