Exercise:Self-Taught Learning

From Ufldl

Jump to: navigation, search
Line 8: Line 8:
To demonstrate the effectiveness of this method, we will train the sparse autoencoder on an unlabeled data set comprised out of all the digits 0 to 9, and then test them on the digits 5 to 9. The purpose of this is to demonstrate that self-taught learning can be surprisingly effective in improving results even if some data set items do not fall within the classes of our classification task.
To demonstrate the effectiveness of this method, we will train the sparse autoencoder on an unlabeled data set comprised out of all the digits 0 to 9, and then test them on the digits 5 to 9. The purpose of this is to demonstrate that self-taught learning can be surprisingly effective in improving results even if some data set items do not fall within the classes of our classification task.
 +
 +
=== Support Code/Data ===
 +
 +
The following additional files are required for this exercise:
 +
* [http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz MNIST Dataset (Training Images)]
 +
* [http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz MNIST Dataset (Training Labels)]
 +
* [[Using the MNIST Dataset | Support functions for loading MNIST in Matlab ]]
 +
This exercise has dependencies on the previous exercises:
 +
* [[Exercise:Sparse Autoencoder]]
 +
* [[Exercise:Vectorization]]
 +
* [[Exercise:Softmax Regression]]
===Step One: Generate the input and test data sets===
===Step One: Generate the input and test data sets===

Revision as of 01:23, 6 May 2011

Personal tools