comparison writeup/nips2010_submission.tex @ 482:ce69aa9204d8
Title change and abstract rewrite
author | Yoshua Bengio <bengioy@iro.umontreal.ca> |
---|---|
date | Mon, 31 May 2010 13:59:11 -0400 |
parents | 150203d2b5c3 |
children | b9cdb464de5f |
481:3e4290448eeb | 482:ce69aa9204d8 |
---|---|
5 \usepackage{algorithm,algorithmic} | 5 \usepackage{algorithm,algorithmic} |
6 \usepackage[utf8]{inputenc} | 6 \usepackage[utf8]{inputenc} |
7 \usepackage{graphicx,subfigure} | 7 \usepackage{graphicx,subfigure} |
8 \usepackage[numbers]{natbib} | 8 \usepackage[numbers]{natbib} |
9 | 9 |
10 \title{Generating and Exploiting Perturbed and Multi-Task Handwritten Training Data for Deep Architectures} | 10 \title{Deep Self-Taught Learning for Handwritten Character Recognition} |
11 \author{The IFT6266 Gang} | 11 \author{The IFT6266 Gang} |
12 | 12 |
13 \begin{document} | 13 \begin{document} |
14 | 14 |
15 %\makeanontitle | 15 %\makeanontitle |
16 \maketitle | 16 \maketitle |
17 | 17 |
18 \begin{abstract} | 18 \begin{abstract} |
19 Recent theoretical and empirical work in statistical machine learning has | 19 Recent theoretical and empirical work in statistical machine learning has |
20 demonstrated the importance of learning algorithms for deep | 20 demonstrated the importance of learning algorithms for deep |
21 architectures, i.e., function classes obtained by composing multiple | 21 architectures, i.e., function classes obtained by composing multiple |
22 non-linear transformations. In the area of handwriting recognition, | 22 non-linear transformations. Self-taught learning (exploiting unlabeled |
23 deep learning algorithms | 23 examples or examples from other distributions) has already been applied |
24 had been evaluated on rather small datasets with a few tens of thousands | 24 to deep learners, but mostly to show the advantage of unlabeled |
25 of examples. Here we propose a powerful generator of variations | 25 examples. Here we explore the advantage brought by {\em out-of-distribution |
26 of examples for character images based on a pipeline of stochastic | 26 examples} and show that {\em deep learners benefit more from them than a |
27 transformations that include not only the usual affine transformations | 27 corresponding shallow learner}, in the area |
28 but also the addition of slant, local elastic deformations, changes | 28 of handwritten character recognition. In fact, we show that they reach |
29 in thickness, background images, color, contrast, occlusion, and | 29 human-level performance on both handwritten digit classification and |
30 various types of pixel and spatially correlated noise. | 30 62-class handwritten character recognition. For this purpose we |
31 We evaluate a deep learning algorithm (Stacked Denoising Autoencoders) | 31 developed a powerful generator of stochastic variations and noise |
32 on the task of learning to classify digits and letters transformed | 32 processes for character images, including not only affine transformations but |
33 with this pipeline, using the hundreds of millions of generated examples | 33 also slant, local elastic deformations, changes in thickness, background |
34 and testing on the full 62-class NIST test set. | 34 images, color, contrast, occlusion, and various types of pixel and |
35 We find that the SDA outperforms its | 35 spatially correlated noise. The out-of-distribution examples are |
36 shallow counterpart, an ordinary Multi-Layer Perceptron, | 36 obtained by training with these highly distorted images or |
37 and that it is better able to take advantage of the additional | 37 by including object classes different from those in the target test set. |
38 generated data, as well as better able to take advantage of | |
39 the multi-task setting, i.e., | |
40 training from more classes than those of interest in the end. | |
41 In fact, we find that the SDA reaches human performance as | |
42 estimated by the Amazon Mechanical Turk on the 62-class NIST test characters. | |
43 \end{abstract} | 38 \end{abstract} |
44 | 39 |
45 \section{Introduction} | 40 \section{Introduction} |
46 | 41 |
47 Deep Learning has emerged as a promising new area of research in | 42 Deep Learning has emerged as a promising new area of research in |
277 \end{figure} | 272 \end{figure} |
278 | 273 |
279 | 274 |
280 \begin{figure}[h] | 275 \begin{figure}[h] |
281 \resizebox{.99\textwidth}{!}{\includegraphics{images/transfo.png}}\\ | 276 \resizebox{.99\textwidth}{!}{\includegraphics{images/transfo.png}}\\ |
282 \caption{Illustration of each transformation applied to the same image | 277 \caption{Illustration of each transformation applied alone to the same image |
283 of the upper-case h (upper-left image). first row (from left to right) : original image, slant, | 278 of an upper-case h (top left). First row (from left to right) : original image, slant, |
284 thickness, affine transformation, local elastic deformation; second row (from left to right) : | 279 thickness, affine transformation, local elastic deformation; second row (from left to right) : |
285 pinch, motion blur, occlusion, pixel permutation, gaussian noise; third row (from left to right) : | 280 pinch, motion blur, occlusion, pixel permutation, Gaussian noise; third row (from left to right) : |
286 background image, salt and pepper noise, spatially gaussian noise, scratches, | 281 background image, salt and pepper noise, spatially Gaussian noise, scratches, |
287 color and contrast changes.} | 282 color and contrast changes.} |
288 \label{fig:transfo} | 283 \label{fig:transfo} |
289 \end{figure} | 284 \end{figure} |
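To make the figure's pipeline concrete, here is a minimal NumPy sketch of what such a stochastic perturbation generator might look like. This is not the paper's actual implementation; the function names (`slant`, `occlude`, `salt_and_pepper`, `perturb`), the probability knob, and all parameter ranges are invented for illustration, and only three of the fifteen transformations shown in the figure are sketched.

```python
import numpy as np

def slant(img, factor):
    """Shear each row horizontally in proportion to its height (crude slant)."""
    h, w = img.shape
    out = np.zeros_like(img)
    for y in range(h):
        shift = int(round(factor * (h - 1 - y)))
        shift = max(-w, min(w, shift))
        if shift >= 0:
            out[y, shift:] = img[y, :w - shift]
        else:
            out[y, :w + shift] = img[y, -shift:]
    return out

def occlude(img, size, rng):
    """Black out a randomly placed square patch."""
    out = img.copy()
    h, w = out.shape
    y = int(rng.integers(0, h - size + 1))
    x = int(rng.integers(0, w - size + 1))
    out[y:y + size, x:x + size] = 0
    return out

def salt_and_pepper(img, p, rng):
    """Flip a fraction p of the pixels to pure black or pure white."""
    out = img.copy()
    mask = rng.random(img.shape) < p
    out[mask] = rng.integers(0, 2, size=int(mask.sum())) * 255
    return out

def perturb(img, rng, complexity=0.5):
    """Apply each transformation independently, with probability `complexity`
    and a randomly drawn severity -- the spirit of the paper's pipeline."""
    if rng.random() < complexity:
        img = slant(img, rng.uniform(-0.3, 0.3))
    if rng.random() < complexity:
        img = occlude(img, size=8, rng=rng)
    if rng.random() < complexity:
        img = salt_and_pepper(img, p=0.05, rng=rng)
    return img
```

Because every transformation draws its own severity and fires independently, chaining them over millions of source characters yields the kind of highly varied, arbitrarily large training set the abstract describes.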
290 | 285 |
291 | 286 |