% writeup/nips2010_submission.tex @ changeset 493:a194ce5a4249
% (parent 491:19eab4daf212, child 5764a2ae1fb5)
% Commit message: difference stat. sign.
% Author: Yoshua Bengio <bengioy@iro.umontreal.ca>
% Date: Tue, 01 Jun 2010 07:55:38 -0400
SDA2), along with the previous results on the digits NIST special database
19 test set from the literature, respectively based on ARTMAP neural
networks~\citep{Granger+al-2007}, fast nearest-neighbor
search~\citep{Cortes+al-2000}, MLPs~\citep{Oliveira+al-2002}, and
SVMs~\citep{Milgram+al-2005}. More detailed and complete numerical results
(figures and tables, including standard errors on the error rates) can be
found in the supplementary material. The three kinds of models differ in
the training sets used: NIST only (MLP0, SDA0), NISTP (MLP1, SDA1), or P07
(MLP2, SDA2). The deep learner not only outperforms the shallow ones and
previously published performance (in a statistically and qualitatively
significant way) but also reaches human performance on both the 62-class
task and the 10-class (digits) task. In addition, as shown in the left
panel of Figure~\ref{fig:fig:improvements-charts}, the relative improvement
in error rate brought by self-taught learning is greater for the SDA, and
these differences with the MLP are statistically and qualitatively
significant. The left side of the figure shows the improvement to the clean
NIST test set error brought by the use of out-of-distribution examples
(i.e.\ the perturbed examples from NISTP or P07).
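The standard errors and 95\% confidence intervals behind these comparisons can be recovered from raw test counts. A minimal sketch, assuming the usual normal approximation to the binomial; the function name and the example counts below are illustrative, not values from the paper:

```python
import math

def error_rate_ci(n_errors, n_test, z=1.96):
    """Error rate with a normal-approximation (binomial) confidence interval.

    Illustrative helper, not from the paper; z=1.96 gives an
    approximate 95% interval, as used for error bars.
    """
    p = n_errors / n_test
    se = math.sqrt(p * (1.0 - p) / n_test)  # standard error of the rate
    return p, (p - z * se, p + z * se)

# Made-up counts for illustration: 1850 errors on a 100000-example test set.
p, (lo, hi) = error_rate_ci(1850, 100000)
```

With test sets of this magnitude the interval is tight, which is why even modest differences between models can be statistically significant.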
The right side of Figure~\ref{fig:fig:improvements-charts} shows the
relative improvement brought by the use of a multi-task setting, in which
the same model is trained for more classes than the target classes of
interest (i.e.\ training with all 62 classes when the target classes are
respectively the digits, lower-case, or upper-case characters). Again,
whereas the gain from the multi-task setting is marginal or negative for
the MLP, it is substantial for the SDA. Note that for these multi-task
experiments, only the original NIST dataset is used. For example, the
MLP-digits bar shows the relative improvement in MLP error rate on the
NIST digits test set (1 - multi-task model's error / single-task model's
error). The single-task model is trained with only 10 outputs (one per
digit), seeing only digit examples, whereas the multi-task model is
trained with 62 outputs, with all 62 character classes as examples. Hence
the hidden units are shared across all tasks. For the multi-task model,
the digit error rate is measured by comparing the correct digit class with
the output class associated with the maximum conditional probability among
only the digit class outputs. The setting is similar for the other two
target classes (lower-case and upper-case characters).
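The restricted-argmax evaluation described in this paragraph can be sketched as follows. This is an illustrative reading, not the authors' code: it assumes the 10 digit outputs occupy indices 0-9 of the 62 output units, and it defines relative improvement so that it is positive when the multi-task error is lower.

```python
# Sketch of the multi-task digit evaluation: the 62-way model's
# prediction for a digit example is the argmax of its conditional
# probabilities restricted to the digit outputs only.

def digit_error_rate(probs, labels, digit_ids=tuple(range(10))):
    """probs: per-example 62-long probability vectors; labels: true digit ids.

    digit_ids (indices 0-9 here) is an assumption about the output layout.
    """
    errors = 0
    for p, y in zip(probs, labels):
        pred = max(digit_ids, key=lambda c: p[c])  # ignore the 52 letter outputs
        errors += int(pred != y)
    return errors / len(labels)

def relative_improvement(err_single, err_multi):
    """Relative improvement of the multi-task model over the single-task
    one; positive when the multi-task error is lower."""
    return 1.0 - err_multi / err_single
```

The MLP-digits bar, for instance, would correspond to `relative_improvement(err_single, err_multi)` with both error rates measured on the NIST digits test set.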

\begin{figure}[h]
\resizebox{.99\textwidth}{!}{\includegraphics{images/error_rates_charts.pdf}}\\
\caption{Left: overall results; error bars indicate a 95\% confidence interval.
Right: error rates on NIST test digits only, with results from the literature.}