comparison writeup/mlj_submission.tex @ 590:4672fb6b4385

Changé mlj_submission.tex pour merge
author fsavard
date Thu, 30 Sep 2010 17:54:56 -0400
parents 9a6abcf143e8
children 8bd4ff0c5c05
comparison
equal deleted inserted replaced
588:9a6abcf143e8 590:4672fb6b4385
127 It is also only recently that successful algorithms were proposed to 127 It is also only recently that successful algorithms were proposed to
128 overcome some of these difficulties. All are based on unsupervised 128 overcome some of these difficulties. All are based on unsupervised
129 learning, often in an greedy layer-wise ``unsupervised pre-training'' 129 learning, often in an greedy layer-wise ``unsupervised pre-training''
130 stage~\citep{Bengio-2009}. One of these layer initialization techniques, 130 stage~\citep{Bengio-2009}. One of these layer initialization techniques,
131 applied here, is the Denoising 131 applied here, is the Denoising
132 Auto-encoder~(DA)~\citep{VincentPLarochelleH2008} (see Figure~\ref{fig:da}), 132 Auto-encoder~(DA)~\citep{VincentPLarochelleH2008-very-small} (see Figure~\ref{fig:da}),
133 which 133 which
134 performed similarly or better than previously proposed Restricted Boltzmann 134 performed similarly or better than previously proposed Restricted Boltzmann
135 Machines in terms of unsupervised extraction of a hierarchy of features 135 Machines in terms of unsupervised extraction of a hierarchy of features
136 useful for classification. Each layer is trained to denoise its 136 useful for classification. Each layer is trained to denoise its
137 input, creating a layer of features that can be used as input for the next layer. 137 input, creating a layer of features that can be used as input for the next layer.
201 201
202 \begin{wrapfigure}[8]{l}{0.15\textwidth} 202 \begin{wrapfigure}[8]{l}{0.15\textwidth}
203 %\begin{minipage}[b]{0.14\linewidth} 203 %\begin{minipage}[b]{0.14\linewidth}
204 %\vspace*{-5mm} 204 %\vspace*{-5mm}
205 \begin{center} 205 \begin{center}
206 \includegraphics[scale=.4]{images/Original.png}\\ 206 \includegraphics[scale=.4]{Original.png}\\
207 {\bf Original} 207 {\bf Original}
208 \end{center} 208 \end{center}
209 \end{wrapfigure} 209 \end{wrapfigure}
210 %%\vspace{0.7cm} 210 %%\vspace{0.7cm}
211 %\end{minipage}% 211 %\end{minipage}%
238 %\begin{wrapfigure}[7]{l}{0.15\textwidth} 238 %\begin{wrapfigure}[7]{l}{0.15\textwidth}
239 \begin{minipage}[b]{0.14\linewidth} 239 \begin{minipage}[b]{0.14\linewidth}
240 %\centering 240 %\centering
241 \begin{center} 241 \begin{center}
242 \vspace*{-5mm} 242 \vspace*{-5mm}
243 \includegraphics[scale=.4]{images/Thick_only.png}\\ 243 \includegraphics[scale=.4]{Thick_only.png}\\
244 %{\bf Thickness} 244 %{\bf Thickness}
245 \end{center} 245 \end{center}
246 \vspace{.6cm} 246 \vspace{.6cm}
247 \end{minipage}% 247 \end{minipage}%
248 \hspace{0.3cm}\begin{minipage}[b]{0.86\linewidth} 248 \hspace{0.3cm}\begin{minipage}[b]{0.86\linewidth}
266 \subsubsection*{Slant} 266 \subsubsection*{Slant}
267 \vspace*{2mm} 267 \vspace*{2mm}
268 268
269 \begin{minipage}[b]{0.14\linewidth} 269 \begin{minipage}[b]{0.14\linewidth}
270 \centering 270 \centering
271 \includegraphics[scale=.4]{images/Slant_only.png}\\ 271 \includegraphics[scale=.4]{Slant_only.png}\\
272 %{\bf Slant} 272 %{\bf Slant}
273 \end{minipage}% 273 \end{minipage}%
274 \hspace{0.3cm} 274 \hspace{0.3cm}
275 \begin{minipage}[b]{0.83\linewidth} 275 \begin{minipage}[b]{0.83\linewidth}
276 %\centering 276 %\centering
288 288
289 \begin{minipage}[b]{0.14\linewidth} 289 \begin{minipage}[b]{0.14\linewidth}
290 %\centering 290 %\centering
291 %\begin{wrapfigure}[8]{l}{0.15\textwidth} 291 %\begin{wrapfigure}[8]{l}{0.15\textwidth}
292 \begin{center} 292 \begin{center}
293 \includegraphics[scale=.4]{images/Affine_only.png} 293 \includegraphics[scale=.4]{Affine_only.png}
294 \vspace*{6mm} 294 \vspace*{6mm}
295 %{\small {\bf Affine \mbox{Transformation}}} 295 %{\small {\bf Affine \mbox{Transformation}}}
296 \end{center} 296 \end{center}
297 %\end{wrapfigure} 297 %\end{wrapfigure}
298 \end{minipage}% 298 \end{minipage}%
318 %\hspace*{-8mm} 318 %\hspace*{-8mm}
319 \begin{minipage}[b]{0.14\linewidth} 319 \begin{minipage}[b]{0.14\linewidth}
320 %\centering 320 %\centering
321 \begin{center} 321 \begin{center}
322 \vspace*{5mm} 322 \vspace*{5mm}
323 \includegraphics[scale=.4]{images/Localelasticdistorsions_only.png} 323 \includegraphics[scale=.4]{Localelasticdistorsions_only.png}
324 %{\bf Local Elastic Deformation} 324 %{\bf Local Elastic Deformation}
325 \end{center} 325 \end{center}
326 %\end{wrapfigure} 326 %\end{wrapfigure}
327 \end{minipage}% 327 \end{minipage}%
328 \hspace{3mm} 328 \hspace{3mm}
345 \begin{minipage}[b]{0.14\linewidth} 345 \begin{minipage}[b]{0.14\linewidth}
346 %\centering 346 %\centering
347 %\begin{wrapfigure}[7]{l}{0.15\textwidth} 347 %\begin{wrapfigure}[7]{l}{0.15\textwidth}
348 %\vspace*{-5mm} 348 %\vspace*{-5mm}
349 \begin{center} 349 \begin{center}
350 \includegraphics[scale=.4]{images/Pinch_only.png}\\ 350 \includegraphics[scale=.4]{Pinch_only.png}\\
351 \vspace*{15mm} 351 \vspace*{15mm}
352 %{\bf Pinch} 352 %{\bf Pinch}
353 \end{center} 353 \end{center}
354 %\end{wrapfigure} 354 %\end{wrapfigure}
355 %%\vspace{.6cm} 355 %%\vspace{.6cm}
382 382
383 %%\vspace*{-.2cm} 383 %%\vspace*{-.2cm}
384 \begin{minipage}[t]{0.14\linewidth} 384 \begin{minipage}[t]{0.14\linewidth}
385 \centering 385 \centering
386 \vspace*{0mm} 386 \vspace*{0mm}
387 \includegraphics[scale=.4]{images/Motionblur_only.png} 387 \includegraphics[scale=.4]{Motionblur_only.png}
388 %{\bf Motion Blur} 388 %{\bf Motion Blur}
389 \end{minipage}% 389 \end{minipage}%
390 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} 390 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth}
391 %%\vspace*{.5mm} 391 %%\vspace*{.5mm}
392 \vspace*{2mm} 392 \vspace*{2mm}
403 \subsubsection*{Occlusion} 403 \subsubsection*{Occlusion}
404 404
405 \begin{minipage}[t]{0.14\linewidth} 405 \begin{minipage}[t]{0.14\linewidth}
406 \centering 406 \centering
407 \vspace*{3mm} 407 \vspace*{3mm}
408 \includegraphics[scale=.4]{images/occlusion_only.png}\\ 408 \includegraphics[scale=.4]{occlusion_only.png}\\
409 %{\bf Occlusion} 409 %{\bf Occlusion}
410 %%\vspace{.5cm} 410 %%\vspace{.5cm}
411 \end{minipage}% 411 \end{minipage}%
412 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} 412 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth}
413 %\vspace*{-18mm} 413 %\vspace*{-18mm}
430 %\vspace*{-6mm} 430 %\vspace*{-6mm}
431 \begin{minipage}[t]{0.14\linewidth} 431 \begin{minipage}[t]{0.14\linewidth}
432 \begin{center} 432 \begin{center}
433 %\centering 433 %\centering
434 \vspace*{6mm} 434 \vspace*{6mm}
435 \includegraphics[scale=.4]{images/Bruitgauss_only.png} 435 \includegraphics[scale=.4]{Bruitgauss_only.png}
436 %{\bf Gaussian Smoothing} 436 %{\bf Gaussian Smoothing}
437 \end{center} 437 \end{center}
438 %\end{wrapfigure} 438 %\end{wrapfigure}
439 %%\vspace{.5cm} 439 %%\vspace{.5cm}
440 \end{minipage}% 440 \end{minipage}%
466 \begin{minipage}[t]{0.14\textwidth} 466 \begin{minipage}[t]{0.14\textwidth}
467 %\begin{wrapfigure}[7]{l}{ 467 %\begin{wrapfigure}[7]{l}{
468 %\vspace*{-5mm} 468 %\vspace*{-5mm}
469 \begin{center} 469 \begin{center}
470 \vspace*{1mm} 470 \vspace*{1mm}
471 \includegraphics[scale=.4]{images/Permutpixel_only.png} 471 \includegraphics[scale=.4]{Permutpixel_only.png}
472 %{\small\bf Permute Pixels} 472 %{\small\bf Permute Pixels}
473 \end{center} 473 \end{center}
474 %\end{wrapfigure} 474 %\end{wrapfigure}
475 \end{minipage}% 475 \end{minipage}%
476 \hspace{3mm}\begin{minipage}[t]{0.86\linewidth} 476 \hspace{3mm}\begin{minipage}[t]{0.86\linewidth}
493 %%\vspace*{-3mm} 493 %%\vspace*{-3mm}
494 \begin{center} 494 \begin{center}
495 %\hspace*{-3mm}\begin{minipage}[t]{0.18\linewidth} 495 %\hspace*{-3mm}\begin{minipage}[t]{0.18\linewidth}
496 %\centering 496 %\centering
497 \vspace*{0mm} 497 \vspace*{0mm}
498 \includegraphics[scale=.4]{images/Distorsiongauss_only.png} 498 \includegraphics[scale=.4]{Distorsiongauss_only.png}
499 %{\small \bf Gauss. Noise} 499 %{\small \bf Gauss. Noise}
500 \end{center} 500 \end{center}
501 %\end{wrapfigure} 501 %\end{wrapfigure}
502 \end{minipage}% 502 \end{minipage}%
503 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} 503 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth}
515 515
516 \begin{minipage}[t]{\linewidth} 516 \begin{minipage}[t]{\linewidth}
517 \begin{minipage}[t]{0.14\linewidth} 517 \begin{minipage}[t]{0.14\linewidth}
518 \centering 518 \centering
519 \vspace*{0mm} 519 \vspace*{0mm}
520 \includegraphics[scale=.4]{images/background_other_only.png} 520 \includegraphics[scale=.4]{background_other_only.png}
521 %{\small \bf Bg Image} 521 %{\small \bf Bg Image}
522 \end{minipage}% 522 \end{minipage}%
523 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} 523 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth}
524 \vspace*{1mm} 524 \vspace*{1mm}
525 Following~\citet{Larochelle-jmlr-2009}, the {\bf background image} module adds a random 525 Following~\citet{Larochelle-jmlr-2009}, the {\bf background image} module adds a random
534 \subsubsection*{Salt and Pepper Noise} 534 \subsubsection*{Salt and Pepper Noise}
535 535
536 \begin{minipage}[t]{0.14\linewidth} 536 \begin{minipage}[t]{0.14\linewidth}
537 \centering 537 \centering
538 \vspace*{0mm} 538 \vspace*{0mm}
539 \includegraphics[scale=.4]{images/Poivresel_only.png} 539 \includegraphics[scale=.4]{Poivresel_only.png}
540 %{\small \bf Salt \& Pepper} 540 %{\small \bf Salt \& Pepper}
541 \end{minipage}% 541 \end{minipage}%
542 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} 542 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth}
543 \vspace*{1mm} 543 \vspace*{1mm}
544 The {\bf salt and pepper noise} module adds noise $\sim U[0,1]$ to random subsets of pixels. 544 The {\bf salt and pepper noise} module adds noise $\sim U[0,1]$ to random subsets of pixels.
556 %\begin{minipage}[t]{0.14\linewidth} 556 %\begin{minipage}[t]{0.14\linewidth}
557 %\centering 557 %\centering
558 \begin{center} 558 \begin{center}
559 \vspace*{4mm} 559 \vspace*{4mm}
560 %\hspace*{-1mm} 560 %\hspace*{-1mm}
561 \includegraphics[scale=.4]{images/Rature_only.png}\\ 561 \includegraphics[scale=.4]{Rature_only.png}\\
562 %{\bf Scratches} 562 %{\bf Scratches}
563 \end{center} 563 \end{center}
564 \end{minipage}% 564 \end{minipage}%
565 %\end{wrapfigure} 565 %\end{wrapfigure}
566 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} 566 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth}
582 \subsubsection*{Grey Level and Contrast Changes} 582 \subsubsection*{Grey Level and Contrast Changes}
583 583
584 \begin{minipage}[t]{0.15\linewidth} 584 \begin{minipage}[t]{0.15\linewidth}
585 \centering 585 \centering
586 \vspace*{0mm} 586 \vspace*{0mm}
587 \includegraphics[scale=.4]{images/Contrast_only.png} 587 \includegraphics[scale=.4]{Contrast_only.png}
588 %{\bf Grey Level \& Contrast} 588 %{\bf Grey Level \& Contrast}
589 \end{minipage}% 589 \end{minipage}%
590 \hspace{3mm}\begin{minipage}[t]{0.85\linewidth} 590 \hspace{3mm}\begin{minipage}[t]{0.85\linewidth}
591 \vspace*{1mm} 591 \vspace*{1mm}
592 The {\bf grey level and contrast} module changes the contrast by changing grey levels, and may invert the image polarity (white 592 The {\bf grey level and contrast} module changes the contrast by changing grey levels, and may invert the image polarity (white
789 deep architecture (whereby complex concepts are expressed as 789 deep architecture (whereby complex concepts are expressed as
790 compositions of simpler ones through a deep hierarchy). 790 compositions of simpler ones through a deep hierarchy).
791 791
792 \begin{figure}[ht] 792 \begin{figure}[ht]
793 %\vspace*{-2mm} 793 %\vspace*{-2mm}
794 \centerline{\resizebox{0.8\textwidth}{!}{\includegraphics{images/denoising_autoencoder_small.pdf}}} 794 \centerline{\resizebox{0.8\textwidth}{!}{\includegraphics{denoising_autoencoder_small.pdf}}}
795 %\vspace*{-2mm} 795 %\vspace*{-2mm}
796 \caption{Illustration of the computations and training criterion for the denoising 796 \caption{Illustration of the computations and training criterion for the denoising
797 auto-encoder used to pre-train each layer of the deep architecture. Input $x$ of 797 auto-encoder used to pre-train each layer of the deep architecture. Input $x$ of
798 the layer (i.e. raw input or output of previous layer) 798 the layer (i.e. raw input or output of previous layer)
799 s corrupted into $\tilde{x}$ and encoded into code $y$ by the encoder $f_\theta(\cdot)$. 799 s corrupted into $\tilde{x}$ and encoded into code $y$ by the encoder $f_\theta(\cdot)$.
838 838
839 %\vspace*{-1mm} 839 %\vspace*{-1mm}
840 840
841 \begin{figure}[ht] 841 \begin{figure}[ht]
842 %\vspace*{-2mm} 842 %\vspace*{-2mm}
843 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{images/error_rates_charts.pdf}}} 843 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{error_rates_charts.pdf}}}
844 %\vspace*{-3mm} 844 %\vspace*{-3mm}
845 \caption{SDAx are the {\bf deep} models. Error bars indicate a 95\% confidence interval. 0 indicates that the model was trained 845 \caption{SDAx are the {\bf deep} models. Error bars indicate a 95\% confidence interval. 0 indicates that the model was trained
846 on NIST, 1 on NISTP, and 2 on P07. Left: overall results 846 on NIST, 1 on NISTP, and 2 on P07. Left: overall results
847 of all models, on NIST and NISTP test sets. 847 of all models, on NIST and NISTP test sets.
848 Right: error rates on NIST test digits only, along with the previous results from 848 Right: error rates on NIST test digits only, along with the previous results from
853 \end{figure} 853 \end{figure}
854 854
855 855
856 \begin{figure}[ht] 856 \begin{figure}[ht]
857 %\vspace*{-3mm} 857 %\vspace*{-3mm}
858 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{images/improvements_charts.pdf}}} 858 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{improvements_charts.pdf}}}
859 %\vspace*{-3mm} 859 %\vspace*{-3mm}
860 \caption{Relative improvement in error rate due to self-taught learning. 860 \caption{Relative improvement in error rate due to self-taught learning.
861 Left: Improvement (or loss, when negative) 861 Left: Improvement (or loss, when negative)
862 induced by out-of-distribution examples (perturbed data). 862 induced by out-of-distribution examples (perturbed data).
863 Right: Improvement (or loss, when negative) induced by multi-task 863 Right: Improvement (or loss, when negative) induced by multi-task