Mercurial > ift6266
comparison writeup/mlj_submission.tex @ 590:4672fb6b4385
Changé mlj_submission.tex pour merge
author | fsavard |
---|---|
date | Thu, 30 Sep 2010 17:54:56 -0400 |
parents | 9a6abcf143e8 |
children | 8bd4ff0c5c05 |
comparison
equal
deleted
inserted
replaced
588:9a6abcf143e8 | 590:4672fb6b4385 |
---|---|
127 It is also only recently that successful algorithms were proposed to | 127 It is also only recently that successful algorithms were proposed to |
128 overcome some of these difficulties. All are based on unsupervised | 128 overcome some of these difficulties. All are based on unsupervised |
129 learning, often in an greedy layer-wise ``unsupervised pre-training'' | 129 learning, often in an greedy layer-wise ``unsupervised pre-training'' |
130 stage~\citep{Bengio-2009}. One of these layer initialization techniques, | 130 stage~\citep{Bengio-2009}. One of these layer initialization techniques, |
131 applied here, is the Denoising | 131 applied here, is the Denoising |
132 Auto-encoder~(DA)~\citep{VincentPLarochelleH2008} (see Figure~\ref{fig:da}), | 132 Auto-encoder~(DA)~\citep{VincentPLarochelleH2008-very-small} (see Figure~\ref{fig:da}), |
133 which | 133 which |
134 performed similarly or better than previously proposed Restricted Boltzmann | 134 performed similarly or better than previously proposed Restricted Boltzmann |
135 Machines in terms of unsupervised extraction of a hierarchy of features | 135 Machines in terms of unsupervised extraction of a hierarchy of features |
136 useful for classification. Each layer is trained to denoise its | 136 useful for classification. Each layer is trained to denoise its |
137 input, creating a layer of features that can be used as input for the next layer. | 137 input, creating a layer of features that can be used as input for the next layer. |
201 | 201 |
202 \begin{wrapfigure}[8]{l}{0.15\textwidth} | 202 \begin{wrapfigure}[8]{l}{0.15\textwidth} |
203 %\begin{minipage}[b]{0.14\linewidth} | 203 %\begin{minipage}[b]{0.14\linewidth} |
204 %\vspace*{-5mm} | 204 %\vspace*{-5mm} |
205 \begin{center} | 205 \begin{center} |
206 \includegraphics[scale=.4]{images/Original.png}\\ | 206 \includegraphics[scale=.4]{Original.png}\\ |
207 {\bf Original} | 207 {\bf Original} |
208 \end{center} | 208 \end{center} |
209 \end{wrapfigure} | 209 \end{wrapfigure} |
210 %%\vspace{0.7cm} | 210 %%\vspace{0.7cm} |
211 %\end{minipage}% | 211 %\end{minipage}% |
238 %\begin{wrapfigure}[7]{l}{0.15\textwidth} | 238 %\begin{wrapfigure}[7]{l}{0.15\textwidth} |
239 \begin{minipage}[b]{0.14\linewidth} | 239 \begin{minipage}[b]{0.14\linewidth} |
240 %\centering | 240 %\centering |
241 \begin{center} | 241 \begin{center} |
242 \vspace*{-5mm} | 242 \vspace*{-5mm} |
243 \includegraphics[scale=.4]{images/Thick_only.png}\\ | 243 \includegraphics[scale=.4]{Thick_only.png}\\ |
244 %{\bf Thickness} | 244 %{\bf Thickness} |
245 \end{center} | 245 \end{center} |
246 \vspace{.6cm} | 246 \vspace{.6cm} |
247 \end{minipage}% | 247 \end{minipage}% |
248 \hspace{0.3cm}\begin{minipage}[b]{0.86\linewidth} | 248 \hspace{0.3cm}\begin{minipage}[b]{0.86\linewidth} |
266 \subsubsection*{Slant} | 266 \subsubsection*{Slant} |
267 \vspace*{2mm} | 267 \vspace*{2mm} |
268 | 268 |
269 \begin{minipage}[b]{0.14\linewidth} | 269 \begin{minipage}[b]{0.14\linewidth} |
270 \centering | 270 \centering |
271 \includegraphics[scale=.4]{images/Slant_only.png}\\ | 271 \includegraphics[scale=.4]{Slant_only.png}\\ |
272 %{\bf Slant} | 272 %{\bf Slant} |
273 \end{minipage}% | 273 \end{minipage}% |
274 \hspace{0.3cm} | 274 \hspace{0.3cm} |
275 \begin{minipage}[b]{0.83\linewidth} | 275 \begin{minipage}[b]{0.83\linewidth} |
276 %\centering | 276 %\centering |
288 | 288 |
289 \begin{minipage}[b]{0.14\linewidth} | 289 \begin{minipage}[b]{0.14\linewidth} |
290 %\centering | 290 %\centering |
291 %\begin{wrapfigure}[8]{l}{0.15\textwidth} | 291 %\begin{wrapfigure}[8]{l}{0.15\textwidth} |
292 \begin{center} | 292 \begin{center} |
293 \includegraphics[scale=.4]{images/Affine_only.png} | 293 \includegraphics[scale=.4]{Affine_only.png} |
294 \vspace*{6mm} | 294 \vspace*{6mm} |
295 %{\small {\bf Affine \mbox{Transformation}}} | 295 %{\small {\bf Affine \mbox{Transformation}}} |
296 \end{center} | 296 \end{center} |
297 %\end{wrapfigure} | 297 %\end{wrapfigure} |
298 \end{minipage}% | 298 \end{minipage}% |
318 %\hspace*{-8mm} | 318 %\hspace*{-8mm} |
319 \begin{minipage}[b]{0.14\linewidth} | 319 \begin{minipage}[b]{0.14\linewidth} |
320 %\centering | 320 %\centering |
321 \begin{center} | 321 \begin{center} |
322 \vspace*{5mm} | 322 \vspace*{5mm} |
323 \includegraphics[scale=.4]{images/Localelasticdistorsions_only.png} | 323 \includegraphics[scale=.4]{Localelasticdistorsions_only.png} |
324 %{\bf Local Elastic Deformation} | 324 %{\bf Local Elastic Deformation} |
325 \end{center} | 325 \end{center} |
326 %\end{wrapfigure} | 326 %\end{wrapfigure} |
327 \end{minipage}% | 327 \end{minipage}% |
328 \hspace{3mm} | 328 \hspace{3mm} |
345 \begin{minipage}[b]{0.14\linewidth} | 345 \begin{minipage}[b]{0.14\linewidth} |
346 %\centering | 346 %\centering |
347 %\begin{wrapfigure}[7]{l}{0.15\textwidth} | 347 %\begin{wrapfigure}[7]{l}{0.15\textwidth} |
348 %\vspace*{-5mm} | 348 %\vspace*{-5mm} |
349 \begin{center} | 349 \begin{center} |
350 \includegraphics[scale=.4]{images/Pinch_only.png}\\ | 350 \includegraphics[scale=.4]{Pinch_only.png}\\ |
351 \vspace*{15mm} | 351 \vspace*{15mm} |
352 %{\bf Pinch} | 352 %{\bf Pinch} |
353 \end{center} | 353 \end{center} |
354 %\end{wrapfigure} | 354 %\end{wrapfigure} |
355 %%\vspace{.6cm} | 355 %%\vspace{.6cm} |
382 | 382 |
383 %%\vspace*{-.2cm} | 383 %%\vspace*{-.2cm} |
384 \begin{minipage}[t]{0.14\linewidth} | 384 \begin{minipage}[t]{0.14\linewidth} |
385 \centering | 385 \centering |
386 \vspace*{0mm} | 386 \vspace*{0mm} |
387 \includegraphics[scale=.4]{images/Motionblur_only.png} | 387 \includegraphics[scale=.4]{Motionblur_only.png} |
388 %{\bf Motion Blur} | 388 %{\bf Motion Blur} |
389 \end{minipage}% | 389 \end{minipage}% |
390 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} | 390 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} |
391 %%\vspace*{.5mm} | 391 %%\vspace*{.5mm} |
392 \vspace*{2mm} | 392 \vspace*{2mm} |
403 \subsubsection*{Occlusion} | 403 \subsubsection*{Occlusion} |
404 | 404 |
405 \begin{minipage}[t]{0.14\linewidth} | 405 \begin{minipage}[t]{0.14\linewidth} |
406 \centering | 406 \centering |
407 \vspace*{3mm} | 407 \vspace*{3mm} |
408 \includegraphics[scale=.4]{images/occlusion_only.png}\\ | 408 \includegraphics[scale=.4]{occlusion_only.png}\\ |
409 %{\bf Occlusion} | 409 %{\bf Occlusion} |
410 %%\vspace{.5cm} | 410 %%\vspace{.5cm} |
411 \end{minipage}% | 411 \end{minipage}% |
412 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} | 412 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} |
413 %\vspace*{-18mm} | 413 %\vspace*{-18mm} |
430 %\vspace*{-6mm} | 430 %\vspace*{-6mm} |
431 \begin{minipage}[t]{0.14\linewidth} | 431 \begin{minipage}[t]{0.14\linewidth} |
432 \begin{center} | 432 \begin{center} |
433 %\centering | 433 %\centering |
434 \vspace*{6mm} | 434 \vspace*{6mm} |
435 \includegraphics[scale=.4]{images/Bruitgauss_only.png} | 435 \includegraphics[scale=.4]{Bruitgauss_only.png} |
436 %{\bf Gaussian Smoothing} | 436 %{\bf Gaussian Smoothing} |
437 \end{center} | 437 \end{center} |
438 %\end{wrapfigure} | 438 %\end{wrapfigure} |
439 %%\vspace{.5cm} | 439 %%\vspace{.5cm} |
440 \end{minipage}% | 440 \end{minipage}% |
466 \begin{minipage}[t]{0.14\textwidth} | 466 \begin{minipage}[t]{0.14\textwidth} |
467 %\begin{wrapfigure}[7]{l}{ | 467 %\begin{wrapfigure}[7]{l}{ |
468 %\vspace*{-5mm} | 468 %\vspace*{-5mm} |
469 \begin{center} | 469 \begin{center} |
470 \vspace*{1mm} | 470 \vspace*{1mm} |
471 \includegraphics[scale=.4]{images/Permutpixel_only.png} | 471 \includegraphics[scale=.4]{Permutpixel_only.png} |
472 %{\small\bf Permute Pixels} | 472 %{\small\bf Permute Pixels} |
473 \end{center} | 473 \end{center} |
474 %\end{wrapfigure} | 474 %\end{wrapfigure} |
475 \end{minipage}% | 475 \end{minipage}% |
476 \hspace{3mm}\begin{minipage}[t]{0.86\linewidth} | 476 \hspace{3mm}\begin{minipage}[t]{0.86\linewidth} |
493 %%\vspace*{-3mm} | 493 %%\vspace*{-3mm} |
494 \begin{center} | 494 \begin{center} |
495 %\hspace*{-3mm}\begin{minipage}[t]{0.18\linewidth} | 495 %\hspace*{-3mm}\begin{minipage}[t]{0.18\linewidth} |
496 %\centering | 496 %\centering |
497 \vspace*{0mm} | 497 \vspace*{0mm} |
498 \includegraphics[scale=.4]{images/Distorsiongauss_only.png} | 498 \includegraphics[scale=.4]{Distorsiongauss_only.png} |
499 %{\small \bf Gauss. Noise} | 499 %{\small \bf Gauss. Noise} |
500 \end{center} | 500 \end{center} |
501 %\end{wrapfigure} | 501 %\end{wrapfigure} |
502 \end{minipage}% | 502 \end{minipage}% |
503 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} | 503 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} |
515 | 515 |
516 \begin{minipage}[t]{\linewidth} | 516 \begin{minipage}[t]{\linewidth} |
517 \begin{minipage}[t]{0.14\linewidth} | 517 \begin{minipage}[t]{0.14\linewidth} |
518 \centering | 518 \centering |
519 \vspace*{0mm} | 519 \vspace*{0mm} |
520 \includegraphics[scale=.4]{images/background_other_only.png} | 520 \includegraphics[scale=.4]{background_other_only.png} |
521 %{\small \bf Bg Image} | 521 %{\small \bf Bg Image} |
522 \end{minipage}% | 522 \end{minipage}% |
523 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} | 523 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} |
524 \vspace*{1mm} | 524 \vspace*{1mm} |
525 Following~\citet{Larochelle-jmlr-2009}, the {\bf background image} module adds a random | 525 Following~\citet{Larochelle-jmlr-2009}, the {\bf background image} module adds a random |
534 \subsubsection*{Salt and Pepper Noise} | 534 \subsubsection*{Salt and Pepper Noise} |
535 | 535 |
536 \begin{minipage}[t]{0.14\linewidth} | 536 \begin{minipage}[t]{0.14\linewidth} |
537 \centering | 537 \centering |
538 \vspace*{0mm} | 538 \vspace*{0mm} |
539 \includegraphics[scale=.4]{images/Poivresel_only.png} | 539 \includegraphics[scale=.4]{Poivresel_only.png} |
540 %{\small \bf Salt \& Pepper} | 540 %{\small \bf Salt \& Pepper} |
541 \end{minipage}% | 541 \end{minipage}% |
542 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} | 542 \hspace{0.3cm}\begin{minipage}[t]{0.83\linewidth} |
543 \vspace*{1mm} | 543 \vspace*{1mm} |
544 The {\bf salt and pepper noise} module adds noise $\sim U[0,1]$ to random subsets of pixels. | 544 The {\bf salt and pepper noise} module adds noise $\sim U[0,1]$ to random subsets of pixels. |
556 %\begin{minipage}[t]{0.14\linewidth} | 556 %\begin{minipage}[t]{0.14\linewidth} |
557 %\centering | 557 %\centering |
558 \begin{center} | 558 \begin{center} |
559 \vspace*{4mm} | 559 \vspace*{4mm} |
560 %\hspace*{-1mm} | 560 %\hspace*{-1mm} |
561 \includegraphics[scale=.4]{images/Rature_only.png}\\ | 561 \includegraphics[scale=.4]{Rature_only.png}\\ |
562 %{\bf Scratches} | 562 %{\bf Scratches} |
563 \end{center} | 563 \end{center} |
564 \end{minipage}% | 564 \end{minipage}% |
565 %\end{wrapfigure} | 565 %\end{wrapfigure} |
566 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} | 566 \hspace{0.3cm}\begin{minipage}[t]{0.86\linewidth} |
582 \subsubsection*{Grey Level and Contrast Changes} | 582 \subsubsection*{Grey Level and Contrast Changes} |
583 | 583 |
584 \begin{minipage}[t]{0.15\linewidth} | 584 \begin{minipage}[t]{0.15\linewidth} |
585 \centering | 585 \centering |
586 \vspace*{0mm} | 586 \vspace*{0mm} |
587 \includegraphics[scale=.4]{images/Contrast_only.png} | 587 \includegraphics[scale=.4]{Contrast_only.png} |
588 %{\bf Grey Level \& Contrast} | 588 %{\bf Grey Level \& Contrast} |
589 \end{minipage}% | 589 \end{minipage}% |
590 \hspace{3mm}\begin{minipage}[t]{0.85\linewidth} | 590 \hspace{3mm}\begin{minipage}[t]{0.85\linewidth} |
591 \vspace*{1mm} | 591 \vspace*{1mm} |
592 The {\bf grey level and contrast} module changes the contrast by changing grey levels, and may invert the image polarity (white | 592 The {\bf grey level and contrast} module changes the contrast by changing grey levels, and may invert the image polarity (white |
789 deep architecture (whereby complex concepts are expressed as | 789 deep architecture (whereby complex concepts are expressed as |
790 compositions of simpler ones through a deep hierarchy). | 790 compositions of simpler ones through a deep hierarchy). |
791 | 791 |
792 \begin{figure}[ht] | 792 \begin{figure}[ht] |
793 %\vspace*{-2mm} | 793 %\vspace*{-2mm} |
794 \centerline{\resizebox{0.8\textwidth}{!}{\includegraphics{images/denoising_autoencoder_small.pdf}}} | 794 \centerline{\resizebox{0.8\textwidth}{!}{\includegraphics{denoising_autoencoder_small.pdf}}} |
795 %\vspace*{-2mm} | 795 %\vspace*{-2mm} |
796 \caption{Illustration of the computations and training criterion for the denoising | 796 \caption{Illustration of the computations and training criterion for the denoising |
797 auto-encoder used to pre-train each layer of the deep architecture. Input $x$ of | 797 auto-encoder used to pre-train each layer of the deep architecture. Input $x$ of |
798 the layer (i.e. raw input or output of previous layer) | 798 the layer (i.e. raw input or output of previous layer) |
799 s corrupted into $\tilde{x}$ and encoded into code $y$ by the encoder $f_\theta(\cdot)$. | 799 s corrupted into $\tilde{x}$ and encoded into code $y$ by the encoder $f_\theta(\cdot)$. |
838 | 838 |
839 %\vspace*{-1mm} | 839 %\vspace*{-1mm} |
840 | 840 |
841 \begin{figure}[ht] | 841 \begin{figure}[ht] |
842 %\vspace*{-2mm} | 842 %\vspace*{-2mm} |
843 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{images/error_rates_charts.pdf}}} | 843 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{error_rates_charts.pdf}}} |
844 %\vspace*{-3mm} | 844 %\vspace*{-3mm} |
845 \caption{SDAx are the {\bf deep} models. Error bars indicate a 95\% confidence interval. 0 indicates that the model was trained | 845 \caption{SDAx are the {\bf deep} models. Error bars indicate a 95\% confidence interval. 0 indicates that the model was trained |
846 on NIST, 1 on NISTP, and 2 on P07. Left: overall results | 846 on NIST, 1 on NISTP, and 2 on P07. Left: overall results |
847 of all models, on NIST and NISTP test sets. | 847 of all models, on NIST and NISTP test sets. |
848 Right: error rates on NIST test digits only, along with the previous results from | 848 Right: error rates on NIST test digits only, along with the previous results from |
853 \end{figure} | 853 \end{figure} |
854 | 854 |
855 | 855 |
856 \begin{figure}[ht] | 856 \begin{figure}[ht] |
857 %\vspace*{-3mm} | 857 %\vspace*{-3mm} |
858 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{images/improvements_charts.pdf}}} | 858 \centerline{\resizebox{.99\textwidth}{!}{\includegraphics{improvements_charts.pdf}}} |
859 %\vspace*{-3mm} | 859 %\vspace*{-3mm} |
860 \caption{Relative improvement in error rate due to self-taught learning. | 860 \caption{Relative improvement in error rate due to self-taught learning. |
861 Left: Improvement (or loss, when negative) | 861 Left: Improvement (or loss, when negative) |
862 induced by out-of-distribution examples (perturbed data). | 862 induced by out-of-distribution examples (perturbed data). |
863 Right: Improvement (or loss, when negative) induced by multi-task | 863 Right: Improvement (or loss, when negative) induced by multi-task |