Mercurial > ift6266
diff writeup/nips2010_submission.tex @ 479:6593e67381a3
Added transformation figure
author | Xavier Glorot <glorotxa@iro.umontreal.ca> |
---|---|
date | Sun, 30 May 2010 18:54:36 -0400 |
parents | db28764b8252 |
children | 150203d2b5c3 |
line wrap: on
line diff
--- a/writeup/nips2010_submission.tex Sun May 30 17:49:14 2010 -0400 +++ b/writeup/nips2010_submission.tex Sun May 30 18:54:36 2010 -0400 @@ -277,6 +277,19 @@ \end{figure} +\begin{figure}[h] +\resizebox{.99\textwidth}{!}{\includegraphics{images/transfo.png}}\\ +\caption{Illustration of each transformation applied to the same image +of the upper-case h (upper-left image). first row (from left to rigth) : original image, slant, +thickness, affine transformation, local elastic deformation; second row (from left to rigth) : +pinch, motion blur, occlusion, pixel permutation, gaussian noise; third row (from left to rigth) : +background image, salt and pepper noise, spatially gaussian noise, scratches, +color and contrast changes.} +\label{fig:transfo} +\end{figure} + + + \section{Experimental Setup} Whereas much previous work on deep learning algorithms had been performed on @@ -302,7 +315,7 @@ The fourth series, $hsf_4$, experimentally recognized to be the most difficult one is recommended by NIST as testing set and is used in our work and some previous work~\cite{Granger+al-2007,Cortes+al-2000,Oliveira+al-2002,Milgram+al-2005} for that purpose. We randomly split the remainder into a training set and a validation set for -model selection. The sizes of these data sets are: XXX for training, XXX for validation, +model selection. The sizes of these data sets are: for training, XXX for validation, and XXX for testing. The performances reported by previous work on that dataset mostly use only the digits. Here we use all the classes both in the training and testing phase. This is especially @@ -312,7 +325,14 @@ of letters in the test set, not in the training set (more like the natural distribution of letters in text). -\item {\bf Fonts} TODO!!! +\item {\bf Fonts} +In order to have a good variety of sources we downloaded an important number of free fonts from: {\tt http://anonymous.url.net} +%real adress {\tt http://cg.scs.carleton.ca/~luc/freefonts.html} +in addition to Windows 7's, this adds up to a total of $9817$ different fonts that we can choose uniformly. +The ttf file is either used as input of the Captcha generator (see next item) or, by producing a corresponding image, +directly as input to our models. + + \item {\bf Captchas} The Captcha data source is an adaptation of the \emph{pycaptcha} library (a python based captcha generator library) for