ift6266: comparison of writeup/aistats2011_cameraready.tex @ 637:fe98896745a5
fitting
author | Yoshua Bengio <bengioy@iro.umontreal.ca>
date | Sat, 19 Mar 2011 23:07:03 -0400
parents | 83d53ffe3f25
children | 677d1b1d8158
636:83d53ffe3f25 | 637:fe98896745a5
523 where ${\rm sigm}(a)=1/(1+\exp(-a))$ | 523 where ${\rm sigm}(a)=1/(1+\exp(-a))$ |
524 and the reconstruction is obtained through the same transformation | 524 and the reconstruction is obtained through the same transformation |
525 \[ | 525 \[ |
526 z={\rm sigm}(d+V' y) | 526 z={\rm sigm}(d+V' y) |
527 \] | 527 \] |
528 but using the transpose of the encoder weights. | 528 using the transpose of encoder weights. |
529 We minimize the training | 529 The training |
530 set average of the cross-entropy | 530 set average of the cross-entropy |
531 reconstruction error | 531 reconstruction loss |
532 \[ | 532 \[ |
533 L_H(x,z)=\sum_i z_i \log x_i + (1-z_i) \log(1-x_i). | 533 L_H(x,z)=\sum_i z_i \log x_i + (1-z_i) \log(1-x_i) |
534 \] | 534 \] |
 | 535 is minimized. |
535 Here we use the random binary masking corruption | 536 Here we use the random binary masking corruption |
536 (which in $\tilde{x}$ sets to 0 a random subset of the elements of $x$, and | 537 (which in $\tilde{x}$ sets to 0 a random subset of the elements of $x$, and |
537 copies the rest). | 538 copies the rest). |
538 Once the first denoising auto-encoder is trained, its parameters can be used | 539 Once the first denoising auto-encoder is trained, its parameters can be used |
539 to set the first layer of the deep MLP. The original data are then processed | 540 to set the first layer of the deep MLP. The original data are then processed |
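Below this hunk, purely as an illustration and not code from the repository, is a minimal NumPy sketch of the denoising auto-encoder layer described above: masking corruption, a sigmoid encoder, a decoder tied to the transposed encoder weights, and an average cross-entropy reconstruction loss (written here in its usual binary form). All names, sizes, and the 20% corruption value are illustrative.

    import numpy as np

    rng = np.random.RandomState(0)

    def sigm(a):
        return 1.0 / (1.0 + np.exp(-a))

    def mask_corrupt(x, fraction, rng):
        """Set a random subset (`fraction`) of the elements of x to 0, copy the rest."""
        keep = rng.binomial(n=1, p=1.0 - fraction, size=x.shape)
        return x * keep

    def dae_forward(x, W, c, d, fraction, rng):
        x_tilde = mask_corrupt(x, fraction, rng)   # corrupted input \tilde{x}
        y = sigm(c + x_tilde.dot(W))               # hidden code
        z = sigm(d + y.dot(W.T))                   # reconstruction through the transposed weights
        eps = 1e-7                                 # numerical safety for the logs
        # average binary cross-entropy between the clean input and its reconstruction
        loss = -np.mean(np.sum(x * np.log(z + eps) + (1 - x) * np.log(1 - z + eps), axis=1))
        return y, z, loss

    # illustrative sizes: 784 inputs, 1000 hidden units, 20% masking corruption
    n_in, n_hid = 784, 1000
    W = rng.uniform(-0.01, 0.01, size=(n_in, n_hid))
    c, d = np.zeros(n_hid), np.zeros(n_in)
    x = rng.uniform(0.0, 1.0, size=(32, n_in))     # a small batch of inputs in [0, 1]
    y, z, loss = dae_forward(x, W, c, d, fraction=0.2, rng=rng)

    # After gradient-based training, W and c would initialize the first layer of the
    # deep MLP, and the codes y would feed the next auto-encoder in the stack.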
556 from the same above set). The fraction of inputs corrupted was selected | 557 from the same above set). The fraction of inputs corrupted was selected |
557 among $\{10\%, 20\%, 50\%\}$. Another hyper-parameter is the number | 558 among $\{10\%, 20\%, 50\%\}$. Another hyper-parameter is the number |
558 of hidden layers but it was fixed to 3 for our experiments, | 559 of hidden layers but it was fixed to 3 for our experiments, |
559 based on previous work with | 560 based on previous work with |
560 SDAs on MNIST~\citep{VincentPLarochelleH2008-very-small}. | 561 SDAs on MNIST~\citep{VincentPLarochelleH2008-very-small}. |
561 We also compared against 1 and against 2 hidden layers, in order | 562 We also compared against 1 and against 2 hidden layers, |
562 to disentangle the effect of depth from the effect of unsupervised | 563 to disentangle the effect of depth from that of unsupervised |
563 pre-training. | 564 pre-training. |
564 The size of the hidden | 565 The size of each hidden |
565 layers was kept constant across hidden layers, and the best results | 566 layer was kept constant across hidden layers, and the best results |
566 were obtained with the largest values that we could experiment | 567 were obtained with the largest values that we tried |
567 with given our patience, with 1000 hidden units. | 568 (1000 hidden units). |
568 | 569 |
569 %\vspace*{-1mm} | 570 %\vspace*{-1mm} |
570 | 571 |
571 \begin{figure*}[ht] | 572 \begin{figure*}[ht] |
572 %\vspace*{-2mm} | 573 %\vspace*{-2mm} |
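As a quick reference, the hyper-parameter choices listed in the second hunk can be summarized in the same illustrative style (hypothetical variable names, values taken from the text):

    corruption_fractions = [0.10, 0.20, 0.50]  # candidate fractions of masked inputs
    n_hidden_layers = 3                        # fixed, following prior SDA work on MNIST
    hidden_layer_size = 1000                   # same size for every hidden layer; largest value tried
    shallower_baselines = [1, 2]               # hidden-layer counts trained to isolate the effect of depth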