Mercurial repository: ift6266
comparison: writeup/nips2010_submission.tex @ 532:2e33885730cf
description: changes to charts.ods
author   | Yoshua Bengio <bengioy@iro.umontreal.ca> |
date     | Tue, 01 Jun 2010 21:19:54 -0400 |
parents  | 4354c3c8f49c |
children | 22d5cd82d5f0 |
comparison of 529:4354c3c8f49c (old) and 532:2e33885730cf (new):
@@ -83,16 +83,17 @@
 stochastic gradient descent.

 Self-taught learning~\citep{RainaR2007} is a paradigm that combines principles
 of semi-supervised and multi-task learning: the learner can exploit examples
 that are unlabeled and/or come from a distribution different from the target
-distribution, e.g., from other classes that those of interest. Whereas
-it has already been shown that deep learners can clearly take advantage of
-unsupervised learning and unlabeled examples~\citep{Bengio-2009,WestonJ2008-small}
-and multi-task learning, not much has been done yet to explore the impact
+distribution, e.g., from other classes that those of interest.
+It has already been shown that deep learners can clearly take advantage of
+unsupervised learning and unlabeled examples~\citep{Bengio-2009,WestonJ2008-small},
+but more needs to be done to explore the impact
 of {\em out-of-distribution} examples and of the multi-task setting
-(but see~\citep{CollobertR2008}). In particular the {\em relative
+(one exception is~\citep{CollobertR2008}, but using very different kinds
+of learning algorithms). In particular the {\em relative
 advantage} of deep learning for this settings has not been evaluated.
 The hypothesis explored here is that a deep hierarchy of features
 may be better able to provide sharing of statistical strength
 between different regions in input space or different tasks,
 as discussed in the conclusion.
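The self-taught learning setting discussed in this hunk can be summarized with a minimal sketch in the paper's own LaTeX (the notation below is illustrative and does not appear in the submission): the learner has a small labeled set from the target distribution plus a large unlabeled set that may come from other classes, learns a representation from the unlabeled data, and then fits a classifier on top of it.

\begin{align*}
D_{\mathrm{L}} &= \{(x_i, y_i)\}_{i=1}^{n} \sim P_{\mathrm{target}} && \text{(small labeled target-task set)} \\
D_{\mathrm{U}} &= \{x_j\}_{j=1}^{m} \sim P_{\mathrm{other}},\ m \gg n && \text{(large unlabeled, possibly out-of-distribution set)}
\end{align*}
% learn a feature map f on D_U without using any labels (e.g. by stacking
% denoising auto-encoders), then train a classifier g on the transformed
% labeled examples:
\begin{equation*}
\hat{y}(x) = g(f(x)), \qquad f \ \text{learned on}\ D_{\mathrm{U}}, \qquad g \ \text{trained on}\ \{(f(x_i), y_i)\}_{i=1}^{n}.
\end{equation*}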
@@ -508,11 +509,11 @@
 deep architecture (whereby complex concepts are expressed as
 compositions of simpler ones through a deep hierarchy).
 Here we chose to use the Denoising
 Auto-Encoder~\citep{VincentPLarochelleH2008} as the building block for
 these deep hierarchies of features, as it is very simple to train and
-teach (see Figure~\ref{fig:da}, as well as
+explain (see Figure~\ref{fig:da}, as well as
 tutorial and code there: {\tt http://deeplearning.net/tutorial}),
 provides immediate and efficient inference, and yielded results
 comparable or better than RBMs in series of experiments
 \citep{VincentPLarochelleH2008}. During training, a Denoising
 Auto-Encoder is presented with a stochastically corrupted version
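The denoising auto-encoder training step mentioned at the end of this hunk can be sketched as follows, in the spirit of \citep{VincentPLarochelleH2008} (illustrative notation, not copied from the submission): the clean input is stochastically corrupted, encoded and decoded, and the reconstruction is scored against the uncorrupted input.

% s(.) denotes the logistic sigmoid; W, W', b, b' are the learned parameters.
\begin{align*}
\tilde{x} &\sim q(\tilde{x} \mid x) && \text{stochastic corruption (e.g.\ randomly masking input components)} \\
h &= s(W \tilde{x} + b) && \text{encoder} \\
\hat{x} &= s(W' h + b') && \text{decoder} \\
\mathcal{L}(x, \hat{x}) &= -\textstyle\sum_k \left[ x_k \log \hat{x}_k + (1 - x_k) \log (1 - \hat{x}_k) \right] && \text{reconstruction loss on the \emph{clean} } x
\end{align*}
with the parameters updated by stochastic gradient descent on $\mathcal{L}$.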