comparison doc/v2_planning/dataset.txt @ 1023:fb6cae14fd07

dataset: Comment about viewing a dataset as a distribution
author Olivier Delalleau <delallea@iro>
date Fri, 03 Sep 2010 16:30:50 -0400
parents a5886b394bda
children a154c9b68239
1020:53f6eb80abf1 1023:fb6cae14fd07
@@ -5,10 +5,15 @@
 
 * Datasets as views/tasks (Pascal Vincent's idea): our dataset specification
 needs to be flexible enough to accommodate different (sub)tasks and views of
 the same underlying data.
 * Datasets as probability distributions from which one can sample.
+* That's not something I would consider to be a dataset-related problem to
+tackle now: a probability distribution in Pylearn would probably be a
+different kind of beast, and it should be easy enough to have a
+DatasetToDistribution class for instance, that would take care of viewing a
+dataset as a probability distribution. -- OD
 * Our specification should allow transparent handling of infinite datasets (or
 simply datasets which cannot fit in memory)
 * GPU/buffering issues.
 
 Commiteee: DE, OB, OD, AB, PV
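
OD's comment names a DatasetToDistribution class but does not specify it. The following is only a minimal sketch of what such an adapter might look like. It assumes that a dataset is any finite, indexable sequence of examples and that "viewing it as a distribution" means sampling uniformly with replacement (the empirical distribution). The constructor signature, the `seed` parameter, and the `sample` method are hypothetical and are not taken from Pylearn.

```python
import random
from typing import Any, List, Optional, Sequence


class DatasetToDistribution:
    """Hypothetical adapter: view a finite dataset as its empirical distribution."""

    def __init__(self, dataset: Sequence[Any], seed: Optional[int] = None) -> None:
        # Assumption: the dataset supports len() and integer indexing.
        if len(dataset) == 0:
            raise ValueError("cannot sample from an empty dataset")
        self._dataset = dataset
        # Private RNG so that sampling is reproducible for a given seed.
        self._rng = random.Random(seed)

    def sample(self, n: int = 1) -> List[Any]:
        """Draw n examples uniformly at random, with replacement."""
        size = len(self._dataset)
        return [self._dataset[self._rng.randrange(size)] for _ in range(n)]


if __name__ == "__main__":
    examples = [(0.0, "a"), (1.0, "b"), (2.0, "c")]
    dist = DatasetToDistribution(examples, seed=0)
    print(dist.sample(4))
```

Because the adapter only reads from the dataset, the dataset interface itself needs no distribution-specific methods, which seems to be the point of the comment. This sketch relies on `len()`, so it would not cover the infinite or out-of-memory datasets listed in the next item.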