# HG changeset patch # User Olivier Delalleau # Date 1283545850 14400 # Node ID fb6cae14fd0755e544f35b9237320d3371785e5d # Parent 53f6eb80abf11b68ea0deb8b9625e6c3cf724c32 dataset: Comment about viewing a dataset as a distribution diff -r 53f6eb80abf1 -r fb6cae14fd07 doc/v2_planning/dataset.txt --- a/doc/v2_planning/dataset.txt Fri Sep 03 16:20:10 2010 -0400 +++ b/doc/v2_planning/dataset.txt Fri Sep 03 16:30:50 2010 -0400 @@ -7,6 +7,11 @@ needs to be flexible enough to accommodate different (sub)tasks and views of the same underlying data. * Datasets as probability distributions from which one can sample. + * That's not something I would consider to be a dataset-related problem to + tackle now: a probability distribution in Pylearn would probably be a + different kind of beast, and it should be easy enough to have a + DatasetToDistribution class for instance, that would take care of viewing a + dataset as a probability distribution. -- OD * Our specification should allow transparent handling of infinite datasets (or simply datasets which cannot fit in memory) * GPU/buffering issues.