Mercurial > pylearn
view doc/v2_planning/dataset.txt @ 1034:564c069134c2
added more software & links
author | Dumitru Erhan <dumitru.erhan@gmail.com> |
---|---|
date | Tue, 07 Sep 2010 10:58:21 -0400 |
parents | a154c9b68239 |
children | 1b61cbe0810b |
line wrap: on
line source
Discussion of Function Specification for Dataset Types ====================================================== Some talking points from the September 2 meeting: * Datasets as views/tasks (Pascal Vincent's idea): our dataset specification needs to be flexible enough to accommodate different (sub)tasks and views of the same underlying data. * Datasets as probability distributions from which one can sample. * That's not something I would consider to be a dataset-related problem to tackle now: a probability distribution in Pylearn would probably be a different kind of beast, and it should be easy enough to have a DatasetToDistribution class for instance, that would take care of viewing a dataset as a probability distribution. -- OD * Our specification should allow transparent handling of infinite datasets (or simply datasets which cannot fit in memory) * GPU/buffering issues. Commiteee: DE, OB, OD, AB, PV Leader: DE