# HG changeset patch # User Olivier Delalleau # Date 1283522654 14400 # Node ID 6803b57e1da99333ee126b1d5ca997ac32027c50 # Parent dc5185cca21eacffb13567f6de4b4ce5c38b2ed7# Parent a5886b394bdabbb0e433d546717a220730b6eab6 Merged diff -r dc5185cca21e -r 6803b57e1da9 doc/v2_planning/dataset.txt --- a/doc/v2_planning/dataset.txt Fri Sep 03 10:04:01 2010 -0400 +++ b/doc/v2_planning/dataset.txt Fri Sep 03 10:04:14 2010 -0400 @@ -1,3 +1,14 @@ Discussion of Function Specification for Dataset Types ====================================================== +Some talking points from the September 2 meeting: + + * Datasets as views/tasks (Pascal Vincent's idea): our dataset specification + needs to be flexible enough to accommodate different (sub)tasks and views of + the same underlying data. + * Datasets as probability distributions from which one can sample. + * Our specification should allow transparent handling of infinite datasets (or + simply datasets which cannot fit in memory) + * GPU/buffering issues. + +Commiteee: DE, OB, OD, AB, PV diff -r dc5185cca21e -r 6803b57e1da9 doc/v2_planning/existing_python_ml_libraries.txt --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/doc/v2_planning/existing_python_ml_libraries.txt Fri Sep 03 10:04:14 2010 -0400 @@ -0,0 +1,18 @@ +Committee members: GD, DWF, IG, DE + +This committee will investigate the possibility of interfacing and/or +borrowing from other Python machine learning libraries that exist out there. +Some questions that we need to answer: + + * How much should we try to interface with other libraries? + * What parts can we and should we implement ourselves and what should we leave + to the other libraries? + +Preliminary list of libraries to look at: + + * Pybrain + * MDP + * Orange + * Shogun python bindings + * libsvm python bindings +