# HG changeset patch
# User Dumitru Erhan
# Date 1283522462 14400
# Node ID a5886b394bdabbb0e433d546717a220730b6eab6
# Parent 2e515be92a0e6091b85e9594a8aba2d677ca7571
Updating with talking points from Sept. 2 discussion

diff -r 2e515be92a0e -r a5886b394bda doc/v2_planning/dataset.txt
--- a/doc/v2_planning/dataset.txt Thu Sep 02 22:28:32 2010 -0400
+++ b/doc/v2_planning/dataset.txt Fri Sep 03 10:01:02 2010 -0400
@@ -1,3 +1,14 @@
 Discussion of Function Specification for Dataset Types
 ======================================================
 
+Some talking points from the September 2 meeting:
+
+ * Datasets as views/tasks (Pascal Vincent's idea): our dataset specification
+   needs to be flexible enough to accommodate different (sub)tasks and views of
+   the same underlying data.
+ * Datasets as probability distributions from which one can sample.
+ * Our specification should allow transparent handling of infinite datasets (or
+   simply datasets which cannot fit in memory)
+ * GPU/buffering issues.
+
+Committee: DE, OB, OD, AB, PV
diff -r 2e515be92a0e -r a5886b394bda doc/v2_planning/existing_python_ml_libraries.txt
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/doc/v2_planning/existing_python_ml_libraries.txt Fri Sep 03 10:01:02 2010 -0400
@@ -0,0 +1,18 @@
+Committee members: GD, DWF, IG, DE
+
+This committee will investigate the possibility of interfacing and/or
+borrowing from other Python machine learning libraries that exist out there.
+Some questions that we need to answer:
+
+ * How much should we try to interface with other libraries?
+ * What parts can we and should we implement ourselves and what should we leave
+   to the other libraries?
+
+Preliminary list of libraries to look at:
+
+ * Pybrain
+ * MDP
+ * Orange
+ * Shogun python bindings
+ * libsvm python bindings
+