comparison doc/v2_planning/learner.txt @ 1058:e342de3ae485

v2planning learner - added comments and TODO points
author James Bergstra <bergstrj@iro.umontreal.ca>
date Thu, 09 Sep 2010 11:49:57 -0400
parents bc3f7834db83
children f082a6c0b008
etc.) Such a learner would replace synchronous instructions (return on completion) with
asynchronous ones (return after scheduling) and the active instruction set would also change
asynchronously, but neither of these things is inconsistent with the Learner API.


TODO - Experiment API?
~~~~~~~~~~~~~~~~~~~~~~

I feel like something is missing from the API - and that is an interface to the graph structure
discussed above. The nodes in this graph are natural places to store meta-information for
visualization, statistics-gathering etc. But none of the APIs above corresponds to the graph
itself. In other words, there is no API through which to attach information to nodes. It is
not good to say that the Learner instance *is* the node because (a) learner instances change
during graph exploration and (b) learner instances are big, and we don't want to have to keep a
whole saved model just to attach meta-info e.g. validation score. Choosing this API spills
over into other committees, so we should get their feedback about how to resolve
it. Maybe we need an 'Experiment' API to stand for this graph?
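A minimal sketch of what such an 'Experiment' graph might look like. All names below (`Experiment`, `Node`) are hypothetical, not part of any existing API; the point is only that meta-info such as a validation score hangs off a lightweight node object, not off the (large) learner instance itself.

```python
class Node(object):
    """A lightweight stand-in for one point in the learner-state graph."""
    def __init__(self, key):
        self.key = key    # e.g. a hash of the instruction sequence so far
        self.meta = {}    # meta-info: validation score, timing, etc.

class Experiment(object):
    """The graph itself: nodes plus edges labelled by instructions."""
    def __init__(self):
        self.nodes = {}   # key -> Node
        self.edges = []   # (from_key, instruction_name, to_key)

    def node(self, key):
        # Create the node lazily; no saved model is required to hold meta-info.
        if key not in self.nodes:
            self.nodes[key] = Node(key)
        return self.nodes[key]

    def add_edge(self, from_key, instruction, to_key):
        self.edges.append((from_key, instruction, to_key))

# Usage: record a validation score without keeping the whole model around.
exp = Experiment()
exp.add_edge('init', 'train(100)', 'state_1')
exp.node('state_1').meta['valid_error'] = 0.23
```

Whether the graph is built eagerly or reconstructed from logs is left open; the sketch only shows where the meta-info would live.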
TODO: Validation & Monitoring Costs
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Even if we do have the Experiment API as a structure on which to hang validation and
monitoring results, what should be the mechanism for extracting those results?
The Learner API is not right, because extracting a monitoring cost doesn't change
the model, doesn't change the legal instructions/edges, etc. Maybe we should use
a mechanism similar to Instruction, called something like Measurement? Any node
/ learner can report the list of instructions (for moving) and the list of
measurements (and the cost of computing them too).

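A sketch of what a 'Measurement' counterpart to Instruction could look like. Everything here is hypothetical (`Measurement`, `FakeLearner`); the idea is just that a node/learner advertises read-only measurements with an estimated cost, so that extracting a monitoring value never mutates the model or its legal instruction set.

```python
from collections import namedtuple

# name: identifier; cost: rough compute-cost estimate; compute: a thunk that
# returns the value without changing the learner's state.
Measurement = namedtuple('Measurement', ['name', 'cost', 'compute'])

class FakeLearner(object):
    """Toy learner used only to illustrate the proposed interface."""
    def __init__(self):
        self.n_updates = 0

    def instructions(self):
        # Instructions move the learner to a new node in the graph.
        return ['train(N)']

    def measurements(self):
        # Measurements are read-only; computing one leaves the node fixed.
        return [
            Measurement('n_updates', cost=0.0,
                        compute=lambda: self.n_updates),
            Measurement('valid_error', cost=10.0,
                        compute=lambda: 1.0 / (1 + self.n_updates)),
        ]

learner = FakeLearner()
learner.n_updates = 3
by_name = {m.name: m for m in learner.measurements()}
# Reading a measurement does not change the legal instruction set.
err = by_name['valid_error'].compute()
```

The cost field would let a driver decide whether an expensive measurement (e.g. a full validation pass) is worth computing at a given node.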
TODO - Parameter Distributions
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

YB asks: it seems to me that what we really need from "Type" is not just
testing that a value is legal, but more practically a function that specifies the
prior distribution for the hyper-parameter, i.e., how to sample from it,
and possibly some representation of it that could be used to infer
Having the min and max and default limits us to the uniform distribution,
which may not always be appropriate. For example sometimes we'd like
Gaussian (-infty to infty) or Exponential (0 to infty) or Poisson (non-negative integers).
For that reason, I think that "Type" is not a very good name.
How about "Prior" or "Density" or something like that?

JB replies: I agree that being able to choose (and update) distributions over
these values is important. I don't think the Type structure is the right place
to handle it, though. The challenge is to allow those distributions to change
for a variety of reasons - e.g. the sampling distribution on the capacity
variables is affected by the size of the dataset, and it is also affected by
previous experience in general as well as by experiments on that particular
dataset. I'm not sure that the 'Type' structure is right to deal with this.
Also, even with a strategy for handling these distributions, I believe a simple
mechanism for rejecting insane values might be useful.

So how should we handle it? Hmmm...
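One possible shape, sketched under assumptions (all names hypothetical): a `Prior` object owns sampling, as YB suggests, while a separate, cheap `is_sane` predicate rejects clearly-broken values, as JB suggests, independently of whatever distribution is currently in use. The distribution can then be swapped or re-fit from past experiments without touching the sanity check.

```python
import random

class Prior(object):
    """Pairs a sampling distribution with an independent sanity check."""
    def __init__(self, sample_fn, is_sane):
        self.sample_fn = sample_fn
        self.is_sane = is_sane

    def sample(self, rng):
        # Rejection loop: the distribution proposes, the sanity check disposes.
        while True:
            v = self.sample_fn(rng)
            if self.is_sane(v):
                return v

# Different hyper-parameters get different densities, not just (min, max):
learning_rate = Prior(
    sample_fn=lambda rng: rng.lognormvariate(-5.0, 1.0),   # positive, skewed
    is_sane=lambda v: 0.0 < v < 1.0)
n_hidden = Prior(
    sample_fn=lambda rng: int(rng.expovariate(1.0 / 500)),  # 0 to infty
    is_sane=lambda v: 1 <= v <= 100000)

rng = random.Random(0)
samples = [learning_rate.sample(rng) for _ in range(100)]
```

Updating a prior from experience would then amount to replacing `sample_fn`; the sanity check stays put. This is only one point in the design space, not a resolution of the question above.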


Comments
~~~~~~~~

OD asks: (I hope it's ok to leave comments even though I'm not in committee... I'm
interested to see how the learner interface is shaping up so I'll be keeping
an eye on this file)
I'm wondering what's the benefit of such an API compared to simply defining a