pylearn: comparison of doc/v2_planning/api_optimization.txt @ 1100:153cf820a975

v2planning - updates to api_optimization

author:   James Bergstra <bergstrj@iro.umontreal.ca>
date:     Mon, 13 Sep 2010 13:41:53 -0400
parents:  16ea3e5c5a7a
children: e7c52923f122
comparison of 1099:0b666177f725 and 1100:153cf820a975; the text below is the
updated revision, with unchanged context elided ([...]).

[...]

The API includes an iterative interface based on Theano, and a one-shot
interface, similar to SciPy and MATLAB, that is based on Python and Numpy and
only uses Theano for the implementation.


Theano Interface
----------------

The Theano interface to optimization algorithms is to ask the optimizer for a
dictionary of updates that can be used in theano.function.  Implementations of
iterative optimization algorithms should be global functions with a signature
like 'iterative_optimizer'.

def iterative_optimizer(parameters,
                        cost=None,
                        gradients=None,
                        stop=None,
                        updates=None,
                        **kwargs):
    """
    :param parameters: list or tuple of Theano variables
        that we want to optimize iteratively. If we're minimizing f(x), then
        together, these variables represent 'x'. Typically these are shared
        variables and their values are the initial values for the minimization
        algorithm.

    :param cost: scalar-valued Theano variable that computes an exact or noisy
        estimate of cost (what are the conditions on the noise?). Some
        algorithms might need an exact cost, some algorithms might ignore the
        cost if the gradients are given.

    :param gradients: list or tuple of Theano variables representing the
        gradients on the corresponding parameters. These default to
        tensor.grad(cost, parameters).

    :param stop: a shared variable (scalar integer) that (if provided) will be
        updated to say when the iterative minimization algorithm has finished
        [...]
        internal variables that are part of the optimization algorithm.

    """

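For concreteness, here is a minimal sketch of one implementation of this
signature and of how its update dictionary plugs into theano.function. It
assumes plain stochastic gradient descent with a step_size keyword (as in the
sgd example in the Examples section); it is an illustration, not the committed
implementation.

import numpy
import theano
import theano.tensor as TT

def sgd(parameters, cost=None, gradients=None, stop=None, updates=None,
        step_size=0.1):
    """Sketch: an SGD instance of the 'iterative_optimizer' signature.
    Returns a dictionary of updates suitable for theano.function.
    The `stop` argument is unused here (plain SGD never signals completion)."""
    if gradients is None:
        gradients = TT.grad(cost, parameters)
    rval = dict(updates) if updates else {}
    for p, g in zip(parameters, gradients):
        rval[p] = p - step_size * g   # one gradient step per function call
    return rval

# Usage: each call to the compiled function performs one minimization step.
x = theano.shared(numpy.zeros(5), name='x')
cost = ((x - 1) ** 2).sum()
step = theano.function([], cost, updates=sgd([x], cost=cost, step_size=0.1))
for i in range(100):
    step()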


Numpy Interface
---------------

The numpy interface to optimization algorithms is supposed to mimic scipy's.
Its arguments are numpy arrays, and functions that manipulate numpy arrays.

TODO: There is also room for an iterative object (one that doesn't hog program
control) but which nonetheless works on numpy objects. Actually, minimize()
should use this iterative interface under the hood.

def minimize(x0, f, df, opt_algo, **kwargs):
    """
    Return a point x_new that minimizes function `f` with derivative `df`.

    [...]

    :param kwargs: passed through to `opt_algo`

    """

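A rough sketch of minimize(), under the assumption that opt_algo follows
scipy.optimize's (f, x0, fprime=...) calling convention; the choice of fmin_cg
in the usage example is illustrative and not part of the spec, and the
parameter documentation elided above is not reproduced here.

import numpy
import scipy.optimize

def minimize(x0, f, df, opt_algo, **kwargs):
    # Assumption: opt_algo takes (f, x0, fprime=...) like scipy.optimize.fmin_*.
    x0 = numpy.asarray(x0)
    return opt_algo(f, x0, fprime=df, **kwargs)

# Example usage with scipy's conjugate-gradient routine as opt_algo.
f = lambda x: ((x - 1) ** 2).sum()
df = lambda x: 2 * (x - 1)
x_new = minimize(numpy.zeros(5), f, df, scipy.optimize.fmin_cg, disp=0)
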

OD asks: Could it be more convenient for x0 to be a list?

JB replies: Yes, but that's not the interface used by other minimize()
routines (e.g. in scipy). Maybe another list-based interface is required?


OD asks: Why make a difference between iterative and one-shot versions? A
one-shot algorithm can be seen as an iterative one that stops after its first
iteration. The difference I see between the two interfaces proposed here is
mostly that one relies on Theano while the other one does not, but hopefully a
non-Theano one can be created by simply wrapping around the Theano one.

JB replies: Right, it would make more sense to distinguish them by the fact that
one works on Theano objects, and the other on general Python callable functions.
There is room for an iterative numpy interface, but I didn't make it yet. Would
that answer your question?
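
To make the discussion concrete, here is a hedged sketch of the kind of
wrapping OD describes, reusing the imports and the sgd sketch from the Theano
Interface section above. The function name, fixed iteration count, and
step_size are illustrative assumptions, and it only works when f (and df, if
given) can also be applied to Theano variables, which is exactly JB's point
about the difference between Theano objects and general Python callables.

def minimize_via_theano(x0, f, df=None, n_steps=1000, step_size=0.01):
    # Wrap the Theano-level iterative interface behind a numpy-style call.
    # Assumes f (and df, if given) also work when applied to Theano variables.
    x = theano.shared(numpy.asarray(x0))
    cost = f(x)
    gradients = [df(x)] if df is not None else None
    step = theano.function([], cost,
                           updates=sgd([x], cost=cost, gradients=gradients,
                                       step_size=step_size))
    for i in range(n_steps):
        step()
    return x.get_value()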


Examples
--------

Simple stochastic gradient descent with extra updates:

sgd([p], gradients=[g], updates={a:b}, step_size=.1) will return {a:b, p:p-.1*g}
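
Spelled out with the sgd sketch from the Theano Interface section above (an
assumption about the implementation, not committed code), the example reads:

p = theano.shared(numpy.zeros(3), name='p')
g = TT.vector('g')               # gradient expression for p
a = theano.shared(0.0, name='a')
b = a + 1                        # an arbitrary extra update to carry along

updates = sgd([p], gradients=[g], updates={a: b}, step_size=.1)
# updates == {a: b, p: p - .1 * g}, ready for theano.function(..., updates=updates)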