Mercurial > pylearn
comparison pylearn/algorithms/kernel_regression.py @ 1505:723e2d761985
auto white space fix.
author | Frederic Bastien <nouiz@nouiz.org> |
---|---|
date | Mon, 12 Sep 2011 10:49:15 -0400 |
parents | bf5c0f797161 |
children |
comparison
equal
deleted
inserted
replaced
1504:bf5c0f797161 | 1505:723e2d761985 |
---|---|
35 that linear system matrix requires O(n^2) memory. | 35 that linear system matrix requires O(n^2) memory. |
36 So this learning algorithm should be used only for | 36 So this learning algorithm should be used only for |
37 small datasets. | 37 small datasets. |
38 * the linear system is | 38 * the linear system is |
39 (M + lambda I_n) theta = (1, y)' | 39 (M + lambda I_n) theta = (1, y)' |
40 where theta = (b, alpha), I_n is the (n+1)x(n+1) matrix that is the identity | 40 where theta = (b, alpha), I_n is the (n+1)x(n+1) matrix that is the identity |
41 except with a 0 at (0,0), M is the matrix with G in the sub-matrix starting | 41 except with a 0 at (0,0), M is the matrix with G in the sub-matrix starting |
42 at (1,1), 1's in column 0, except for a value of n at (0,0), and sum_i G_{i,j} | 42 at (1,1), 1's in column 0, except for a value of n at (0,0), and sum_i G_{i,j} |
43 in the rest of row 0. | 43 in the rest of row 0. |
44 | 44 |
45 Note that this gives an estimate of E[y|x,training_set] that is the | 45 Note that this gives an estimate of E[y|x,training_set] that is the |
46 same as obtained with a Gaussian process regression. The GP | 46 same as obtained with a Gaussian process regression. The GP |
47 regression would also provide a Bayesian Var[y|x,training_set]. | 47 regression would also provide a Bayesian Var[y|x,training_set]. |
48 It corresponds to an assumption that f is a random variable | 48 It corresponds to an assumption that f is a random variable |
49 with Gaussian (process) prior distribution with covariance | 49 with Gaussian (process) prior distribution with covariance |
58 all_results_dataset=kernel_predictor(test_set) # creates a dataset with "output" and "squared_error" field | 58 all_results_dataset=kernel_predictor(test_set) # creates a dataset with "output" and "squared_error" field |
59 outputs = kernel_predictor.compute_outputs(inputs) # inputs and outputs are numpy arrays | 59 outputs = kernel_predictor.compute_outputs(inputs) # inputs and outputs are numpy arrays |
60 outputs, errors = kernel_predictor.compute_outputs_and_errors(inputs,targets) | 60 outputs, errors = kernel_predictor.compute_outputs_and_errors(inputs,targets) |
61 errors = kernel_predictor.compute_errors(inputs,targets) | 61 errors = kernel_predictor.compute_errors(inputs,targets) |
62 mse = kernel_predictor.compute_mse(inputs,targets) | 62 mse = kernel_predictor.compute_mse(inputs,targets) |
63 | 63 |
64 | 64 |
65 | 65 |
66 The training_set must have fields "input" and "target". | 66 The training_set must have fields "input" and "target". |
67 The test_set must have field "input", and needs "target" if | 67 The test_set must have field "input", and needs "target" if |
68 we want to compute the squared errors. | 68 we want to compute the squared errors. |
69 | 69 |
145 | 145 |
146 cls.__compiled = True | 146 cls.__compiled = True |
147 | 147 |
148 def __init__(self): | 148 def __init__(self): |
149 self.compile() | 149 self.compile() |
150 | 150 |
151 class KernelRegressionEquations(KernelPredictorEquations): | 151 class KernelRegressionEquations(KernelPredictorEquations): |
152 #M = T.matrix() # (n_examples+1) x (n_examples+1) | 152 #M = T.matrix() # (n_examples+1) x (n_examples+1) |
153 inputs = T.matrix() # n_examples x n_inputs | 153 inputs = T.matrix() # n_examples x n_inputs |
154 gamma = T.scalar() | 154 gamma = T.scalar() |
155 inv_gamma2 = 1./(gamma*gamma) | 155 inv_gamma2 = 1./(gamma*gamma) |
156 inputs_square = T.sum(inputs*inputs,axis=1) | 156 inputs_square = T.sum(inputs*inputs,axis=1) |
157 #new_G = G+T.dot(inputs,inputs.T) | 157 #new_G = G+T.dot(inputs,inputs.T) |
158 #new_G = T.gemm(G,1.,inputs,inputs.T,1.) | 158 #new_G = T.gemm(G,1.,inputs,inputs.T,1.) |
159 G = T.exp(-(row_vector(inputs_square)-2*T.dot(inputs,inputs.T)+col_vector(inputs_square))*inv_gamma2) | 159 G = T.exp(-(row_vector(inputs_square)-2*T.dot(inputs,inputs.T)+col_vector(inputs_square))*inv_gamma2) |
160 sumG = T.sum(G,axis=0) | 160 sumG = T.sum(G,axis=0) |
161 | 161 |
162 __compiled = False | 162 __compiled = False |
163 | 163 |
164 @classmethod | 164 @classmethod |
165 def compile(cls,linker='c|py'): | 165 def compile(cls,linker='c|py'): |
166 if cls.__compiled: | 166 if cls.__compiled: |
167 return | 167 return |
168 def fn(input_vars,output_vars): | 168 def fn(input_vars,output_vars): |
198 outputs = self.compute_outputs(inputs) | 198 outputs = self.compute_outputs(inputs) |
199 return [outputs,self.equations.compute_errors(outputs,targets)] | 199 return [outputs,self.equations.compute_errors(outputs,targets)] |
200 def compute_mse(self,inputs,targets): | 200 def compute_mse(self,inputs,targets): |
201 errors = self.compute_errors(inputs,targets) | 201 errors = self.compute_errors(inputs,targets) |
202 return numpy.sum(errors)/errors.size | 202 return numpy.sum(errors)/errors.size |
203 | 203 |
204 def __call__(self,dataset,output_fieldnames=None,cached_output_dataset=False): | 204 def __call__(self,dataset,output_fieldnames=None,cached_output_dataset=False): |
205 assert dataset.hasFields(["input"]) | 205 assert dataset.hasFields(["input"]) |
206 if output_fieldnames is None: | 206 if output_fieldnames is None: |
207 if dataset.hasFields(["target"]): | 207 if dataset.hasFields(["target"]): |
208 output_fieldnames = ["output","squared_error"] | 208 output_fieldnames = ["output","squared_error"] |
215 f = self.compute_outputs | 215 f = self.compute_outputs |
216 elif output_fieldnames == ["output","squared_error"]: | 216 elif output_fieldnames == ["output","squared_error"]: |
217 f = self.compute_outputs_and_errors | 217 f = self.compute_outputs_and_errors |
218 else: | 218 else: |
219 raise ValueError("unknown field(s) in output_fieldnames: "+str(output_fieldnames)) | 219 raise ValueError("unknown field(s) in output_fieldnames: "+str(output_fieldnames)) |
220 | 220 |
221 ds=ApplyFunctionDataSet(dataset,f,output_fieldnames) | 221 ds=ApplyFunctionDataSet(dataset,f,output_fieldnames) |
222 if cached_output_dataset: | 222 if cached_output_dataset: |
223 return CachedDataSet(ds) | 223 return CachedDataSet(ds) |
224 else: | 224 else: |
225 return ds | 225 return ds |
226 | 226 |
227 | 227 |
228 def kernel_predictor(inputs,params,*otherargs): | 228 def kernel_predictor(inputs,params,*otherargs): |
229 p = KernelPredictor(params,*otherargs[0]) | 229 p = KernelPredictor(params,*otherargs[0]) |
230 return p.compute_outputs(inputs) | 230 return p.compute_outputs(inputs) |
231 |