pylearn: dataset.py comparison

comparison dataset.py @ 269:fdce496c3b56

deprecating __getitem__[fieldname] syntax

author	James Bergstra <bergstrj@iro.umontreal.ca>
date	Wed, 04 Jun 2008 19:04:40 -0400
parents	3f1cd8897fda
children	fa8abc813bd2

-:3f1cd8897fda
+:fdce496c3b56
 - dataset[i] returns an Example.
 - dataset[[i1,i2,...in]] returns a dataset with examples i1,i2,...in.
-- dataset[fieldname] an iterable over the values of the field fieldname across
-the dataset (the iterable is obtained by default by calling valuesVStack
-over the values for individual examples).
 - dataset.<property> returns the value of a property associated with
 the name <property>. The following properties should be supported:
 - 'description': a textual description or name for the dataset
 - 'fieldtypes': a list of types (one per field)
 A DataSet may have other attributes that it makes visible to other objects. These are
 A DataSet sub-class should always redefine the following methods:
 - __len__ if it is not a stream
 - fieldNames
 - minibatches_nowrap (called by DataSet.minibatches())
+For efficiency of implementation, a sub-class might also want to redefine
 - valuesHStack
 - valuesVStack
-For efficiency of implementation, a sub-class might also want to redefine
 - hasFields
 - __getitem__ may not be feasible with some streams
 - __iter__
 A sub-class should also append attributes to self._attribute_names
 (the default value returned by attributeNames()).
 """
 Return a DataSetFields object associated with this dataset.
 """
 return DataSetFields(self,fieldnames)
+def getitem_key(self, fieldname):
+"""A not-so-well thought-out place to put code that used to be in
+getitem.
+"""
+#removing as per discussion June 4. --JSB
+i = fieldname
+# else check for a fieldname
+if self.hasFields(i):
+return self.minibatches(fieldnames=[i],minibatch_size=len(self),n_batches=1,offset=0).next()[0]
+# else we are trying to access a property of the dataset
+assert i in self.__dict__ # else it means we are trying to access a non-existing property
+return self.__dict__[i]
 def __getitem__(self,i):
 """
 dataset[i] returns the (i+1)-th example of the dataset.
 dataset[i:j] returns the subdataset with examples i,i+1,...,j-1.
 dataset[i:j:s] returns the subdataset with examples i,i+2,i+4...,j-2.
 return MinibatchDataSet(
 Example(self.fieldNames(),[ self.valuesVStack(fieldname,field_values)
 for fieldname,field_values
 in zip(self.fieldNames(),fields_values)]),
 self.valuesVStack,self.valuesHStack)
-# else check for a fieldname
+raise TypeError(i, type(i))
-if self.hasFields(i):
-return self.minibatches(fieldnames=[i],minibatch_size=len(self),n_batches=1,offset=0).next()[0]
-# else we are trying to access a property of the dataset
-assert i in self.__dict__ # else it means we are trying to access a non-existing property
-return self.__dict__[i]
 def valuesHStack(self,fieldnames,fieldvalues):
 """
 Return a value that corresponds to concatenating (horizontally) several field values.
 This can be useful to merge some fields. The implementation of this operation is likely
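
For readers skimming the diff, the effect of this changeset on indexing can be sketched as below. This is a minimal illustration, not pylearn code: `ToyDataSet` and its dict-of-lists storage are hypothetical stand-ins, written for Python 3, and only the dispatch in `__getitem__`, the trailing `raise TypeError(i, type(i))`, and the `getitem_key` name are taken from the diff. The sketch also simplifies `getitem_key` by raising `KeyError` for unknown names, whereas the changeset falls back to a property lookup in `self.__dict__`.

```python
# Hypothetical stand-in for a DataSet sub-class; not part of pylearn.
class ToyDataSet:
    def __init__(self, fields):
        # fields: dict mapping field name -> list of per-example values
        self._fields = fields

    def __len__(self):
        return len(next(iter(self._fields.values())))

    def fieldNames(self):
        return list(self._fields)

    def hasFields(self, *names):
        return all(name in self._fields for name in names)

    def getitem_key(self, fieldname):
        # Explicit replacement for the removed dataset[fieldname] lookup.
        # Simplification: unknown names raise KeyError here.
        if self.hasFields(fieldname):
            return self._fields[fieldname]
        raise KeyError(fieldname)

    def __getitem__(self, i):
        if isinstance(i, int):
            # dataset[i]: one example, shown here as a plain dict
            return {name: values[i] for name, values in self._fields.items()}
        if isinstance(i, slice):
            # dataset[i:j] or dataset[i:j:s]: a sub-dataset
            return ToyDataSet({n: v[i] for n, v in self._fields.items()})
        if isinstance(i, list):
            # dataset[[i1, i2, ...]]: a sub-dataset with the listed examples
            return ToyDataSet({n: [v[j] for j in i] for n, v in self._fields.items()})
        # As in the changeset: any other key, including a field name, is rejected.
        raise TypeError(i, type(i))


ds = ToyDataSet({"x": [1, 2, 3], "y": [10, 20, 30]})
first = ds[0]                  # integer index: one example
subset = ds[[0, 2]]            # list of indices: sub-dataset
column = ds.getitem_key("y")   # field access now goes through an explicit method
try:
    ds["y"]                    # the old dataset[fieldname] form
    rejected = False
except TypeError:
    rejected = True
```

Callers that previously wrote `dataset[fieldname]` would need to switch to an explicit accessor; in the real class, `getitem_key` is described by its own docstring as a provisional home for that code.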
