pylearn: dataset.py comparison

comparison dataset.py @ 101:a1740a99b81f

by default, in a minibatch without any fixed number of batchs, we need to finish at the end of the dataset. Now we return a minibatch at the end event if this minibacht size != the gived minibatch_size.

author	Frederic Bastien <bastienf@iro.umontreal.ca>
date	Tue, 06 May 2008 16:01:53 -0400
parents	a8da709eb6a9
children	8c0a1b11b007

comparison

equal deleted inserted replaced

-:574f4db76022
+:a1740a99b81f
 return self.next_row
 def next(self):
 if self.n_batches and self.n_batches_done==self.n_batches:
 raise StopIteration
+elif not self.n_batches and self.next_row ==self.L:
+raise StopIteration
 upper = self.next_row+self.minibatch_size
 if upper <=self.L:
 minibatch = self.iterator.next()
 else:
 if not self.n_batches:
-raise StopIteration
+upper=min(upper, self.L)
-# we must concatenate (vstack) the bottom and top parts of our minibatch
+# if their is not a fixed number of batch, we continue to the end of the dataset.
-# first get the beginning of our minibatch (top of dataset)
+# this can create a minibatch that is smaller then the minibatch_size
-first_part = self.dataset.minibatches_nowrap(self.fieldnames,self.L-self.next_row,1,self.next_row).next()
+assert (self.L-self.next_row)<=self.minibatch_size
-second_part = self.dataset.minibatches_nowrap(self.fieldnames,upper-self.L,1,0).next()
+minibatch = self.dataset.minibatches_nowrap(self.fieldnames,self.L-self.next_row,1,self.next_row).next()
-minibatch = Example(self.fieldnames,
+else:
-[self.dataset.valuesVStack(name,[first_part[name],second_part[name]])
+# we must concatenate (vstack) the bottom and top parts of our minibatch
-for name in self.fieldnames])
+# first get the beginning of our minibatch (top of dataset)
+first_part = self.dataset.minibatches_nowrap(self.fieldnames,self.L-self.next_row,1,self.next_row).next()
+second_part = self.dataset.minibatches_nowrap(self.fieldnames,upper-self.L,1,0).next()
+minibatch = Example(self.fieldnames,
+[self.dataset.valuesVStack(name,[first_part[name],second_part[name]])
+for name in self.fieldnames])
 self.next_row=upper
 self.n_batches_done+=1
 if upper >= self.L and self.n_batches:
 self.next_row -= self.L
 ds_nbatches =  (self.L-self.next_row)/self.minibatch_size

Mercurial > pylearn

comparison dataset.py @ 101:a1740a99b81f