log dataset.py @ 401:217c8789284b

age author description
Mon, 07 Jul 2008 12:02:34 -0400 Yoshua Bengio fixing merge
Thu, 19 Jun 2008 16:18:37 -0400 Joseph Turian Added more functions
Thu, 19 Jun 2008 12:35:41 -0400 Thierry Bertin-Mahieux dataset__call__() returns a FieldsSubsetDataSet, so still a subset of fields, but not cached any more. I added the function dataset.cached_fields_subset(self,*fieldnames) that returns the old version, cached, in case someone needs it. Current behaviour passes the tests.
Tue, 17 Jun 2008 17:12:43 -0400 Thierry Bertin-Mahieux Cached dataset seems debug, using n_batches... is n_batches around to stay?
Tue, 17 Jun 2008 16:47:33 -0400 Frederic Bastien debugged RenamedFieldsDataSet
Tue, 17 Jun 2008 14:46:10 -0400 Frederic Bastien bugfix, MinibatchDataSet.minibatches_nowrap() must raise StopIteration when needed
Tue, 17 Jun 2008 11:41:01 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Mon, 16 Jun 2008 17:26:51 -0400 Thierry Bertin-Mahieux n_minibatches in ArrayDataSet automatically computed
Mon, 16 Jun 2008 17:17:50 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Mon, 16 Jun 2008 17:17:01 -0400 Thierry Bertin-Mahieux debugging when fieldnames = None, now -> fieldnames = fieldNames()
Mon, 16 Jun 2008 17:07:44 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Mon, 16 Jun 2008 16:59:35 -0400 Thierry Bertin-Mahieux sorry for all the debugging, this push should be the good one (and damn you *self*)
Mon, 16 Jun 2008 16:52:39 -0400 Thierry Bertin-Mahieux debug in an assert of minibatches
Mon, 16 Jun 2008 17:07:41 -0400 Frederic Bastien bugfix in RenamedDataSet
Mon, 16 Jun 2008 16:44:11 -0400 Thierry Bertin-Mahieux redone my previous hack in MinibatchToSingleExampleIterator, tests should work again
Mon, 16 Jun 2008 16:38:03 -0400 Thierry Bertin-Mahieux IMPORTANT: minibatches now returns minibatch_nowrap with a minimum of assert before. Should implement the good behavior, e.g. returning only complete batches and let the user figure out what he wants.
Mon, 16 Jun 2008 16:11:36 -0400 Thierry Bertin-Mahieux merge
Mon, 16 Jun 2008 16:03:16 -0400 Thierry Bertin-Mahieux little hack in MiniBatchToSingleExampleIterator, there was a problem which I think was not a bug, we were receiving [array(3)] and everything was crashing. Hack is kinda slow
Mon, 16 Jun 2008 16:06:59 -0400 Yoshua Bengio Added untested RenamedFieldsDataSet
Thu, 12 Jun 2008 12:45:42 -0400 Thierry Bertin-Mahieux small bugs fixed with NArrayDataSet
Thu, 12 Jun 2008 12:35:47 -0400 Thierry Bertin-Mahieux NArraysDataSet improved, use arrays instead of matrix, also a dictionnary of field indexes
Wed, 11 Jun 2008 17:11:37 -0400 Thierry Bertin-Mahieux bug fixed when initializing NArraysDataSet with 1-d arrays. This now raises an Exception
Wed, 11 Jun 2008 17:05:56 -0400 Thierry Bertin-Mahieux bug fixed when one matrix is an array, a 1-d matrix
Wed, 11 Jun 2008 16:59:03 -0400 Thierry Bertin-Mahieux NArraysDataSet, a generalization ArrayDataSet where every field is a ndarray, is implemented. Not really tested aside basic stuff...
Wed, 11 Jun 2008 16:40:47 -0400 Thierry Bertin-Mahieux doc updated regarding __getitem__ returning LookupList and .subset returning a DataSet
Wed, 11 Jun 2008 16:28:09 -0400 Thierry Bertin-Mahieux getitem in ArrayDataSet is set up again, supposed to be faster than default one, has been tested agains the default behaviour. In particular, now always return a LookupList
Wed, 11 Jun 2008 13:57:34 -0400 Thierry Bertin-Mahieux bug fixed concerning the slicing, now ds[0:len(ds) + 1000 : 2] is accepted, same a python list
Wed, 11 Jun 2008 13:53:39 -0400 Thierry Bertin-Mahieux behaviour is now the same as a list in pylearn, so if len(ds) = 10, ds[10] raise an IndexError, same thing for ds[[1,10]], and ds[0:14:1] returns 10 elements
Wed, 11 Jun 2008 11:43:54 -0400 Thierry Bertin-Mahieux dataset.subset implemented
Wed, 11 Jun 2008 11:18:14 -0400 Thierry Bertin-Mahieux get item now returns LookupLists
Fri, 06 Jun 2008 17:50:29 -0400 James Bergstra ApplyFunctionDataSet passing
Fri, 06 Jun 2008 16:13:17 -0400 James Bergstra still merging
Fri, 06 Jun 2008 15:56:18 -0400 James Bergstra merge
Thu, 05 Jun 2008 18:38:42 -0400 James Bergstra trying to get default implemenations to work
Thu, 05 Jun 2008 13:46:26 -0400 Frederic Bastien bugfix in FieldSubsetDataSet
Thu, 05 Jun 2008 11:47:44 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Tue, 03 Jun 2008 16:13:42 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Tue, 03 Jun 2008 16:13:38 -0400 Frederic Bastien code cleanup
Tue, 03 Jun 2008 16:11:59 -0400 Frederic Bastien code cleanup and small optimisation
Wed, 04 Jun 2008 19:04:40 -0400 James Bergstra deprecating __getitem__[fieldname] syntax
Wed, 04 Jun 2008 18:48:50 -0400 James Bergstra reverting dataset
Wed, 04 Jun 2008 17:49:09 -0400 James Bergstra initial commit of amat
Tue, 03 Jun 2008 16:06:21 -0400 James Bergstra merged
Tue, 03 Jun 2008 16:05:28 -0400 James Bergstra beginning to hack on #20, fixing for Thierry
Tue, 03 Jun 2008 13:24:41 -0400 Frederic Bastien optimication in CachedDataSet.minibatches_nowrap
Tue, 03 Jun 2008 13:23:28 -0400 Frederic Bastien code clean up
Tue, 03 Jun 2008 13:22:45 -0400 Frederic Bastien implemented CachedDataSet.__iter__ as an optimization
Tue, 03 Jun 2008 12:25:53 -0400 Frederic Bastien optimization of CachedDataSet__getitem__
Mon, 02 Jun 2008 11:59:41 -0400 Frederic Bastien Optimisation in ArrayDataSet::__getitem__
Fri, 30 May 2008 10:19:16 -0400 delallea Hmm... that was committed by Fred I think, I got lost by Mercurial I think
Fri, 30 May 2008 10:14:46 -0400 delallea If I understand properly, the length of an unbounded stream is sys.maxint
Thu, 29 May 2008 10:30:59 -0400 Frederic Bastien optimization in ArrayDataSet::__iter__
Wed, 28 May 2008 14:09:19 -0400 James Bergstra re-enabling old ArrayDataSet indexing
Tue, 27 May 2008 13:46:03 -0400 Frederic Bastien Automated merge with ssh://projects@lgcm.iro.umontreal.ca/hg/pylearn
Fri, 16 May 2008 16:38:07 -0400 Frederic Bastien optimisation of ArrayDataSet
Tue, 27 May 2008 13:23:05 -0400 James Bergstra reinstating changeset 216 clobbered accidentally by 218
Fri, 23 May 2008 16:01:01 -0400 Thierry Bertin-Mahieux should have solved the problem with minibatches not handling subsets of fieldnames, although maybe not super efficient
Fri, 23 May 2008 12:22:54 -0400 Thierry Bertin-Mahieux small debugging
Thu, 22 May 2008 19:07:51 -0400 James Bergstra Modified ArrayDataSet's handling of column fields.
Wed, 21 May 2008 17:39:30 -0400 Thierry Bertin-Mahieux in __get__, problem if the i.stop was None, i being the slice, added one line replacing None by the len(self)
Thu, 15 May 2008 15:21:00 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Thu, 15 May 2008 13:09:26 -0400 Frederic Bastien bugfix
Thu, 15 May 2008 13:04:20 -0400 Frederic Bastien bugfix?
Tue, 13 May 2008 16:42:51 -0400 Frederic Bastien bugfix
Tue, 13 May 2008 16:02:32 -0400 Frederic Bastien bugfix
Tue, 13 May 2008 15:29:04 -0400 Frederic Bastien more explicit comment
Tue, 13 May 2008 15:27:45 -0400 Frederic Bastien the function should take the example unpacked as an input.
Thu, 15 May 2008 12:55:21 -0400 Yoshua Bengio Complete redesign of learner...
Wed, 14 May 2008 14:49:08 -0400 James Bergstra added __contains__ to Dataset, added parent constructor call to ArrayDataSet
Tue, 13 May 2008 13:23:05 -0400 Frederic Bastien bugfix. Otherwise the example was writed over and not a new one was returned
Mon, 12 May 2008 18:51:42 -0400 Joseph Turian Updated documentation
Mon, 12 May 2008 18:40:17 -0400 Joseph Turian Created exceptions.py
Mon, 12 May 2008 17:40:06 -0400 Frederic Bastien bugfix, we should advence by the minibatch size.
Mon, 12 May 2008 16:51:54 -0400 Frederic Bastien bugfix: make the iterator advance
Mon, 12 May 2008 16:13:48 -0400 Frederic Bastien merge
Mon, 12 May 2008 16:12:29 -0400 Frederic Bastien optimization
Mon, 12 May 2008 16:11:24 -0400 Yoshua Bengio Fixes in CacheDataSet
Mon, 12 May 2008 15:51:43 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Mon, 12 May 2008 15:51:30 -0400 Frederic Bastien bugfix
Mon, 12 May 2008 15:50:28 -0400 Yoshua Bengio Trying to make CacheDataSet work
Mon, 12 May 2008 15:08:18 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Mon, 12 May 2008 14:15:16 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Mon, 12 May 2008 14:13:39 -0400 Frederic Bastien ...
Mon, 12 May 2008 14:13:19 -0400 Frederic Bastien bugfix
Wed, 07 May 2008 16:26:28 -0400 Frederic Bastien typo
Mon, 12 May 2008 14:30:21 -0400 Yoshua Bengio Fixed bugs in dataset to make test_mlp.py work
Fri, 09 May 2008 17:38:57 -0400 Yoshua Bengio Fixes to ApplyFunctionDataSet and other things to make learner and mlp work
Thu, 08 May 2008 00:54:14 -0400 Joseph Turian Updated documentation slightly
Wed, 07 May 2008 20:51:24 -0400 Yoshua Bengio minor edits
Wed, 07 May 2008 16:22:28 -0400 Frederic Bastien bugfix
Wed, 07 May 2008 16:17:25 -0400 Frederic Bastien bugfix
Tue, 06 May 2008 22:24:55 -0400 bengioy Worked on OneShotTLearner and implementation of LinearRegression
Tue, 06 May 2008 16:34:48 -0400 Frederic Bastien bugfix, we keep all the line, but only a some columns
Tue, 06 May 2008 16:01:53 -0400 Frederic Bastien by default, in a minibatch without any fixed number of batchs, we need to finish at the end of the dataset. Now we return a minibatch at the end event if this minibacht size != the gived minibatch_size.
Tue, 06 May 2008 13:57:36 -0400 Frederic Bastien in ArrayDataSet.__init__ if a columns is an index, we change it to be a list that containt only this index. This way, we remove the special case where the columns is an index for all subsequent call.
Tue, 06 May 2008 13:50:54 -0400 Frederic Bastien bugfix in DataSet.minibatch to correctly wrap in all corner case.
Tue, 06 May 2008 10:52:45 -0400 Frederic Bastien removed duplicated example in comment
Mon, 05 May 2008 17:13:07 -0400 Frederic Bastien corrected comment
Mon, 05 May 2008 14:51:41 -0400 Frederic Bastien bugfix as numpy numpy don't support self.data[[i1,...],[i2,...]] when their is more then two i1 and i2
Mon, 05 May 2008 13:37:39 -0400 Frederic Bastien bugfix
Mon, 05 May 2008 11:14:28 -0400 Frederic Bastien bugfix
Mon, 05 May 2008 11:02:03 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Mon, 05 May 2008 10:56:58 -0400 Frederic Bastien bugfix
Mon, 05 May 2008 09:35:30 -0400 bengioy In the middle of designing TLearner
Sun, 04 May 2008 15:09:22 -0400 bengioy toying with different ways to implement learners
Sat, 03 May 2008 14:41:02 -0400 Yoshua Bengio Fixed errors raised by test_dataset
Sat, 03 May 2008 14:29:56 -0400 bengioy Coded untested ApplyFunctionDataSet and CacheDataSet
Fri, 02 May 2008 18:36:47 -0400 Joseph Turian Changed docs slightly
Fri, 02 May 2008 11:24:17 -0400 Frederic Bastien fixed test and removed print
Fri, 02 May 2008 11:10:20 -0400 Yoshua Bengio fixed the infinite loop
Fri, 02 May 2008 11:01:28 -0400 Yoshua Bengio trying to fix infinite loop
Fri, 02 May 2008 10:14:01 -0400 Frederic Bastien bugfix
Fri, 02 May 2008 09:55:38 -0400 Frederic Bastien bugfix: in MinibatchDataSet renamed the class variable fields to _fields as parent class have a function called field.
Thu, 01 May 2008 16:26:34 -0400 Frederic Bastien bugfix
Tue, 29 Apr 2008 17:45:16 -0400 Yoshua Bengio Added empty classes with doc: CachedDataSet and ApplyFunctionDataSet
Tue, 29 Apr 2008 16:09:17 -0400 Yoshua Bengio Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Tue, 29 Apr 2008 15:05:12 -0400 Yoshua Bengio Efficient implementation of getitem for ArrayDataSet
Tue, 29 Apr 2008 14:40:44 -0400 Frederic Bastien Automated merge with ssh://p-omega1@lgcm.iro.umontreal.ca/tlearn
Tue, 29 Apr 2008 14:40:06 -0400 Frederic Bastien fix comment
Tue, 29 Apr 2008 13:40:13 -0400 bengioy Fixing MinibatchDataSet getitem