Mercurial > pylearn
comparison doc/v2_planning/dataset.txt @ 1082:f9f72ae84313
dataset: Added a couple points we did not have time to discuss during meeting
author | Olivier Delalleau <delallea@iro> |
---|---|
date | Fri, 10 Sep 2010 15:36:23 -0400 |
parents | 5c14d2ffcbb3 |
children | 4c00af69c164 |
comparison
equal
deleted
inserted
replaced
1081:446bd478953f | 1082:f9f72ae84313 |
---|---|
202 if we wanted to provide a virtual world for an agent as a dataset then we need | 202 if we wanted to provide a virtual world for an agent as a dataset then we need |
203 to provide 'actions' to get the next batch. Could this be done in the current | 203 to provide 'actions' to get the next batch. Could this be done in the current |
204 API? | 204 API? |
205 | 205 |
206 | 206 |
207 | 207 Field names and attributes |
208 ~~~~~~~~~~~~~~~~~~~~~~~~~~ | |
209 | |
210 OD: One important question is how to handle fields' names and characteristics. | |
211 For instance, it can be useful to know that the 3rd input field represents a | |
212 number of fingers, and is a non-negative discrete field whose numeric value is | |
213 meaningful (compared, to, say, an integer index that would correspond to an | |
214 animal's category). We mentioned metadata during the meeting, but we did not | |
215 get into its details: that may be a place where to put this kind of things. | |
216 | |
217 | |
218 Freeing memory | |
219 ~~~~~~~~~~~~~~ | |
220 | |
221 OD: It is sometimes useful to be able to free memory used by previous | |
222 computations. A typical example is when you load in memory the original | |
223 dataset, then perform various processing steps, ending with a new dataset that | |
224 you also store in memory before feeding it to the learner. Unless you very | |
225 carefully design your code to avoid it, your original dataset will still | |
226 remain in memory (as well as maybe the results of some computations performed | |
227 along the way). So there may be a use for a `clear()` method that would be | |
228 called by the topmost dataset (the one doing the final memory caching), and | |
229 would be forwarded iteratively to previous datasets so as to get back all this | |
230 wasted memory space. | |
231 |