view statscollector.py @ 101:a1740a99b81f

by default, in a minibatch without any fixed number of batchs, we need to finish at the end of the dataset. Now we return a minibatch at the end event if this minibacht size != the gived minibatch_size.
author Frederic Bastien <bastienf@iro.umontreal.ca>
date Tue, 06 May 2008 16:01:53 -0400
parents 2cd82666b9a7
children f62a03c9d485
line wrap: on
line source


from numpy import *

class StatsCollector(object):
    """A StatsCollector object is used to record performance statistics during training
    or testing of a learner. It can be configured to measure different things and
    accumulate the appropriate statistics. From these statistics it can be interrogated
    to obtain performance measures of interest (such as maxima, minima, mean, standard
    deviation, standard error, etc.). Optionally, the observations can be weighted
    (yielded weighted mean, weighted variance, etc., where applicable). The statistics
    that are desired can be specified among a list supported by the StatsCollector
    class or subclass. When some statistics are requested, others become automatically
    available (e.g., sum or mean)."""

    default_statistics = [mean,standard_deviation,min,max]
    
    __init__(self,n_quantities_observed, statistics=default_statistics):
        self.n_quantities_observed=n_quantities_observed

    clear(self):
        raise NotImplementedError

    update(self,observations):
        """The observations is a numpy vector of length n_quantities_observed. Some
        entries can be 'missing' (with a NaN entry) and will not be counted in the
        statistics."""
        raise NotImplementedError

    __getattr__(self, statistic)
        """Return a particular statistic, which may be inferred from the collected statistics.
        The argument is a string naming that statistic."""