annotate bin/pkldu.py @ 1484:83d3c9ee6d65

* changed MNIST dataset to use config.get_filepath_in_roots mechanism
author gdesjardins
date Tue, 05 Jul 2011 11:01:51 -0400
parents 509d6669429d
children
rev   line source
1459
509d6669429d pkldu - changed /bin/env to /usr/bin/env which is more standard I hope.
James Bergstra <bergstrj@iro.umontreal.ca>
parents: 1437
diff changeset
1 #!/usr/bin/env python
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
2 """
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
3 Script to analyze disk usage of pickled files. See usage.
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
4 """
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
5 __authors__ = "Ian Goodfellow, Razvan Pascanu"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
6 __copyright__ = "(c) 2010, Universite de Montreal"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
7 __contact__ = "Razvan Pascanu <r.pascanu@gmail>"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
8
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
9 import cPickle, optparse, time, sys
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
10
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
11
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
12 usage = """
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
13 pkldu [OPTIONS] file indices
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
14
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
15 First argument of the program is the file to analyze. Following arguments
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
16 help you indexing in the object. For example :
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
17 pkldu.py foo.pkl .my_field [my_key] 3
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
18
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
19 will load an object obj from foo.pkl and analyze obj.my_field["my_key"][3]
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
20 """
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
21
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
22 space_units = [(' B', 1),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
23 ('kB', 2**10),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
24 ('MB', 2**20),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
25 ('GB', 2**30),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
26 ('TB', 2**40)]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
27
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
28 time_units = [('s', 1),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
29 ('m', 60),
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
30 ('h', 3600) ]
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
31
1433
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
32 def load(filepath):
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
33 f = open(filepath,'rb')
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
34 obj = cPickle.load(f)
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
35 f.close()
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
36 return obj
14ba52c38f07 removed import to file that don't exist in this repo.
Frederic Bastien <nouiz@nouiz.org>
parents: 1423
diff changeset
37
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
38 def format_string(s, maxlen):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
39 if len(s) > maxlen:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
40 s = s[:maxlen]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
41 return s + ' '*(maxlen - len(s))
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
42
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
43 def prettyprint(size, units, human_readable = False):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
44 unit_name = units[0][0]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
45 rval = size
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
46 if human_readable:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
47 for unit, val in units:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
48 if float(size)/val > 1:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
49 unit_name = unit
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
50 rval = float(size)/val
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
51 return (rval, unit_name)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
52
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
53
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
54 def analyze(options, filepath, indices):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
55
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
56 orig_obj = load(filepath)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
57 cycle_check = {}
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
58 obj_name = 'root_obj'
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
59 cycle_check[id(orig_obj)] = obj_name
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
60
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
61 for field in indices:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
62 if field.startswith('['):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
63 assert field.endswith(']')
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
64 obj_name += '[' + field[1:-1] + ']'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
65 orig_obj = orig_obj[field[1:-1]]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
66 elif field.startswith('.'):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
67 obj_name += '.' + field
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
68 orig_obj = getattr(orig_obj,field[1:])
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
69 else:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
70 obj_name + '[' + field + ']'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
71 orig_obj = orig_obj[eval(field)]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
72 if id(orig_obj) in cycle_check:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
73 print ( "You're going in circles, "+obj_name+" is the same as "
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
74 +cycle_check[id(orig_obj)])
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
75 quit()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
76 cycle_check[id(orig_obj)] = obj_name
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
77
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
78 s = cPickle.dumps(orig_obj)
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
79 prev_bytes = len(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
80 print 'original object : \t\t\t\t%6.2f %s'%prettyprint(prev_bytes,
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
81 space_units,
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
82 options.human)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
83
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
84 t1 = time.time()
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
85 x = cPickle.loads(s)
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
86 t2 = time.time()
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
87 prev_t = t2 - t1
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
88 print 'load time: %6.2f %s'%prettyprint(prev_t, time_units,
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
89 options.human)
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
90
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
91 print_entries = []
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
92 if isinstance(orig_obj, dict):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
93 print 'Object is a dictionary'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
94 keys = [ key for key in orig_obj.keys() ]
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
95 for key in keys:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
96 key_name = format_string(key, 40)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
97 s = cPickle.dumps(orig_obj[key])
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
98 new_bytes = len(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
99 t1 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
100 x = cPickle.loads(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
101 t2 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
102 new_t = t2 - t1
1437
4b27456d3bce renamed output as key for dictionary ( from field). Now for dictionary we have keys, for tuples/list we have entries and for object fields
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1436
diff changeset
103 print_entry = 'key: %40s %6.2f %s ( loads in %6.2f %s)'%(
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
104 (key_name,) +
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
105 prettyprint(new_bytes, space_units, options.human) +
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
106 prettyprint(new_t, time_units, options.human) )
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
107 if options.order is not 'none':
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
108 print 'Processed', key_name
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
109 print_entries += [(new_bytes, print_entry)]
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
110 else:
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
111 print print_entry
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
112
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
113 elif isinstance(orig_obj, (tuple, list)):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
114 print 'Object is a list/tuple of ', len(orig_obj), 'elements'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
115 for idx, v in enumerate(orig_obj):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
116 s = cPickle.dumps(v)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
117 new_bytes = len(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
118 t1 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
119 x = cPickle.loads(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
120 t2 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
121 new_t = t2 - t1
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
122 print_entry = 'entry: %03d \t\t\t\t %6.2f %s ( loads in %6.2f %s)' %(
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
123 (idx,)+
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
124 prettyprint(new_bytes, space_units, options.human) +
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
125 prettyprint(new_t, time_units, options.human) )
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
126
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
127 if options.order is not 'none':
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
128 print 'Processed entry number ', idx
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
129 print_entries += [(new_bytes, print_entry)]
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
130 else:
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
131 print print_entry
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
132 else:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
133 print 'Object is a '+str(type(orig_obj))
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
134 for field in dir(orig_obj):
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
135 field_name = format_string( field, 40)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
136 if field.startswith('__') and not options.reserved:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
137 # We skip reserved fields
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
138 break
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
139 try:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
140 s = cPickle.dumps(getattr(orig_obj, field))
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
141 new_bytes = len(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
142 t1 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
143 x = cPickle.loads(s)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
144 t2 = time.time()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
145 new_t = t2 - t1
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
146 print_entry ='field: %40s %6.2f %s ( loads in %6.2f %s)' %(
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
147 (field_name,)+
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
148 prettyprint(new_bytes, space_units, options.human) +
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
149 prettyprint(new_t, time_units, options.human) )
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
150
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
151 if options.order is not 'none':
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
152 print 'Processed field ', field_name
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
153 print_entries += [(new_bytes, print_entry)]
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
154 else:
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
155 print print_entry
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
156 except:
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
157 print 'Could not pickle field', field_name
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
158 if options.order in ('desc','asc'):
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
159 reverse = False
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
160 if options.order == 'desc':
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
161 reverse = True
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
162 print_entries = sorted(print_entries
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
163 , key = lambda x:x[0]
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
164 , reverse = reverse)
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
165 for entry in print_entries:
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
166 print entry[1]
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
167
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
168
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
169 def process_options():
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
170
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
171 parser = optparse.OptionParser(usage)
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
172
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
173 parser.add_option( '-H'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
174 , "--human-readable"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
175 , dest = 'human'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
176 , action="store_true"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
177 , default=False
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
178 , help = (' If information should be presented in '
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
179 'human readable format')
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
180 )
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
181
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
182 parser.add_option( '-r'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
183 , "--reserved-fields"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
184 , dest = 'reserved'
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
185 , action="store_true"
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
186 , default=False
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
187 , help = (' If information about python reserved '
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
188 ' fields (i.e. starting with `__`) '
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
189 ' should be displayed' )
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
190 )
1436
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
191
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
192
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
193 parser.add_option( '-o'
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
194 , "--order-fields"
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
195 , dest = 'order'
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
196 , default= 'none'
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
197 , help = (' Order fields acording the their size.'
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
198 ' Possible values are {none, desc, asc}')
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
199 )
35b56d794d09 added option to order descending /ascending the fields acording to their size
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1435
diff changeset
200
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
201 return parser.parse_args()
1423
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
202
ea5d27727869 added pickle disk usage inspection utility 'pkldu'
Ian Goodfellow
parents:
diff changeset
203
1435
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
204 if __name__ == '__main__':
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
205 (options,args) = process_options()
3dd64c115657 revised version of pkldu that is a bit more structured code wise and outputs in human readable units
Razvan Pascanu <r.pascanu@gmail.com>
parents: 1434
diff changeset
206 analyze(options, args[0], args[1:])