# HG changeset patch
# User Arnaud Bergeron
# Date 1264699881 18000
# Node ID f6b6c74bb82f0f36eb5be1e7a7697203c3bc8485
# Parent ebf61603489be28099c9122e53a4a0c8070550ab
Fix the datatypes.

diff -r ebf61603489b -r f6b6c74bb82f transformations/pipeline.py
--- a/transformations/pipeline.py	Thu Jan 28 11:50:01 2010 -0500
+++ b/transformations/pipeline.py	Thu Jan 28 12:31:21 2010 -0500
@@ -14,11 +14,17 @@
 # DANGER: HIGH VOLTAGE -- DO NOT EDIT BELOW THIS LINE
 # -----------------------------------------------------------
 
-train_data = open('/data/lisa/data/nist/by_class/all/all_train_data.ft', 'rb')
+outf = sys.argv[1]
+paramsf = sys.argv[2]
+dataf = '/data/lisa/data/nist/by_class/all/all_train_data.ft'
+if len(sys.argv) >= 4:
+    dataf = sys.argv[3]
+
+train_data = open(dataf, 'rb')
 
 dim = tuple(ft._read_header(train_data)[3])
 
-res_data = numpy.empty(dim)
+res_data = numpy.empty(dim, dtype=numpy.int8)
 
 all_settings = ['complexity']
 
@@ -29,7 +35,7 @@
 
 for i in xrange(0, dim[0], BATCH_SIZE):
     train_data.seek(0)
-    imgs = ft.read(train_data, slice(i, i+BATCH_SIZE))
+    imgs = ft.read(train_data, slice(i, i+BATCH_SIZE)).astype(numpy.float32)/255
 
     complexity = random.random()
     p = i/BATCH_SIZE
@@ -40,12 +46,12 @@
         j += len(par)
 
     for k in range(imgs.shape[0]):
-        c = imgs[k]
+        c = imgs[k].reshape((32, 32))
         for mod in mods:
             c = mod.transform_image(c)
-        res_data[i+k] = c
+        res_data[i+k] = c.reshape((1024,))*255
 
-with open(sys.argv[1], 'wb') as f:
+with open(outf, 'wb') as f:
     ft.write(f, res_data)
 
-numpy.save(sys.argv[2], params)
+numpy.save(paramsf, params)