view pylearn/datasets/embeddings/convert.py @ 1476:8c10bda4bb5f

Configured default train/valid/test split for icml07.MNIST_rotated_background dataset. Defaults are the ones used by Hugo in the ICML07 paper and in all contracting auto-encoder papers.
author gdesjardins
date Fri, 20 May 2011 16:53:00 -0400
parents b054271b2504
children

#!/usr/bin/python
"""
Convert stdin sentences to word embeddings, and output YAML.
"""

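import sys
import read
import yaml

# Pair each input sentence with its embedding, as computed by read.convert_string.
output = []
for l in sys.stdin:
    l = l.strip()  # drop surrounding whitespace, including the trailing newline
    output.append((l, read.convert_string(l)))

# Emit all (sentence, embedding) pairs as a single YAML document.
print(yaml.dump(output))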