view pycaptcha/Captcha/data/words/README @ 137:728e232eaf45

Added script to separate OCR data in train, validation and test sets (raw data)
author boulanni <nicolas_boulanger@hotmail.com>
date Sat, 20 Feb 2010 02:12:57 -0500
parents 4775b4195b4b

These word lists are from various sources:

basic-english:
   http://simple.wikipedia.org/wiki/Basic_English_Alphabetical_Wordlist