Mercurial > ift6266
changeset 432:e2fd928a7de0
added description of nist19 and captcha data sources
author | goldfinger |
---|---|
date | Mon, 03 May 2010 02:43:08 -0400 |
parents | 9fcd0215b8d5 |
children | 858ee3c76497 |
files | writeup/techreport.tex |
diffstat | 1 files changed, 16 insertions(+), 0 deletions(-) [+] |
line wrap: on
line diff
--- a/writeup/techreport.tex Sat May 01 14:27:56 2010 -0400 +++ b/writeup/techreport.tex Mon May 03 02:43:08 2010 -0400 @@ -248,8 +248,24 @@ \begin{itemize} \item {\bf NIST} +The NIST Special Database 19 (NIST19) \ref{Grother} is a very widely used dataset for training and testing OCR systems. The dataset is +composed with over 800 000 digits and characters (upper and lower cases), with hand checked classifications, extracted from +handwritten sample forms of 3600 writers. The characters are labelled by one of the 62 classes corresponding to "0"-"9", +"A"-"Z" and "a"-"z". The dataset contains 8 series of different complexity. The fourth series, $hsf_4$, +experimentally recognized to be the most difficult one for classification task is recommended by NIST as testing set and is +used in our work for that purpose. The performances reported by previous work on that dataset mostly use only the digits. +Here we use the whole classes both in the training and testing phase. + + \item {\bf Fonts} \item {\bf Captchas} +The Captcha data source is an adaptation of the \emph{pycaptcha} library (a python based captcha generator library) for +generating characters of the same format as the NIST dataset. The core of this data source is composed with a random character +generator and various kinds of tranformations similar to those described in the previous sections. +In order to increase the variability of the data generated, different fonts are used for generating the characters. +Transformations (slant, distorsions, rotation, translation) are applied to each randomly generated character with a complexity +depending on the value of the complexity parameter provided by the user of the data source. Two levels of complexity are +allowed and can be controlled via an easy to use facade class. \item {\bf OCR data} \end{itemize}