Mercurial > ift6266
diff writeup/techreport.tex @ 426:a7fab59de174
change order of transformations
author | Xavier Glorot <glorotxa@iro.umontreal.ca> |
---|---|
date | Fri, 30 Apr 2010 16:29:17 -0400 |
parents | c06a3d9b5664 |
children | ace489930918 |
line wrap: on
line diff
--- a/writeup/techreport.tex Fri Apr 30 16:24:35 2010 -0400 +++ b/writeup/techreport.tex Fri Apr 30 16:29:17 2010 -0400 @@ -118,13 +118,6 @@ $\sigma$ and $\alpha$ were linked to complexity through the formulas $\alpha = \sqrt[3]{complexity} \times 10.0$ and $\sigma = 10 - 7 \times \sqrt[3]{complexity}$. -\subsection{Motion Blur} - -This is a GIMP filter we applied, a "linear motion blur" in GIMP terminology. The description will be brief as it is a well-known filter. - -This algorithm has two input parameters, $length$ and $angle$. The value of a pixel in the final image is the mean value of the $length$ first pixels found by moving in the $angle$ direction. An approximation of this idea is used, as we won't fall onto precise pixels by following that direction. This is done using the Bresenham line algorithm. - -The angle, in our case, is chosen from a uniform distribution over $[0,360]$ degrees. The length, though, depends on the complexity; it's sampled from a Gaussian distribution of mean 0 and standard deviation $\sigma = 3 \times complexity$. \subsection{Pinch} @@ -138,12 +131,13 @@ The value for $pinch$ in our case was given by sampling from an uniform distribution over the range $[-complexity, 0.7 \times complexity]$. +\subsection{Motion Blur} -\subsection{Distorsion gauss} -This filter simply adds, to each pixel of the image independently, a gaussian noise of mean $0$ and standard deviation $\frac{complexity}{10}$. +This is a GIMP filter we applied, a "linear motion blur" in GIMP terminology. The description will be brief as it is a well-known filter. -It has has a probability of not being applied, at all, of 70\%. +This algorithm has two input parameters, $length$ and $angle$. The value of a pixel in the final image is the mean value of the $length$ first pixels found by moving in the $angle$ direction. An approximation of this idea is used, as we won't fall onto precise pixels by following that direction. This is done using the Bresenham line algorithm. +The angle, in our case, is chosen from a uniform distribution over $[0,360]$ degrees. The length, though, depends on the complexity; it's sampled from a Gaussian distribution of mean 0 and standard deviation $\sigma = 3 \times complexity$. \subsection{Occlusion} @@ -161,6 +155,14 @@ This filter has a probability of not being applied, at all, of 60\%. + +\subsection{Distorsion gauss} + +This filter simply adds, to each pixel of the image independently, a gaussian noise of mean $0$ and standard deviation $\frac{complexity}{10}$. + +It has has a probability of not being applied, at all, of 70\%. + + \subsection{Background Images} This transformation adds a random background behind the letter. The background is chosen by first selecting, at random, an image from a set of images. Then we choose a 32x32 subregion of that image as the background image (by sampling x and y positions uniformly while making sure not to cross image borders). @@ -178,6 +180,7 @@ This filter also has a probability of not being applied, at all, of 75\%. \subsection{Spatially Gaussian Noise} + The aim of this transformation is to filter, with a gaussian kernel, different regions of the image. In order to save computing time we decided to convolve the whole image only once with a symmetric gaussian kernel of size and variance choosen uniformly in the ranges: $[12,12 + 20 \times complexity]$ and $[2,2 + 6 \times complexity]$. The result is normalized between $0$ and $1$. @@ -191,8 +194,9 @@ \subsection{Color and Contrast Changes} + This filter changes the constrast and may invert the image polarity (white on black to black on white). The contrast $C$ is defined here as the difference -between the maximum and the minimum pixel value of the image. A contrast value is sampled uniformly between $1$ and $1-0.85 \times complexity}$ +between the maximum and the minimum pixel value of the image. A contrast value is sampled uniformly between $1$ and $1-0.85 \times complexity$ (this insure a minimum constrast of $0.15$). We then simply normalize the image to the range $[\frac{1-C}{2},1-\frac{1-C}{2}]$. The polarity is inverted with $0.5$ probability.