Image: a function f(x, y) that maps each pixel coordinate (x, y) to some value (intensity or color).
The above methods are point-processing methods: each output pixel depends only on the corresponding input pixel, with no awareness of structure. What if I want to turn a white cat into a black cat?
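A minimal sketch of what "point processing" means in code; the helper names and the use of NumPy are illustrative assumptions, not anything defined above:

```python
import numpy as np

# Point processing: each output pixel is a function of the corresponding
# input pixel only, so no spatial structure is used (it cannot tell the
# cat apart from the background).
def invert(img):
    return 1.0 - img              # assumes a float image in [0, 1]

def gamma_correct(img, gamma=2.2):
    return img ** (1.0 / gamma)   # per-pixel tone adjustment

img = np.random.rand(4, 4)        # toy 4x4 grayscale image
print(invert(img).shape, gamma_correct(img).shape)
```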
Antialiasing methods:
add more pixels (sample at a higher rate)
get rid of the high frequencies before sampling (lossy: fine detail is discarded)
Compared to the average (box) filter, the Gaussian filter removes more of the high-frequency content.
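A quick 1-D illustration of that claim; the 5-tap width and sigma here are arbitrary choices, not values from the notes:

```python
import numpy as np

# Frequency responses of a 5-tap box filter and a truncated Gaussian of
# similar width. The box filter's response has side lobes that let high
# frequencies leak through; the Gaussian's response keeps falling off.
n = 5
x = np.arange(n) - n // 2
box = np.ones(n) / n

sigma = 1.0
gauss = np.exp(-x**2 / (2 * sigma**2))
gauss /= gauss.sum()

freqs = np.linspace(0, np.pi, 8)                         # 0 .. Nyquist
H_box = [abs(np.sum(box * np.exp(-1j * w * x))) for w in freqs]
H_gauss = [abs(np.sum(gauss * np.exp(-1j * w * x))) for w in freqs]

for w, hb, hg in zip(freqs, H_box, H_gauss):
    print(f"w={w:.2f}  box={hb:.3f}  gauss={hg:.3f}")
# Near w = pi the box response bounces back up (side lobe),
# while the Gaussian response stays small.
```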
Convolution and Cross-Correlation are similar: convolution flips the kernel by 180^\circ (or time-reverses it if it is 1D) before performing the elementwise multiplication and sum.
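A small check of the flip relationship, assuming SciPy's `convolve2d` / `correlate2d` (kernel values are arbitrary):

```python
import numpy as np
from scipy.signal import convolve2d, correlate2d

# Convolution flips the kernel (180 degrees in 2-D) before the
# multiply-and-sum; cross-correlation does not. So correlating with a
# pre-flipped kernel gives the same result as convolving.
img = np.random.rand(6, 6)
k = np.array([[1., 2., 3.],
              [4., 5., 6.],
              [7., 8., 9.]])

conv = convolve2d(img, k, mode='same')
corr_flipped = correlate2d(img, k[::-1, ::-1], mode='same')

print(np.allclose(conv, corr_flipped))   # True
```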
Convolution is commutative and associative.
Also, a Gaussian kernel convolved with itself is another Gaussian kernel, but with sigma scaled to \sigma \sqrt{2}.
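A numeric sanity check of the \sigma \sqrt{2} claim; the kernel support and sigma below are arbitrary:

```python
import numpy as np

# Convolving a Gaussian (sigma) with itself gives a Gaussian with
# sigma * sqrt(2); more generally sigma_out = sqrt(sigma1^2 + sigma2^2).
sigma = 2.0
x = np.arange(-20, 21)
g = np.exp(-x**2 / (2 * sigma**2))
g /= g.sum()

gg = np.convolve(g, g, mode='full')           # self-convolution, support -40..40

x2 = np.arange(-40, 41)
target = np.exp(-x2**2 / (2 * (sigma * np.sqrt(2))**2))
target /= target.sum()

print(np.max(np.abs(gg - target)))            # ~0 (up to truncation error)
```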
When pre-filtering (blurring before subsampling) is applied to the strided convolution and pooling layers of a neural net, classification becomes more stable (smaller variance) under image translation. (ICML 2019, "Making Convolutional Networks Shift-Invariant Again")
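A sketch of the underlying idea (blur, then subsample) rather than the paper's actual implementation; the Gaussian blur, sigma, stride, and feature-map shapes are illustrative assumptions:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

# Low-pass filter a feature map before taking every other sample, so a
# one-pixel shift of the input changes the downsampled output much less.
def downsample_naive(x, stride=2):
    return x[::stride, ::stride]

def downsample_antialiased(x, stride=2, sigma=1.0):
    return gaussian_filter(x, sigma)[::stride, ::stride]

rng = np.random.default_rng(0)
feat = rng.random((64, 64))
shifted = np.roll(feat, 1, axis=1)            # 1-pixel horizontal shift

for name, f in [("naive", downsample_naive), ("blurred", downsample_antialiased)]:
    diff = np.abs(f(feat) - f(shifted)).mean()
    print(name, diff)                         # blurred version changes less
```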
For tasks like classification, diverse pre-processing artifacts help make the classifier robust. For generative models, however, artifacts must be avoided to get the best quality in the generated result.
Gaussian Pyramid: store different down-sampled versions of the same image together (see the sketch after this list).
Classification: cheaper to classify at low resolution
Generation: cheaper to first generate a coarse image
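A minimal Gaussian-pyramid sketch, assuming a blur-then-drop-every-other-pixel scheme; sigma and the number of levels are arbitrary choices:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

# Repeatedly blur (to remove frequencies that would alias) and then
# drop every other row and column.
def gaussian_pyramid(img, levels=4, sigma=1.0):
    pyramid = [img]
    for _ in range(levels - 1):
        blurred = gaussian_filter(pyramid[-1], sigma)
        pyramid.append(blurred[::2, ::2])
    return pyramid

img = np.random.rand(256, 256)
for level in gaussian_pyramid(img):
    print(level.shape)        # (256, 256), (128, 128), (64, 64), (32, 32)
```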
Padding (see the np.pad sketch after this list):
constant black: results in a dark border
wrap around: pixels leak in from the opposite side, giving wrong colors
copy edge: good, but can produce streak/line artifacts near the edge
reflect across edge: most common choice
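The four padding strategies expressed with NumPy's `np.pad` modes (the mapping of the names above to modes is my reading of the list):

```python
import numpy as np

# Pad a tiny image with a 1-pixel border using each strategy.
img = np.arange(9, dtype=float).reshape(3, 3)

padded = {
    "constant black":      np.pad(img, 1, mode="constant", constant_values=0),
    "wrap around":         np.pad(img, 1, mode="wrap"),
    "copy edge":           np.pad(img, 1, mode="edge"),
    "reflect across edge": np.pad(img, 1, mode="reflect"),
}
for name, p in padded.items():
    print(name, "\n", p)
```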