CS194-26: Colorizing Prokudin-Gorskii Photo Collection

Overview

The goal of the project is to use image processing techniques to colorize photos from digitized Prokudin-Gorskii Photo Collection. To achieve this, I will extract the three color channel images, align them by placing them on top of each other to form a single colorized image. This project will automate the process of colorizing the images by using single-scale implementation on small images and multi-scale implementation on larger images with image pyramid. The project also implemented automatic contrast and better features for aligning such as using edge detection of rgb channel images.

Section I: Implementation Approaches

red channel: monastery.jpg

green channel: monastery.jpg

blue channel: monastery.jpg

I. Single-Scale implementation

The single-scale implementation is a naive algorithm that align the red, green and blue channels of the images by exhaustively searching over a window of possible displacements. In this case, a window size of 20 is used and [-20,20] pixels is the searching window, which works for all pictures below. During the process, each alignment is given a matching score by a image matching metric. This project is using normalized cross-correlation(NCC) as the image matching metric, for each two images from different channel, I compute the dot product between the two normalized vectors (image1./||image1|| and image2./||image2||) and larger dot product implies greater similarities between the two images therefore implies a better alignment.

II. Multi-Scale Implementation

The single-scale implementation doesn’t work well on larger images with high-resolution since the algorithm isn’t fast enough to perform the exhaustive search. Instead, a faster search procedure image pyramid is needed. When the program gets an image that is too large, it rescales the image by a factor of 2 and recursively passing in the rescaled image to the program until the image is small enough to perform the exhaustive search on the window size of 20. And with this process, we only need to perform the search window of [-20, 20] on the smallest image and [-1, 1] on the other larger scales of the image. Starting from the smallest image and going down the pyramid, the optimal displacement is passed from the smallest image to the original large image. A shift of x pixels in an image downsampled by 1/2 would correspond to a 2x pixel shift in the undownsampled image.

Section II: Problems & Solutions

1. The searching algorithm doesn't work well when we are searching for the entire image since the border of the image will hurt the result. To solve this problem, I crop the image so that we are only compute the image processing metric on the internal pixels only. Each image is cropped so that the 1/6 border of each side is not included when computing the metric and the result is showing below.

2. When performing the alignment process on Emir of Bukhara, the images to be matched do not actually have the same brightness values. Since the picture is saturated with blue, the channels are not aligning when using our previous image matching metrics. The solution to this problem is to align against the green channel rather than the blue, or to use better features such as aligning with the edge detection of the three different channels, as described in the extra credit section.

Section III: Final Results

These are the final results of colorized images after aligning them based on the NCC image matching metric. Also all of the displacement vectors are the shifts when aligning against the blue channel except for the Emir.tif, which is aligning against the blue channel.

I. Small Images

cathedral.jpg green: (row: 5, col: 2) red: (row: 12, col: 3)	monastery.jpg green: (row: -3, col: 2) red: (row: 3, col: 2)	nativity.jpg green: (row: 3, col: 1) red: (row: 8, col: 0)
settlers.jpg green: (row: 7, col: 0) red: (row: 15, col: -1)

II. Large Images

emir.tif blue: (row: -48, col: -24) red: (row: 57, col: 17)	harvesters.tif green: (row: 59, col: 17) red: (row: 123, col: 14)	icon.tif green: (row: 41, col: 17) red: (row: 90, col: 23)
lady.tif green: (row: 54, col: 8) red: (row: 115, col: 11)	self_portrait.tif green: (row: 78, col: 29) red: (row: 175, col: 37)	three_generations.tif green: (row: 51, col: 14) red: (row: 110, col: 12)
train.tif green: (row: 42, col: 6) red: (row: 86, col: 32)	turkmen.tif green: (row: 55, col: 21) red: (row: 116, col: 28)	village.tif green: (row: 64, col: 12) red: (row: 137, col: 22)

III. Extra Custom Images

girls.jpg green: (row: -1, col: 1) red: (row: 0, col: 1)	piony.jpg green: (row: 8, col: 2) red: (row: 16, col: 3)	roses.jpg green: (row: 3, col: 2) red: (row: 8, col: 4)
antiquities.tif green: (row: 12, col: 8) red: (row: 90, col: 17)	gate.tif green: (row: 42, col: -11) red: (row: 98, col: -26)	mosques.tif green: (row: 26, col: 11) red: (row: 70, col: 11)

Section IV: Extra Credit Improvements

1. Automatic Contrasting

To achieve automatic contrasing, I first tried to rescale image intensities such that the darkest pixel is zero and the brightest pixel is 1. However, this doesn't produce significantly different result. After that, I tried to implement the automatic contrasting by applying a function to each pixel that makes the dark pixels darker and bright pixels brighter instead of just focusing on the darkest and brightest pixels. The results are showing below.

lady.tif without auto_contrast	improved lady.tif
harvesters.tif without auto_contrast	improved harvesters.tif
icon.tif without auto_contrast	improved icon.tif

2. Aligning with edge detection

Instead of aligning based on RGB similarity, we can also use better features to align images, for example, using the edges of each three channels and perform the same alignment process using their edge detection images.

Edge detecing results

red channel for monastery.jpg	green channel for monastery.jpg	blue channel for monastery.jpg
red channel for girls.jpg	green channel for girls.jpg	blue channel for girls.jpg
red channel for settlers.jpg	green channel for settlers.jpg	blue channel for settlers.jpg
red channel for emir.jpg	green channel for emir.jpg	blue channel for emir.jpg

Edge detecing results

monastery.jpg by aligning with edges	girls.jpg by aligning with edges	settlers.jpg by aligning with edges
emir.jpg by aligning with edges

Section V: The Program

To run the program, put all input files in the inputs directory and create an outputs directory, all output images will be in the outputs directory after running the command: python main.py

Images of Russian Empire: Colorizing Prokudin-Gorskii Photo Collection

CS194-26 Image Manipulation and Computational Photography

Jingxi Huang cs194-26-aap

Overview

Section I: Implementation Approaches

I. Single-Scale implementation

II. Multi-Scale Implementation

Section II: Problems & Solutions

Section III: Final Results

I. Small Images

II. Large Images

III. Extra Custom Images

Section IV: Extra Credit Improvements

1. Automatic Contrasting

2. Aligning with edge detection

Edge detecing results

Edge detecing results

Section V: The Program