Project 1: Colorizing the Prokudin-Gorskii photo collection

This project entailed consuming photos from the Prokudin-Gorskii collection and converting them from their original form of 3 separate images of the R, G, and B channels into 1 aligned RGB image.

The process I implemented to combine the three different channels into one colored RGB image is as follows:

Divide the image vertically into 3 equal parts. The top image is the B channel, the middle image is the G channel, and the bottom image is the R channel.
Remove the black borders at the edges of each of the 3 channel images.
Run the 3 channel images through a sobel filter to extract edges.
Perform multiscale pyramid alignment. Recursively downsample the image and align.
Align at each recursive alignment step using sum of squared differences on the pixels of the edge extracted channel images.

Improvements

Cropping

One of the issues with the original images from the collection are a thick border of black and white at the edges of the image. This is bad on several levels. Firstly, it doesn't look good, and results in thick bands at the edges of the outputted RGB image. Secondly, it leads to worse results during the alignment phase, especially when aligning using raw pixel intensities. Since the black border is very large and inconsistent between the three channels, it can lead to weird alignments. To mitigate this issue, I wrote a simple auto-cropper that crops in the image from the edges until the black border has been reduced or completely removed. The auto-cropper works by stepping from the edges of the images, and only stopping when the row or column of the image has fewer than a predefined threshold of pure black or pure white pixels.

Below are results of cropping on cathedral.jpg:

Cathedral image blue channel before crop

Edge extraction

Prior to preprocessing the channel images with a sobel filter to extract edges, I was aligning the images using sum of squared differences on the raw intensity values for each channel. This lead to failures when parts of the image were dominated by regions that were primarily one color. For example, in the monastery image, the bottom dirt section is much more red than blue. This resulted in an alignment that pushed the red channel away from the blue channel, as the difference in pixel intensities in that dirt section dominated.

Image alignment using raw pixel intensities Image alignment using sobel filter Edge extracted image

Monastery aligned using raw channel intensities.

Monastery aligned using edges extracted with sobel filter.

The edge detection image extracted using sobel filter.

Image alignment using raw pixel intensities	Image alignment using sobel filter	Edge extracted image
Monastery aligned using raw channel intensities.	Monastery aligned using edges extracted with sobel filter.	The edge detection image extracted using sobel filter.

White balancing

As another improvement after aligning the images, I perform white balancing using gray world theory. What this entails is taking the average intensity of all the pixels in all channels, and then normalizing each channel individually so that each channel has an average pixel intensity equal to this overall average pixel intensity. This is called the gray world theory, because it is based on the hypothesis that on average, the world is gray (cone activation in short, medium, and long cones are all equal over the entire image). The results of the white balancing are displayed to the right of each image below.

Image Results