CS 194 Project 1: Images of the Russian Empire

Richard Chen

Russian photographer Sergei Mikhailovich Prokudin-Gorskii captured images of the Russian Empire in the 1900s by taking pictures of the same scene three times each, placing either a red, blue, and green filter in front of each. The goal of this project is to develop an algorithm that can process these images and recreate an rgb image of the original scene.

Approach

For the small images, a brute force search is satisfactory. I found the displacements of the green and red plates in relation to the blue plate by searching from a range of [-15,15) pixel shifts in both the x and y directions. I evaluated these shifts using a normalized cross correlation. The higher the normalized cross correlation is between two images, the more "similar" they should be to each other.

This aproach works fairly well when images are not that big and a [-15,15) range is satisfactory. But it doesn't work when the images are a lot bigger. Instead I implemented an image pyramid. I start with downscaled versions of the r,g,b channels and gradually work my way up to the pictures full resolution.

For the large images, I downscaled each dimension by a factor of 64 each. Then I found the pixel shift from these downscaled versions searching a range of [-3,3] in both x and y. When I found this value, I would store this information (keeping in mind that this was for a downscaled version). I would then shift the image according to my value, and recursively call the alignment method, except this time with a downscale factor of 32 from the original instead of 64. Every recursive call gets more and more precise to the true pixel shift, until I reach the full resolution.

I had a lot of problems with emir.tif. It seems the problem is the patterns on his clothes. Because there are so many similar patterns, the alignment method gets confused and gives incorrect values. The results improved slightly when I cropped different amounts of the image, but there was no real insight to playing with these margins, so I didn't try to play that game. Most likely to get a clear image, I would need to implement some sort of edge detection. Shaving off 5% from every margin seemed to give decent performance on all other pictures though.

Automatic Cropping

No Crop