Project 1: Images of the Russian Empire

CS194-26 Image Manipulation and Computational Photography

David Xiong // cs194-26-abr

Introduction

In 1907, Sergei Mikhailovich Prokudin-Gorskii set out on a quest to document the Russian Empire with color photographs. There was just one problem - he only had black-and-white negatives available to him, as color photography hadn't been invented yet. The idea he came up with was simple: he record three exposures of every scene onto a glass plate using a red, a green, and a blue filter. Luckily, his glass plate negatives survived the fall of the Russian Empire in the late 1910s and were purchased by the Library of Congress - who have recently made the collection available online.

The goal of this assignment was to take the digitized Prokudin-Gorskii glass plate images and, using image processing techniques, automatically produce a color image with as few visual artifacts as possible. In order to do this, I needed to extract the three color channel images, place them on top of each other, and align them so that they form a single RGB color image.

[Adapted from CS194-26 Project 1 Specification]

Preparing the Images

I first sliced each image into equal thirds, aseach color plate takes up roughly 1/3 of the total input image. However, there was the problem of the black borders - this was not much of a problem for the smaller .jpg images, but it proved to be a pain for the larger .tff ones as the large borders led to bad alignment. In the end, I simply decided to crop each image - this was enough to get rid of the black border without losing too much content.

Naive Alignment

My first steps were to test alignment on the smaller .jpg images. As per a hint on the specification, I tried simply using Normalized Cross Correlation (NCC) on those images. This worked fantastically for the smaller images, but once I moved to the larger .tffs things started to fall apart - there was too large of a gap between different features and the images began to look mismatched.

Naive Alignment Images Examples of images processed using naive alignment

Pyramid Alignment

Instead, for the larger images, I decided to use pyramid alignment. For each of the pairs of colors I wanted to align, I would recursively quarter the size of the image until it was reasonably small to work with, and then performed naive alignment on that base case. When the function works its way up the stack, we can use these guesses from the smaller images to further refine our resulting displacement vector - giving us a nicely aligned image by the time the main function call returns.