CS194 Project 1 - Jing Yuan

The goal of this assignment is to take the digitized Prokudin-Gorskii glass plate images and, using image processing techniques, automatically produce a color image with as few visual artifacts as possible In order to do achieve this goal, extraction of the three color channel images, placing them on top of each other, and alignment of the color channel images are needed to create a single RGB color image.

Methodology

The input is a glass late image with three figures taken under RGB filters separately The work flow of this project is: 1> Separate the raw figure into three part (in the order of B, G, and R); 2> Use possible metrics (SSD or NCC) to score how well the images match; 3> Align three figures together and form a colorful figure In order to improve the performance of the code, I optimize the displacement of small and large figures in two algorithms.

Picture align Solution for small figure

First of all, I develop the align method for small figures (.jpg in this project) Firstly, I corp the raw figure into three part by evenly separating the height into three parts Then, I use 70% of the total figure to match those three figures, which was cropped from the center of the figure After that, using Blue channel as reference image, I match Green and Red channels with Blue channel respectively by searching over a window of possible displacements, [-20,20] pixels Finally, I overlap three figures together after applying the optimal displacements In this process I used both sum of Squared Differences (SSD) and normalized cross-correlation (NCC) and the results turned out that NCC worked better than SSD, thus I used NCC as the matching metric for all the figures.

While this solution works alright with small images, the problem becomes that this is terribly inefficient and that I would need to expand the window size of possible displacement vectors from [-20,20] to something much larger for bigger images Therefore, a better algorithm is needed for the large figures.

Picture align Solution for large figure

To allow for processing on much larger images, referring to the .tif files in this project, I added a function implementing image pyramid scheme, which scaled the image down by n-fold of their original size Once the width of the figure is smaller than 500 pixel and number of the folder is larger than 9, the function returned the best scale-down fold Then, there will be two optimization process for large figures In the 1st optimization, the scaled image went through all the optimization process of the small figure and I will get the estimated best displacement for the scaled figure, and I scaled that up by n Then, in the 2nd optimization, I checked around that vector for some small radius [-10, 10] pixel in both vertical and horizontal direction I only use 30% of the total figure to find the optimal displacements for the 2nd Optimization.

I then take the best estimate for a displacement vector and apply that to the original image, thus obtaining an effective and much more efficient best displacement vector with which to overlap the images together.

Bell & Whistles: Automatic cropping

I also tried to crop the image borders by detecting the edges between the borders and the image First, I inspect each raw figures and find that the boarder are always black and white So, starting from the edge of the figure, I search rightward and the upward the figure to the middle of the figure separately, If the value of the pixel larger than 0.1 (representing black) and also smaller than 0.9 (representing black), the searching will stop and return the corp-width and corp-height that are needed to be cropped from the figure And I also choose 20 start points along the width or height of the figure Then, I remove the largest and smallest corp-width and corp-height and average out the remaining corp-width and corp-height as the final size.

But, from the results I find that this algorithm successfully crop the white borders In some figures, the black borders still exist So, I re-inspect the figure, I find that the borders are not purely white or black The possible solution could be that increasing the contrast of the figure.

Results

Offset Vector (dy,dx).
If dx>0,it means the corresponding picture will move downward
If dy>0,it means the corresponding picture will move rightward

There are three figures from right to left: Raw figures, Aligned figures, Cropped figures

1st Opt: Green Offset Vector: (2,5) Red Offset Vector: (3,12)