CS194-26 Project 1

By Kevin Li

Overview

In this project, our task was to create a single colored-image using three images of different color channels that are stacked on to op one another. To complete this task, we utilized techniques and algorithms to translate the images so that the images aligned on top of one another to create a single image combining the three color channels.

Approach

My approach to aligning the three images was…

IF the image is small and compressed (ie. jpg files), use iteration to determine the translation that generates the smallest sum of squared distance(our metric to determine the accuracy of the aligned image)
OTHERWISE depending on the size of the image, we determine how many times we downscale an image (by a factor of 2) and use recursion to continue to downsize until we hit the base case. (Image Pyramid)
Solve the smallest sum of squared distance for the base case and pass the result up until we have finished the recusion process.
In essence we are starting from the top of the pyramid and search for the best displacement and pass that displacement down to the next layer of the pyramid.

Challenges

While working on this project one challenge that I encountered was my lack of familiarity with both Python and Matlab. Since this was my first major project where I had to use Python, it took quite a while to set up my Python and Jupyter Notebook environment. Also, I spent a lot of time trying to get familiar with the different Python tools at my disposal (such as skimage) and learning how to utilize them effectively. As a result, I spent a lot of time tinkering with the different libraries and using a lot of trial and error.
Another challenge that I encountered was dealing with images that were lighter in tone (ie. melons.tif and workshop.tif). Specifically in the melons.tif image, the image was naturally lighter in comparison with the other example images and this led to the melons being drastically different shades in each of the 3 color channels. I believe that this drastic contrast between the melons in the images led to a higher SSD when aligning the dark melons in the red color-channel image with the naturally dark stand in the blue color-channel image.

Example Images

To demonstrate the effectiveness of my approach, here are my results on some of the example images that we were given.

cathedral.jpg
G: [3, 0]
R: [9, 2]

tobolsk.jpg
G: [3, 2
R: [7, 4]

castle.tif
G: [32, 2]
R: [97, 4]

emir.tif
G: [-1, 33]
R: [57, 50]

workshop.tif
G: [52, 0]
R: [-43, -38]

train.tif
G: [42, 6]
R: [85, 32]

three_generations.tif
G: [48, 16]
R: [109, 11]

melons.tif
G: [102, 49]
R: [107, 0]

harvester.tif
G: [58, 18]
R: [122, 17]

self_portrait.tif
G: [77, 29]
R: [129, 29]

onion_church.tif
G: [50, 26]
R: [108, 37]

lady.tif
G: [48, 7]
R: [107, 11]

icon.tifG:
G: [48, 18]
R: [96, 23]

Self-Selected Images

lake.tif
G: [38, 14]
R: [73, 14]

rock.tif
G: [37, 33]
R: [92, 58]

wetland.tif
G: [39, -2]
R: [-42, -13]

desert.tif
G: [18, -7]
R: [65, -5]