Eilam Levitov - cs194-26-acx
This notebook runs on Python 2.7.
Here I will be demonstrating the sharpening technique developed in class.
# read in the image, convert to applicable representation, and display
# Use gaussian filter to extract high frequencies
# Original vs Sharp bunny
# Original vs Sharp road (Joshua Tree)
After applying the Gaussian filter, some pixels become negative (shown in red) or saturated (shown in blue), so a slight modification was needed to improve the final image. To fix that, I tried: (1) normalizing the image, (2) shifting and scaling, (3) both, and finally (4) clipping.
In the end, (4) was found to be most effective, and is the one demonstrated above.
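The sharpening-with-clipping pipeline can be sketched as follows; this is a minimal illustration using scipy's gaussian_filter, and the sigma and alpha values are placeholders rather than the ones used in the notebook.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def sharpen(im, sigma=2.0, alpha=1.0):
    """Unsharp masking: boost high frequencies, then clip back to [0, 1]."""
    low = gaussian_filter(im, sigma)             # low frequencies
    high = im - low                              # residual high frequencies
    # clipping (approach (4) above) fixes negative/saturated pixels
    return np.clip(im + alpha * high, 0.0, 1.0)

# toy usage on a random grayscale image
im = np.random.rand(64, 64)
sharp = sharpen(im, sigma=2.0, alpha=0.8)
```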
Looking at the bunny, the sharpness effect is most noticeable on the fur and the little carrot. In the road image, on the other hand, the effect is much more noticeable all around. We deduce that an image that originally contained a lot of high frequencies is more suitable for this sort of sharpening procedure.
We use Hybrid Images as described in the paper by Oliva et al. Hybrid Images is a technique that produces static images with two interpretations, which change as a function of viewing distance. This technique enables us to layer the high frequencies of one image on top of the low frequencies of another. The results show that when seen up close, the viewer will (mainly) observe the high frequency image, while from afar the low frequency image dominates.
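The core recipe can be sketched in a few lines: low-pass one image with a Gaussian, high-pass the other, and combine. The cutoff sigmas and image sizes here are illustrative, not the notebook's actual values.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid(im_low, im_high, sigma_low=6.0, sigma_high=3.0):
    """Combine the low frequencies of one image with the high frequencies of another."""
    low = gaussian_filter(im_low, sigma_low)               # keep only low frequencies
    high = im_high - gaussian_filter(im_high, sigma_high)  # keep only high frequencies
    return np.clip(low + high, 0.0, 1.0)                   # clip handles out-of-range pixels

# toy usage with random grayscale images of matching size
a, b_img = np.random.rand(48, 48), np.random.rand(48, 48)
hyb = hybrid(a, b_img)
```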
# Read, convert, and display images
# Set cutoff frequencies, separate frequencies of each image, display
# Average both low and high frequency images, and display hybrid image
# Take the Fourier Transform and display detailed plot of frequencies in each image
# Mirrored faces
# Frequency Analysis
Using color to (try to) enhance the hybrid image effect
# Mirrored faces, with color!
# Low Frequencies in each channel
# High Frequencies in each channel
# Hybrid Frequencies in each channel
Combining several images using high-pass, band-pass, and low-pass filters
# Read, convert, and display images
# First filtering layer
# Display result of 1st layer
# Second filtering layer
# Display result
# 3 image blend frequency plot
The Hybrid Image technique of layering high and low frequencies provides a way to create textures that become visible as a function of viewing distance. As seen above, it is necessary to find compatible signals (or, equivalently, preprocess the signals thoroughly) in order to get good results. In our case, the eagle and F16 combined rather well, while the image of the couple seems quite visually odd (although funny). Finally, although the quality of the image could have been better, the simplicity of the technique makes it an attractive tool when considering such effects.
The Fourier transform allows us to see the abundance of frequencies in each signal. As seen above, the low frequency signal is much richer in low frequencies, and the high frequency signal follows accordingly. The full-bandwidth image shows a balanced distribution of frequencies, which is what we would expect.
*A note on the presented frequency analysis: a Gaussian is far from an ideal low-pass filter, thus, as can be seen above, there are some residual low frequencies in the high frequency plots, as well as high frequencies in the low frequency plots.
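Frequency plots like the ones above are typically produced from the log magnitude of the centered 2D FFT; a minimal sketch (the function name and image are illustrative):

```python
import numpy as np

def log_spectrum(im):
    """Log-magnitude of the centered 2D Fourier transform of a grayscale image."""
    # fftshift moves the zero-frequency component to the center of the plot;
    # the small epsilon avoids log(0)
    return np.log(np.abs(np.fft.fftshift(np.fft.fft2(im))) + 1e-8)

spec = log_spectrum(np.random.rand(32, 32))
```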
In this part we will make use of Laplacian and Gaussian Stacks. A Gaussian Stack is generated by repeatedly filtering each subsequent level of an image, consequently stacking lower and lower frequencies. Using the Gaussian Stack we can generate a Laplacian Stack, which stores high frequencies, starting from the highest. To make the Laplacian Stack, at each level i we subtract the i-th level of the Gaussian Stack from the (i-1)-th level, giving us the residual high frequencies in between the Gaussian Stack levels.
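The construction above can be sketched as follows; the sigma schedule (doubling per level) is an assumption for illustration, not necessarily the notebook's exact parameters. Note that the Laplacian stack, with the last Gaussian level appended, telescopes back to the original image.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gaussian_stack(im, levels=4, sigma=2.0):
    """No downsampling: each level is a further-blurred copy of the previous one."""
    stack = [im]
    for i in range(levels):
        stack.append(gaussian_filter(stack[-1], sigma * 2 ** i))
    return stack

def laplacian_stack(gstack):
    # L_i = G_{i-1} - G_i: residual high frequencies between Gaussian levels;
    # keeping the last Gaussian level lets the stack sum back to the image
    return [gstack[i - 1] - gstack[i] for i in range(1, len(gstack))] + [gstack[-1]]

im = np.random.rand(32, 32)
g = gaussian_stack(im, levels=4)
l = laplacian_stack(g)
recon = sum(l)   # telescoping sum recovers the original image
```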
# Decomposition and Reconstruction of the Laplacian Stack (no sub/upsampling in our stack case)
# image found on pinterest, made by Teal Scott
# 4 layered Gaussian stack
# 4 layered Laplacian stack
# Read, convert, and display image
# 5 layered Gaussian and Laplacian Stack
# Read, convert, and display images
# 5 layered Laplacian and Gaussian Stack
# Display Gaussian Stacks
# Display Laplacian Stacks
# Display result
In this section we showed the beauty and power of the Gaussian and Laplacian Stacks. As can be seen above, this technique allows the observer to visualize the frequency analysis of the image, as well as generate a hybrid image using different levels of the Laplacian Stack!
We use the Multiresolution Spline technique described in the paper by Burt et al. for combining two or more images into a larger image mosaic. This method is applied in a layered manner, where we take advantage of the Gaussian and Laplacian stacks in order to establish a visually appealing blended image.
At the i-th layer of the stacks, we combine the two images in the Laplacian stacks by adding them, weighted by the corresponding mask level from the Gaussian stack.
# Read, convert, and display images
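The per-layer combination can be sketched as follows; the stack helpers and the doubling sigma schedule are illustrative assumptions, not the notebook's exact code.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def blend_level(lap_a, lap_b, mask_level):
    """Blend one Laplacian level: mask-weighted sum of the two images' levels."""
    return mask_level * lap_a + (1.0 - mask_level) * lap_b

def multires_blend(im_a, im_b, mask, levels=5, sigma=2.0):
    def gstack(x):
        s = [x]
        for i in range(levels):
            s.append(gaussian_filter(s[-1], sigma * 2 ** i))
        return s
    def lstack(g):
        return [g[i - 1] - g[i] for i in range(1, len(g))] + [g[-1]]
    la, lb = lstack(gstack(im_a)), lstack(gstack(im_b))
    gm = gstack(mask)  # the mask gets progressively softer at each level
    # sum the blended levels to reconstruct the final image
    return sum(blend_level(a, b, m) for a, b, m in zip(la, lb, gm))

# sanity check: an all-ones mask should return the first image unchanged
a, b_img = np.random.rand(32, 32), np.random.rand(32, 32)
out_a = multires_blend(a, b_img, np.ones_like(a))
```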
# Display Gaussian Stack
# Display Laplacian Stack
# Display Masks Gaussian Stacks
# Display result
# On the left - Golden Gate Bridge; Right - Brooklyn Bridge
# Display Gaussian Stacks
# Display Laplacian Stacks
# Display Mask
# Display result
# JLo and Trump
The Multiresolution Blending technique provides an elegant way to blend images. Its main strength is its transitions, which, applied in a layered manner, prove superior to prior techniques. But, quite like former techniques, this method also has its shortcomings: white balancing can become an issue. For example, in the bridge example above, even manual balancing provides an imperfect transition. On the other hand, it showed a perfect transition in the case of headshots - see the JLo and Trump blend above.
In this part of the project we take advantage of the fact that the human eye is tuned to gradients, and process the images in the gradient domain. As we know, our visual perception is highly sensitive to changes rather than absolute values, and so a task like blending should theoretically be very well suited to such novel techniques.
In this part we effectively check if theory aligns with application, or more correctly, make sure our math is correct.
Our problem is as follows: if we have one known pixel and the changes (gradients) of the image, can we perfectly reconstruct the image?
We have formulated the problem as a linear algebra problem, specifically as Ax = b, using 3 types of equations:
where s denotes the source vector, and v is our result vector.
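The setup can be sketched on a toy image: one row in A per x-gradient, one per y-gradient, plus a single row pinning the known corner pixel, solved with scipy's sparse least-squares (lsqr). The image here is random filler, not the Toy Story image.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import lsqr

s = np.random.rand(8, 8)          # "source" image we pretend to know only via gradients
h, w = s.shape
rows, cols, vals, b = [], [], [], []

def idx(y, x):
    return y * w + x              # flatten 2D pixel coordinates to a vector index

eq = 0
for y in range(h):
    for x in range(w):
        if x + 1 < w:             # x-gradient: v(y, x+1) - v(y, x) = s(y, x+1) - s(y, x)
            rows += [eq, eq]; cols += [idx(y, x + 1), idx(y, x)]; vals += [1.0, -1.0]
            b.append(s[y, x + 1] - s[y, x]); eq += 1
        if y + 1 < h:             # y-gradient: v(y+1, x) - v(y, x) = s(y+1, x) - s(y, x)
            rows += [eq, eq]; cols += [idx(y + 1, x), idx(y, x)]; vals += [1.0, -1.0]
            b.append(s[y + 1, x] - s[y, x]); eq += 1
# the single known pixel pins the overall brightness
rows.append(eq); cols.append(idx(0, 0)); vals.append(1.0); b.append(s[0, 0]); eq += 1

A = sp.csr_matrix((vals, (rows, cols)), shape=(eq, h * w))
v = lsqr(A, np.array(b))[0].reshape(h, w)   # least-squares solve, then reshape
```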
We will try this on an image from Toy Story!
# Read, convert, and display image
# find, display, and ravel the b vector
# Find A
Using scipy's sparse solver we can efficiently solve this least squares problem!
# Solve Ax=b and shift + rescale image back to [0,1] interval
# Reshape and show result
# Display images next to each other for comparison
The reconstructed image looks good with very minimal error, thus our math is correct and we can continue!
In the last part of the project we use the Poisson Blending technique described in the paper by Perez et al. for seamless editing of image regions. This technique follows closely what we did in section 3.1.1, where we used the gradients while keeping some known values to reconstruct. The difference here is that we inject the source's gradient patch into the target, leave the border pixels as they are, and solve using the least squares method. This allows us to transition from one image to another without the need to white balance.
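A small sketch of the setup just described: inside the masked region the unknowns satisfy the source's discrete Laplacian, while pixels on the border keep the target's values. All names (target, source, mask, poisson_blend) are illustrative, and the mask is assumed not to touch the image boundary.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def poisson_blend(target, source, mask):
    """Solve for pixels inside mask so their gradients match the source's."""
    h, w = target.shape
    idx = -np.ones((h, w), dtype=int)
    inside = np.argwhere(mask)               # row-major list of unknown pixels
    for k, (y, x) in enumerate(inside):
        idx[y, x] = k
    n = len(inside)
    A = sp.lil_matrix((n, n))
    b = np.zeros(n)
    for k, (y, x) in enumerate(inside):
        A[k, k] = 4.0
        b[k] = 4.0 * source[y, x]
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            b[k] -= source[ny, nx]           # source gradient term
            if idx[ny, nx] >= 0:
                A[k, idx[ny, nx]] = -1.0     # neighbour is also unknown
            else:
                b[k] += target[ny, nx]       # neighbour is a known target border pixel
    out = target.copy()
    out[mask] = spsolve(A.tocsr(), b)        # same row-major order as `inside`
    return out

# toy usage: a flat source patch blended into a flat target
target = np.full((10, 10), 0.5)
source = np.zeros((10, 10))
mask = np.zeros((10, 10), dtype=bool)
mask[3:7, 3:7] = True
out = poisson_blend(target, source, mask)
```

Since the flat source contributes zero gradients, the solved region simply interpolates the target's border value, so the output stays flat.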
Using an injection site (siteInject), we inject the patch into the corresponding location within the target. We'll begin with some grayscale tests.
# Read, convert, and display image
# Read, convert, and display image
# Manual Alignment
# Naive blending (for completeness)
# Find b
# Find A
# Solve Ax=b
# Reshape, show result and error from original
# Display Gradient Fusion
# Read, convert, and display (aligned) images
# Find b
b = b.ravel(order='F')
# Find A
# Gradient Fusion
San Francisco is now under the rule of Sauron!
To my surprise, in this example it seems as if multiresolution blending takes the cake! Although the smooth transition of the gradient fusion is superior, the white balancing is actually too strong, even after manually decreasing the average. Consequently, we have shown that each method has its advantages and disadvantages, and each works better for different images.
Demonstrating again on the bridge image: if I hadn't manually adjusted the average pixel value of the Golden Gate Bridge image, the result of the gradient fusion would have been abysmal! Thus, once again, we see that the power of this technique can actually overly affect the process and result in a poorly blended image.
In this project we played with frequencies and gradients. The most valuable piece of information that I have learned in the process is the true power of the gradient domain (and of efficiently solving sparse matrices). Besides that, I would like to mention that although the scientific community keeps coming up with more advanced, novel techniques (ehm ehm, machine learning people), sometimes one must remember that a simple method might be more suitable for a specific task!