HW4: Image stitching

GIF-4105/7105 H2017

Author: Henrique Weber

The goal of this assignment is to create a panorama from a set of photos. To that end, the following steps were implemented:

Manually select some feature points on each image.
Find the homography matrix that aligns each pair of neighboring pictures.
Transform the source image so that it lies in the same projective space as the target image.
Stitch the images by taking the target image and placing it at the location given by the inverse of the homography matrix.

All these steps are illustrated below together with the results.

Part 0: Warmup

The first part of the assignment was to build the function "imgTrans = applyTransformation(img, H)", which transforms the image "img" by the homography matrix "H". In theory it suffices to apply H to every pixel position of the image and copy each intensity to its new position. In practice, however, this approach leads to three problems: (1) the final image may have holes at locations that no pixel of the original image maps to, (2) many pixels collapse onto the same location whenever the image is warped (which is almost always the case), and (3) we do not know the size of the transformed image in advance. To overcome problems (1) and (2) we can use the inverse of H and, for every point in the final image, look up the pixel intensity in the original image. To overcome problem (3) we can apply H to the corner points of the original image and take the minimum and maximum of the resulting coordinates to obtain the extent of the final image. The result can be seen below.
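The inverse-mapping idea can be sketched in a few lines. This is a minimal pure-Python illustration and not the MATLAB function from the assignment: the names apply_h, invert3 and warp_image are hypothetical, the image is assumed to be a grayscale list of rows, and nearest-neighbor lookup stands in for proper interpolation. It also assumes the four corners bound the warped image, which holds as long as the image does not cross the horizon line of H.

```python
import math

def apply_h(H, x, y):
    """Map the point (x, y) through the 3x3 homography H (list of rows)."""
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return u / w, v / w

def invert3(M):
    """Inverse of a 3x3 matrix via the adjugate (cofactor) formula."""
    (a, b, c), (d, e, f), (g, h, i) = M
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    return [[(e * i - f * h) / det, (c * h - b * i) / det, (b * f - c * e) / det],
            [(f * g - d * i) / det, (a * i - c * g) / det, (c * d - a * f) / det],
            [(d * h - e * g) / det, (b * g - a * h) / det, (a * e - b * d) / det]]

def warp_image(img, H, fill=0):
    """Warp a grayscale image by H using inverse mapping.

    Returns the warped image and the (x, y) position of its top-left
    corner in the target frame.
    """
    h, w = len(img), len(img[0])
    # Problem (3): transform the four corners to find the output extent.
    corners = [apply_h(H, x, y)
               for x, y in [(0, 0), (w - 1, 0), (0, h - 1), (w - 1, h - 1)]]
    xmin = math.floor(min(c[0] for c in corners))
    xmax = math.ceil(max(c[0] for c in corners))
    ymin = math.floor(min(c[1] for c in corners))
    ymax = math.ceil(max(c[1] for c in corners))
    Hinv = invert3(H)
    out = []
    for v in range(ymin, ymax + 1):
        row = []
        for u in range(xmin, xmax + 1):
            # Problems (1) and (2): pull every output pixel from the source,
            # so each output location is filled exactly once.
            x, y = apply_h(Hinv, u, v)
            xi, yi = round(x), round(y)
            row.append(img[yi][xi] if 0 <= xi < w and 0 <= yi < h else fill)
        out.append(row)
    return out, (xmin, ymin)
```

Calling warp_image(img, H) on a pure scaling by 2 fills every output pixel, whereas pushing source pixels forward would leave every other pixel empty. The returned corner position is what a later stitching step needs in order to place the warped image.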

Target image.
Image after applying the transformation.

Part 1: Manual matching

The second part of the assignment consisted of stitching images to compose a panorama. For this, the images must all be taken with a camera having (approximately) the same center of projection. To know exactly where the pictures overlap we must provide a homography matrix H that projects one picture onto the same plane as its neighbor. The matrix H can in turn be estimated from 4 pairs of points that mark the same location in both images. Having these 4 (or more) pairs of points lets us build a system of linear equations whose solution is H. In this part of the assignment, the pairs of points were chosen manually. Since small errors in the location of those points lead to large errors in the homography estimate, it is better practice to choose more than 4 pairs. Below are the results.
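For reference, the linear system mentioned above can be written out for a single correspondence. The notation is an assumption made for this sketch: (x, y) is a point in the source image, (x', y') its match in the target image, h_{ij} are the entries of H, and h_{33} is fixed to 1 to remove the free scale.

```latex
% Projection of (x, y) by H, with h_{33} = 1:
x' = \frac{h_{11} x + h_{12} y + h_{13}}{h_{31} x + h_{32} y + 1}, \qquad
y' = \frac{h_{21} x + h_{22} y + h_{23}}{h_{31} x + h_{32} y + 1}

% Multiplying by the denominator gives two equations linear in the h_{ij}:
\begin{bmatrix}
x & y & 1 & 0 & 0 & 0 & -x x' & -y x' \\
0 & 0 & 0 & x & y & 1 & -x y' & -y y'
\end{bmatrix}
\begin{bmatrix}
h_{11} \\ h_{12} \\ h_{13} \\ h_{21} \\ h_{22} \\ h_{23} \\ h_{31} \\ h_{32}
\end{bmatrix}
=
\begin{bmatrix} x' \\ y' \end{bmatrix}
```

Each pair of points contributes two rows, so 4 pairs give the 8 equations needed for the 8 unknowns. With more than 4 pairs the stacked system is overdetermined and can be solved in the least-squares sense, which is why extra points reduce the effect of clicking errors.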

Manual results for sequence 1.

Image 1 with points related to image 2.
Image 2 with points related to image 1.
Image 2 with points related to image 3.
Image 3 with points related to image 2.

The result is an interpolation of all images in their right place. Finding this "right place" was for me the most difficult part of the assignment! I had a hard time finding a way to keep track of the image positions while placing them in the final panorama. My final solution was to create a vector containing a translation for each image. As I warp a new image, I calculate its position and update the translations of all images that have already been warped to reflect the new arrangement. In this result I also took the maximum intensity value of the two images where they overlap. It was just an experiment to blend the images without visible artifacts in the sky and to reduce the blur caused by misalignment.
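One possible way to organize this bookkeeping is sketched below. It is not the code used for the results: the function name place_images is hypothetical, images are assumed to be grayscale lists of rows with a background of 0, and each image is assumed to come with the position of its top-left corner in a shared reference frame. Instead of updating the translations incrementally, the sketch recomputes all of them from the smallest corner, which gives the same final arrangement.

```python
def place_images(warped):
    """Compose warped images on one canvas, keeping the maximum on overlaps.

    `warped` is a list of (image, (xmin, ymin)) pairs, where (xmin, ymin)
    is the top-left corner of the image in the shared reference frame.
    Returns the canvas and the translation applied to each image.
    """
    # The canvas origin is the smallest corner over all images, so adding
    # an image that extends further left or up shifts every translation.
    ox = min(x for _img, (x, _y) in warped)
    oy = min(y for _img, (_x, y) in warped)
    width = max(x + len(img[0]) for img, (x, _y) in warped) - ox
    height = max(y + len(img) for img, (_x, y) in warped) - oy

    canvas = [[0] * width for _ in range(height)]
    offsets = []
    for img, (x, y) in warped:
        dx, dy = x - ox, y - oy
        offsets.append((dx, dy))
        for r, row in enumerate(img):
            for c, value in enumerate(row):
                # Maximum blending: the brighter pixel wins on overlaps.
                canvas[dy + r][dx + c] = max(canvas[dy + r][dx + c], value)
    return canvas, offsets
```

Maximum blending hides seams in bright regions such as the sky, but it also drops any dark detail that lies over a brighter pixel from the other image.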

Target image.

Manual results for sequence 2.

Image 1 with points related to image 2.
Image 2 with points related to image 1.
Image 2 with points related to image 3.
Image 3 with points related to image 2.

Here the result is similar to the previous one and also to the result presented in the assignment description. No additional points were needed aside from the ones already provided with the assignment.

Target image.

Manual results for sequence 3.

Image 1 with points related to image 2.
Image 2 with points related to image 1.
Image 2 with points related to image 3.
Image 3 with points related to image 2.

This series of pictures was the most difficult one since there were not many points to match. Many attempts did not produce a good composition, so I had to try several times before obtaining the result below.

Target image.

Once we have the feature points for all images, it is time to match pairs of features between neighboring pictures. To do so, we compute the distance from every point in one image to every point in the other. Following the paper, we use the ratio between the smallest and the second smallest distance from a point to its two closest "siblings" in the other picture. We also use the mean of the distances to the second closest point as a threshold to decide whether a pair of points is close enough to be considered a good match.
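The matching rule can be sketched as follows. This is one reading of the description, not the original implementation: the function name match_features and the ratio value 0.6 are hypothetical, descriptors are assumed to be plain lists of numbers compared by squared Euclidean distance, and the mean second-best distance is assumed to act as an upper bound on the best distance.

```python
def match_features(desc1, desc2, max_ratio=0.6):
    """Match descriptors with a ratio test plus a mean second-neighbor cutoff.

    Returns a list of (i, j) index pairs: descriptor i of the first image
    matched to descriptor j of the second. Needs at least two descriptors
    in desc2 and at least one in desc1.
    """
    def dist2(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    candidates = []
    for i, d in enumerate(desc1):
        # Rank every descriptor of the other image by its distance to d.
        ranked = sorted((dist2(d, e), j) for j, e in enumerate(desc2))
        (best, j), (second, _) = ranked[0], ranked[1]
        candidates.append((i, j, best, second))

    # Assumed reading of the text: the mean second-best distance is a cutoff.
    mean_second = sum(c[3] for c in candidates) / len(candidates)
    return [(i, j) for i, j, best, second in candidates
            if best < max_ratio * second and best < mean_second]
```

A point whose two nearest neighbors are about equally far away is ambiguous and is rejected by the ratio test, even when its best distance is small.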
In the images below we use the MATLAB function showMatchedFeatures(I1,I2,matchedPoints1,matchedPoints2); to display matchedPoints1 and matchedPoints2 over images I1 and I2.
Next, we use RANSAC to find the homography matrix that best describes (given a finite number of attempts) the transformation between one set of points and the other. To do so, we choose 4 pairs of points at random and calculate the homography matrix that describes them. Then we count the other pairs in the set that agree with this homography, given the previously mentioned threshold derived from the mean distance to the second closest neighbor. After a fixed number of trials (I used 100) we pick the homography that gathered the largest number of inliers and take it as the transformation needed to align the images. Finally, we reuse the original inlier pairs to recalculate the homography and make it more accurate.
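The RANSAC loop can be sketched as below. This is a pure-Python illustration under stated assumptions, not the code behind the results. All function names are hypothetical, the inlier test is assumed to be a reprojection distance in pixels with the threshold passed in by the caller, and the homography is fitted with h33 fixed to 1 by solving the normal equations with Gaussian elimination.

```python
import math
import random

def solve(A, b):
    """Solve the square system A x = b by Gauss-Jordan with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]

def fit_homography(pairs):
    """Least-squares homography (h33 = 1) from >= 4 ((x, y), (u, v)) pairs."""
    rows, rhs = [], []
    for (x, y), (u, v) in pairs:
        rows.append([x, y, 1, 0, 0, 0, -x * u, -y * u])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -x * v, -y * v])
        rhs.append(v)
    # Normal equations (A^T A) h = A^T b; the fit is exact for 4 pairs.
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(8)] for i in range(8)]
    Atb = [sum(r[i] * t for r, t in zip(rows, rhs)) for i in range(8)]
    h = solve(AtA, Atb) + [1.0]
    return [h[0:3], h[3:6], h[6:9]]

def project(H, p):
    """Apply the homography H to the point p = (x, y)."""
    x, y = p
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return ((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
            (H[1][0] * x + H[1][1] * y + H[1][2]) / w)

def reprojection_error(H, pair):
    """Distance between the projected source point and its claimed match."""
    src, (u, v) = pair
    px, py = project(H, src)
    return math.hypot(px - u, py - v)

def ransac_homography(pairs, threshold, trials=100, seed=0):
    """Return (H, inliers) for the sample model with the most inliers."""
    rng = random.Random(seed)
    best = []
    for _ in range(trials):
        sample = rng.sample(pairs, 4)
        try:
            H = fit_homography(sample)
            inliers = [p for p in pairs if reprojection_error(H, p) < threshold]
        except (ValueError, ZeroDivisionError):
            continue  # skip degenerate samples, e.g. collinear points
        if len(inliers) > len(best):
            best = inliers
    if len(best) < 4:
        raise ValueError("no usable homography found")
    # Final step from the text: refit on all inliers of the best model.
    return fit_homography(best), best
```

The normal equations are adequate for small, well-spread coordinates, but they become numerically fragile with large pixel values. An SVD-based solver would be the more robust choice in practice.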

Automatic results for sequence 1.

The resulting panorama took a while to compute on my computer (around 5 min), but the result is mostly good. Since I again took the maximum of the two pixel values to get the final value, we lost part of the cable on the left of the image. The boat is also blurry, but it looks like it was moving while the photos were being taken.

Automatic results for sequence 2.

Above are the points that were found by RANSAC to describe the transformation. The result looks better than the one that was created manually.

Automatic results for sequence 3.

Image 1 with points related to image 2.

Image 2 with points related to image 1.

Image 2 with points related to image 3.

Image 3 with points related to image 2.

Image 1 with points related to image 4.

Image 4 with points related to image 1.

Image 2 with points related to image 5.

Image 5 with points related to image 2.

Image 3 with points related to image 6.

Image 6 with points related to image 3.

Here is another result with the pictures provided with the assignment. This time I average both images where they overlap, so the blur is visible wherever they are not perfectly aligned.

Result over my own pictures.

Image 1 with points related to image 2.

Image 2 with points related to image 1.

Image 2 with points related to image 3.

Image 3 with points related to image 2.

Image 1 with points related to image 4.

Image 4 with points related to image 1.

Here is one result with my own pictures. The alignment looks good and the exposure of all pictures is similar, which helped to create a nice panorama.

Another result with my own pictures.

Image 1 with points related to image 2.

Image 2 with points related to image 1.

Image 2 with points related to image 3.

Image 3 with points related to image 2.

Image 1 with points related to image 4.

Image 4 with points related to image 1.

In this series of photos I rotated the camera slowly, which gives a better result (as can be seen from the well-aligned borders).

Bells and whistles: my family on a sign (Source).

To place the picture in the right place it was enough to define the four corners in each image and then compose them.

Bells and whistles: graffiti (Source) on ancient China (Source).

Here the graffiti was composited onto a wall. To use only the graffiti, its background was set to black so that it can be ignored when the two images are composed.
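The masking step can be sketched as follows. This is an illustration only: the function name composite_ignoring_black is hypothetical, pixels are assumed to be (R, G, B) tuples, and the overlay is assumed to have already been warped into place, so that only its position on the background is needed.

```python
def composite_ignoring_black(background, overlay, top, left):
    """Paste `overlay` onto a copy of `background`, skipping pure black pixels.

    Both images are lists of rows of (R, G, B) tuples. (top, left) is the
    row and column of the overlay's first pixel on the background.
    """
    out = [list(row) for row in background]
    for r, row in enumerate(overlay):
        for c, pixel in enumerate(row):
            y, x = top + r, left + c
            inside = 0 <= y < len(out) and 0 <= x < len(out[0])
            # A pixel with all channels at zero is treated as background.
            if inside and any(pixel):
                out[y][x] = pixel
    return out
```

One limitation of using black as the transparency key is that truly black parts of the graffiti are also dropped. A separate mask image would avoid that.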