> CS 534
> HW #5 Projects
HW #5 Projects
- Alex Kocher, Blake Nigh and Konnor Beaulier
Adherent Raindrop Detection and Removal in Videos
With the emergence of wearable technologies such as wearable cameras like GoPros,
adverse weather directly affects how the video of these cameras turns out. In the case
of rain, raindrops can stick to the lens of the camera and cause blurring and even more
distortion in the output picture or video. We decided to look at these raindrops as
removable through various techniques, thus removing the blur and distortion.
- Andrew Chase, Bryce Sprecher and James Hoffmire
Motion Detection and Segmentation using Optical Flow
This paper discusses the detection, analysis and segmentation of motion in video using
optical flow algorithms. Optical flow detects motion within each neighborhood of pixels by
registering changes in the color and intensity of pixels from frame to frame. Vectors
indicating the direction and magnitude of detected motion are created, and groups of
similar vectors are then segmented from each other. The primary purpose of optical flow is
to use segmentation to isolate individual moving objects within a video. This paper first
introduces the fundamentals of optical flow and offers an overview of existing literature
related to general motion detection. We next discuss the theory behind optical flow,
followed by a description of our method and implementation in MATLAB. Finally, we share
our results along with our takeaways about optical flows applications and limitations.
Web page
- Meiirbek Ashirgaziyev and Aoyu Fan and Alexandra Grupe
Texture Transfer: A Matlab Implementation
In this paper, we present an image-based Matlab implementation of texture transfer. This implementation does not use image quilting for texture synthesis, of which texture transfer is an extension. Instead, it imposes specific constraints on input images concerning the image dimensions and the relative size of the two input images. If these constraints are maintained, this implementation of texture transfer produces high-quality output images.
Web page
- Amr Hassaballah, Jimmy Yuan and Austin Schaumberg
Materialized Tracer
This paper discusses the approach and
implementation to a new video effect coined
the "Materialized Tracer" effect. Within
this paper we will describe our method in
detail, including pseudocode and some of the
general parameters used to produce a given
tracer's frequency and density. Commonly
speaking, the technique is a fusion of
foreground segmentation combined with
a clever approach to image overlaying in
order to produce the effect. Moreover, we
will provide an analysis on our algorithm's
efficiency in addition to a variety of our
project's initial results.
Web page
- Brett Geschke, John Carmichael and Steven Radeztsky
Pics2Comic
Our project will be an Android application that will allow any user to
submit images with the goal to create a comic from those images. For the
sake of simplicity the user will submit 2-4 images. Those pictures will then
be cartoonified using bilateral filtering and edging techniques learned in
lecture. Our application will then display those images in a layout similar to
a comic book. The user will have the ability to write their own captions for
each picture. To make it more of an interactive experience, the user can
click on each image to view the image and its caption. This effect will
create an interactive comic book experience so that people of all ages and
artistic abilities can use our application.
Web page
- Chao Li, Bryan Suzan and Alayna Truttmann
Photo Booth Utilizing Non-Photorealistic Rendering
This project aims at filtering input images with various effects, via means of color quantization,
bilateral filtering, tone mapping, color adjustments, and more. Specifically, the filters we
implemented were sketch, X-ray, sepia, black and white, cartoon, and pop art. Our program
allows users to quickly and easily select images and customize filter effects on them through our
user-friendly GUI. Users can also compare different effects through a photo booth style setup.
Users can then share these images if they choose through social media.
Web page
- Evan Hernandez and David Liang
Style Transfer
Artistic style transfer is the process of merging the style of one
image with the content of another. This is a useful technique
because it saves animators drawing time, particularly when
a large sequence of stylized images is necessary. Furthermore, the
effects are visually appealing, so the technique can be applied simply
to produce beautiful images for any purpose. At the same time,
artistic style transfer is difficult because the notions of style and
content are abstract and difficult to separate. After all, content
and style aren't quantitative by any means, so creating a digital
representation of these two characteristics seems nearly impossible.
However, Gatys et al. have developed a style transfer algorithm
that uses a deep convolutional neural network (CNN) to divorce
image style and content and Ruder et al. propose an extension
of this technique to video. In this paper, we will describe these
algorithms, our implementation, and our results.
Web page
- Garrett Andrews, Matthew Muccianti , Elliott Janssen Saldivar and Aabhas Singh
3D Mesh Reconstruction from a Structure-from-Motion Point Cloud
Simultaneous Localization And Mapping (SLAM) is commonly used to represent and
estimate spatial uncertainty for autonomous algorithms such as collision avoidance systems
and scene reconstruction for robotics. We utilize drone video to create a 3D model of an object
or scene post-production in MATLAB. To represent the model, we attempt to use a MATLAB
implementation of structure-from-motion as well as Microsoft Photosynth to create a point
cloud of the scene. Three algorithms (Marching Cubes by MeshLab, Poisson Reconstruction by
MeshLab, and our custom algorithm in MATLAB) were tested against this point cloud to
determine which algorithm can best recreate the 3D representation of the structure. While
mostly used for 2D applications, MeshLab offers an implementation of Delaunay triangulation
so we compare its results with the other three.
Web page
- Joseph Hushek, Haohua Wang and He Wang
Finger Face
Inspired by face stickers made popular by Snapchat, we decided to create a program to
add stickers to fingers. In the project, we detect a hand within an image by a range of skin colors,
select the hand, use contour features to identify the fingertips, then add a face sticker on each of
the fingertips. The result is an image of a funny-looking hand.
- Jacob Draeger, Alex Herreid and Raghav Bhagwat
Face Swap
For our project, we wanted to create a face swapping program using MATLAB and computational photography skills. we used MATLAB's inbuilt implementation of the Viola-Jones algorithm for face detection and feature point recognition. This was a function of MATLAB's Computer Vision Toolbox. We created our own face mask of 13 specific points and used the polygon function of the face detection method to draw our mask around the area of the face. After identifying the faces, we crop them out and swap them and use a blending algorithm to give a more uniform result. We designed our custom face mask to give us more accurate results and implemented multiple snapchat style masks.
Web page
- Jing Qian, Xiaofei Liu and Ruihao Zhu
Shinkai Makoto's Imagination: An Attempt to Mimic the Japanese Anime Style
The project is motivated by the artwork of Japanese anime director, Shinkai Makoto. This report demonstrates the process of converting a real life landscape image into the typical hand drawing style of Shinkai Makoto. The process is divided into three steps, including detecting the sky region of an input image, animefying the input image and replacing the original sky region with the selected sky source image. While we successfully transform some input images to the anime style, there are also some limitations in this program. One is that sky detection algorithm is not robust for all kinds of input images and the other is that color adjustment works better for input images with higher brightness and saturation.
Web page
- Christina Stiff, Nicholas Smith and Joanne Lee
Automatic Background Decolorizer
This project outlines a method to take a color image as input, distinguish between the
foreground and background regions using the k-means++ clustering algorithm, and decolor the
background region to emphasize the in-color foreground subject. In our implementation, we
use color as the feature vector and assume the cluster with the most assigned pixels to be the
background region that is decolored. The pixels assigned to the other clusters remain in color.
Web page
- John Louk, Brian Nelson and Riley Morrison
Stellar Photo Matching
Capturing the beauty and wonder of the night sky has been the goal of many
photographers. However with the rise of cell-phone cameras and light pollution, taking photos of
stars has become quite challenging. Phone cameras do not have the same capabilities as true
cameras and cannot get the same level of detail. The goal of our Stellar Photo Matching program
is to improve these poor quality star images and make appear natural. This is done by matching
star patterns in the image to patterns in a star database using the Murtagh method. Then our
method will project new stars from the database onto the original image aligning based on
homography. The output image will show the stars that would have appeared had the photo been
taken under better conditions. Our program, tested on matching the Orion constellation to its
surrounding stars, has been successful and proves that star matching can improve the quality of
star photos effectively.
Web page
- Jake Becco, Justin Aniban and Joseph Hoffmann
Fingerprint Matching
Fingerprint recognition has been researched for a long period of time and it has
shown fantastic results in the real world. However, due to the near infinite
possibilities of fingerprint variations, fingerprint recognition can be a challenging
problem (Raja et al., 2009) . When comparing two fingerprints there are many
different facets that can go wrong, and the most important is the feature matching
algorithm. With the advancement of technology, fingerprint scanners are becoming
more widely used. However, these scanners are becoming smaller, such as on
phones, and only sense only a limited portion of the fingerprint. Therefore, with
these small fingerprints, only a small portion of overlap can occur causing
difficulties in determining matches (Figure 1). Because of this, advanced matching
algorithms must be used to determine matches with a respectable success rate. The
most successful algorithm is based off minutiae points.
- Josh Chaimson, Bryce Greiber and Sambhav Jain
Face Replacement
Face replacement has gained a lot of popularity through many new applications -- one of them being the face swap feature on Snapchat. Our algorithm will take in input images, allow the user to select an image that needs face replacement, and use landmark features to help choose what photo will be best to choose parts of the face from. When choosing the image, users will also be able to choose what particular part of the photo they want to replace by creating a mask. The last step is to blend the replacement face into the new head.
- Kailee Tapia
Adding Text to Images
The TextToImage program allows users to import an image and text. The user
then determines how they want the text aligned both vertically (top-aligned, bottomaligned,
or centered) and horizontally (left-aligned, right-aligned, or centered) on the
image, as well as how large they would like the text, what font they would like used,
and whether or not they want it bold, italic, or plain. The program then uses the height
and width of both the image and the text to determine the correct alignment based on
the user's input. The program also allows the user to input what they would like the
new image saved as.
Web page
- Lingfeng Huang and Fang Wang
Discovering Panoramas in Web Videos
Panoramas have been widely used in may applications in multimedia, but the main
constraint for panoramas is that they must be taken by people who physically present at the
place. In this project, we implement our version of Discovering Panoramas in Web Videos by
Liu et al. Our goal is to solve two problems: First, the program should be able to select
optimal segments within a given web videos to synthesis a panorama using visual quality
measurement introduced in Liu et al.[1] Second, the program should be able to generate a
set of panoramas if multiple panoramas can be synthesized in a given video. The whole
procedure is a optimization problem where we optimize the three criteria which are wide
field of view, mosaicability, and high image quality.
Web page
- Arjun Gurumurthy, Ashok Marannan and Manoj Nagarajan
Machine Learning based Efficient Image Colorization
Image colorization is a process that adds color to grayscale images. Manual image colorization
is tedious and prone to human error. Existing approaches to colorize images automatically use
either pixel-based scanning (expensive), are scribble-based (manual), assume strong correlation
with target grayscale image, or require large number of training examples. In this project, we
aim to look at approaches that perform automatic image colorization with a small number of
similar reference images using Machine Learning. Initially, (1) we plan to design features which
would capture different properties of a grayscale image for training ML models. Then, (2)
grouping pixels into superpixels enables capturing of local information with reduced
complexity. If we have a corpus of training images, (3) one or more of Machine Learning
algorithms like support vector regression or k-means could be used to train an algorithm to
colorize parts of images with reasonable accuracy. Based on the output of the model, (4) we
perform some post-processing (like smoothing) to re-assign colors of segments predicted with
low confidence. Via these four steps, we color images with reasonable accuracy using only a
small subset of training images than needed in other approaches.
Project report
- Mary Feng and Sabrina Yu
Image Analogies for Artistic Filters
This project focuses on implementing image analogies for artistic filters as outlined in the paper
of the same name (Hertzmann et al., 2001). Given a pair of training images A (an unfiltered
image) and A' (some "filtered" version of image A), the same "filter" is applied to B to produce
B', thus completing the image analogy. While image analogies can be used for a wide variety of
effects depending on the training pair provided, we chose to focus on artistic filters. We found
that some test images and filters provided better results than others. Although substantial effort
was put into this project, there were various challenges (especially time, since synthesizing B'
takes many hours even for small images) leading to output images not looking as pleasant as
desired due to noise. Possible reasons for this are discussed.
Project repository
- Matthew Nicol, Brad Miller and James Merrill
Partial Image Placement Software (PIPS)
A fluid object to picture placement and blending method is something many people could use in their daily lives. With how expensive Photoshop and other alternatives are, and the difficulty of using them, having a program that can do everything from cutting out the object in the image to placing it a second image and blending it is something worth having.
- Micaela Connors, Angus Kinsey and Jason Chen
Visual Timeline
The goal of this project is to highlight differences between two contrasting
images taken of the same scene at different points in history or at different
times of day by combining them to create a visual timeline output image. This
will be accomplished by compositing two input images into one blended image
where each image takes up about half of the space in the final processed image.
In order to combine the images, we will use the Laplacian Pyramid Blending
method to seamlessly combine the images, creating a composite output image
of the two input images in the middle and leaving the originals unchanged
further from the center.
Web page
- Jackson Milkey, Michael Salmon and Andrew Zeitlow
Cartoonization of Images
Based on the paper "Realtime Video Abstraction," our project aims to implement a form of
nonphotorealistic
rendering . A variety of methods are used to abstract an image: detailed
regions are exaggerated while visually uninteresting areas are artificially reduced. With an
abstracted image, simple color quantization is employed to achieve a smooth, cartoonlike
appearance.
Web page
- Neha Godwal, Sidharth Mudgal and Yipeng Zhang
Converting 2D to 3D
Due to advancements in visuals and emerging market of virtual reality, there is a vested interest of users to create 3D content. In today's world, the creation of 3D movies can be done using stereo cameras, which is generally not affordable by many people whereas 2D video cameras are easily accessible to everyone. In this project, our aim is to achieve automatic conversion of 2D videos to 3D using deep neural networks. Our approach is to train deep neural network model on stereo pairs extracted from 3D movies/videos. As compare to other algorithms for 2D to 3D conversion, we expect deep-learning model performs better in quantitative and human subject evaluations.
Web page
- Wangtao Lian, Rulan Zheng and Ruiqi Yin
Generating Artistic Drawing Styles
Phones and camera have become accessible in the modern era; hence, taking a photo has
become common to public. Some artists have used these snapshots as inspiration to create
different art forms; however, not everyone who wants to convert images to aesthetic outcome has
the necessary skills. In this project, we have created a program to resolve this issue by asking the
user to choose the artistic styles they want. Our project aims to generate four different styles to
an input image, and the output image could be converted into a cartoonified picture, a contour
drawing, pencil drawings in black and white, and in colors.
Web page
- Jacob Holiday and Sahil Verma
Face Swapping/Filters
Applications that add filters/effects to one's face have become extremely
popular as of late. Mobile phone users have developed a new demand for
applications which can take their selfies to the next level by adding fun
effects. The goal of this project was to learn more about the technologies that
go into face detection, feature recognition, and image warping in order to
help others gain a better understanding of the core concepts behind hugely
popular applications like Snapchat. Previous projects demonstrate this same
goal although with less accuracy in filter placement and more pixilation and
noise around the edges of filters. To enhance filter placement, we utilize
OpenCV and DLib for accurate face detection and detailed facial feature
point recognition. In addition, we explore a simple method by which to
inform the program how a filter should fit onto to a face. We also implement
a method of alpha blending for clean compositing of transparent filters over
images. We hope this project gives others the confidence to explore this
quickly emerging area of computational photography and further improve
on our work.
Web page
- Eric Arndt, Justin Xayarinh and Shuruthy Yogarajah
Pencil Sketching
Technology has advanced to the point where a digital artist can create a photorealistic image,
which is any image that appears to be real, but is not. Even though this is the case, for a
number of years computational photographers have been creating various ways to make digital
photos look hand drawn, as if cameras have never existed. This is called non-photorealistic
rendering. Our algorithm is based off of Lu et al. [2012] and other related non-photorealistic
rendering algorithms. It begins with any photo and produces an output that mimics a
pencil-drawn sketch. This pencil-drawn sketch is created by combining a line drawing image
with a tonal texture image.
Web page
- Shu Chen, You Wu and Sijia Zhang
Turning Scene Photos into Anime Art
After seeing this year's phenomenal film, "Your Name," by Shinkai Makoto, we feel
strongly motivated to virtually bring this kind of pure sky back to real life. Shinkai
Makoto's film is known for beautifully designed images that every frame of his film can
be used as wallpaper. We feel like we can dedicate to transform real life scenes in the
world to a gorgeous piece of art.
This project, turning scene photos into an anime art is a project involving multiple
processes. In general, the project includes a segmentation algorithm and two filter
algorithms. When fulfilling what we are aiming for, we encounter a problem. Since real
life clouds do not look like the marshmallow clouds in the anime, if only apply an oil
paint filter, it does not achieve the effect we are looking for. As the result, we decide to
take out the whole sky area and replace it with a pre-set anime sky full of prominent
clouds. The overall process of the project can be divided into six parts. First, we read the
input image and adjust its size to a pre-set value. Second, we detect the sky area of the
image and crop it out by utilizing a segmentation algorithm. Third, we apply customized
filter as well as the bilateral filter to the rest of the image. Fourth, we apply the edge
sharpening technique to enhance the image's overall quality. Fifth, we adjust other
factors of the image such as brightness, exposure, saturation, and etc. Last, we put the
previously designed anime sky to the image to form an intact final output image. This
document provides an analytical elucidation of the design, the process, the inspiration,
the implementation, the methodology and the final result. The references we consulted
are listed at the end and mentioned throughout the report.
- Cheng Xiang, Siyu Chen and Yizhe Qu
Face Detection and Adding Stickers
We decided to work on a Matlab program that achieves face recognition and modification on the detected faces. Face detection is a commonly used technology these days. Such image processing technique is used to identify human faces in digital images. Many popular developers now have deployed this feature, such as Apple and Microsoft. In addition to the face detection feature, we decided to add modifications such as adding a pair of glasses over all the faces we detected from a photo.
Web page