
Computer Vision (CS 763) - Spring 2019 Final submitted Projects

pix2pix: Image-to-Image Translation with Conditional Adversarial Networks
Vamsi Krishna Reddy Satti

pix2pix uses a conditional generative adversarial network (cGAN) to build a general-purpose image-to-image translation system. Image-to-image translation involves learning a mapping from images in one distribution to corresponding images in another. Many problems can be framed this way, including image colorization, edges-to-object visualization, and style transfer. This project is a PyTorch implementation of pix2pix for map-satellite domain translation.
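
For reference, the objective from the pix2pix paper can be written as below, where x is the input image, y the target image, z a noise source, G the generator, D the discriminator, and λ a weighting hyperparameter. This is the paper's formulation; the README does not state whether this project's implementation follows it term for term.

```latex
\mathcal{L}_{cGAN}(G, D) = \mathbb{E}_{x,y}\big[\log D(x, y)\big]
                         + \mathbb{E}_{x,z}\big[\log\big(1 - D(x, G(x, z))\big)\big]

\mathcal{L}_{L1}(G) = \mathbb{E}_{x,y,z}\big[\lVert y - G(x, z) \rVert_1\big]

G^{*} = \arg\min_{G}\max_{D}\; \mathcal{L}_{cGAN}(G, D) + \lambda\, \mathcal{L}_{L1}(G)
```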
Anatomically-aware Facial Expression Synthesis
Vighnesh Reddy Konda, Middepogu Manoj

To generate continuous expressions, we implemented a GAN conditioning scheme based on Action Unit (AU) annotations, which describe, in a continuous manifold, the anatomical facial movements that define a human expression.

Our implementation allows controlling the activation magnitude of each AU and combining several of them. Additionally, we trained the model with an unsupervised strategy that requires only images annotated with their activated AUs, and we exploit attention mechanisms that make the network robust to changing backgrounds and lighting conditions.
Context-aware Captions from Context-agnostic Supervision
Talluri Sai Teja, Suraj Soni, N Jagadeep Sai

A PyTorch implementation of the paper "Context-aware Captions from Context-agnostic Supervision". The objective is to produce pragmatic, context-aware descriptions of images (captions that describe differences between images or visual concepts) using context-agnostic data (captions that describe a concept or an image in isolation).
Pixel-Link
Mayank Kumar Singh, Renuka Sharma, Nagandla Varun Kumar
In this project, we attempt to detect all kinds of text in the wild. The text detection technique is based on the paper PixelLink: Detecting Scene Text via Instance Segmentation (https://arxiv.org/abs/1801.01315) by Deng et al. Text instances in scene images often lie very close to each other, which makes them hard to separate with semantic segmentation alone, so instance segmentation is needed.
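
A minimal sketch of why link predictions help separate adjacent text, assuming the general idea from the PixelLink paper: text pixels are merged into one instance only when a positive link joins them. The function name `group_pixels` and the input layout (a boolean mask plus a set of positive links) are hypothetical choices for illustration, not this project's code.

```python
# 8-connected neighbourhood offsets (row delta, column delta).
NEIGHBORS = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
             (0, 1), (1, -1), (1, 0), (1, 1)]


def group_pixels(text_mask, links):
    """Group text pixels into instances with union-find.

    text_mask: 2D list of bools, True where a pixel is classified as text.
    links: set of ((row, col), (d_row, d_col)) entries, one per positive
           link from a pixel towards one of its 8 neighbours (hypothetical
           layout chosen for this sketch).
    Returns a 2D list of instance labels, with 0 for background.
    """
    rows, cols = len(text_mask), len(text_mask[0])
    parent = {(r, c): (r, c)
              for r in range(rows) for c in range(cols) if text_mask[r][c]}

    def find(p):
        # Follow parents to the root, compressing the path as we go.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Merge two text pixels when a positive link points from one to the other.
    # Every pixel is visited, so a link in either direction is enough.
    for (r, c) in list(parent):
        for dr, dc in NEIGHBORS:
            neighbor = (r + dr, c + dc)
            if neighbor in parent and ((r, c), (dr, dc)) in links:
                union((r, c), neighbor)

    # Assign consecutive labels in row-major order of first appearance.
    labels = [[0] * cols for _ in range(rows)]
    ids = {}
    for p in sorted(parent):
        labels[p[0]][p[1]] = ids.setdefault(find(p), len(ids) + 1)
    return labels


# Four touching text pixels: semantic segmentation sees one blob,
# but the missing link between columns 1 and 2 splits it into two instances.
mask = [[True, True, True, True]]
positive_links = {((0, 0), (0, 1)), ((0, 2), (0, 1))}
print(group_pixels(mask, positive_links))
```

In the toy input all four pixels are text, yet only the pairs in columns 0-1 and 2-3 are linked, so they come out as two separate instances.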
Neural Network Compression
Akhil Gakhar, Prashant Kumar Sharma, Kurapati Hitesh

The main objective of this project is to explore ways to compress deep neural networks so that state-of-the-art performance can be achieved on resource-constrained devices, e.g. embedded devices.
MonoSLAM
Neelesh Verma, Swadha Sanghvi, Maitrey Gramopadhye

We present an algorithm to recover the 3D trajectory of a camera using feature-based sparse SLAM with a single camera (also called MonoSLAM).
Bokeh Effect
Yash Khemchandani, Mashkaria Satvik Mehulbhai, Arpit Aggarwal

We aim to build a robust model that produces a bokeh effect from an input image using deep learning, without advanced camera lenses or dual lenses. We approach this problem with GANs for depth estimation of the input image. We therefore model the bokeh problem as a background-foreground separation (segmentation) problem.
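
Once a foreground mask is available, the final compositing step can be sketched as follows: blur the whole image, then keep the original pixels wherever the mask marks foreground. This is only an illustration under assumptions; the box filter, the grayscale list-of-lists image format, and the names `box_blur` and `apply_bokeh` are hypothetical and are not taken from the project.

```python
def box_blur(img, radius=1):
    """Blur a 2D grayscale image (list of lists of floats) with a box filter.

    Border pixels average over the part of the window that lies inside the image.
    """
    rows, cols = len(img), len(img[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [img[rr][cc]
                      for rr in range(max(0, r - radius), min(rows, r + radius + 1))
                      for cc in range(max(0, c - radius), min(cols, c + radius + 1))]
            out[r][c] = sum(window) / len(window)
    return out


def apply_bokeh(img, fg_mask, radius=1):
    """Keep foreground pixels sharp and replace background pixels with blurred ones."""
    blurred = box_blur(img, radius)
    return [[img[r][c] if fg_mask[r][c] else blurred[r][c]
             for c in range(len(img[0]))]
            for r in range(len(img))]


# One-row toy image: the bright centre pixel is foreground and stays sharp.
image = [[0.0, 9.0, 0.0]]
foreground = [[False, True, False]]
print(apply_bokeh(image, foreground))
```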
Depth Estimation
Sunkesula Sai Praneeth Reddy, Vaddi Niranjan, Mude Chaithanya Naik
Depth estimation from a single image using three networks: CNN, CNN+FC, and CNN-Residual. We compare the performance of the three approaches.
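
The README does not say which metrics were used for the comparison. As a sketch, the function below computes three measures that are common in the monocular depth literature (RMSE, absolute relative error, and the fraction of pixels whose ratio to ground truth is below 1.25); the name `depth_metrics` and the flat-list input format are assumptions made for illustration.

```python
import math


def depth_metrics(pred, target):
    """Compare predicted and ground-truth depths given as flat lists of floats.

    Pixels with non-positive ground truth are treated as invalid and skipped.
    """
    pairs = [(p, t) for p, t in zip(pred, target) if t > 0]
    if not pairs:
        raise ValueError("no valid ground-truth depths")
    n = len(pairs)
    rmse = math.sqrt(sum((p - t) ** 2 for p, t in pairs) / n)
    abs_rel = sum(abs(p - t) / t for p, t in pairs) / n
    # Threshold accuracy: share of pixels with max(p/t, t/p) < 1.25.
    delta1 = sum(1 for p, t in pairs
                 if p > 0 and max(p / t, t / p) < 1.25) / n
    return {"rmse": rmse, "abs_rel": abs_rel, "delta1": delta1}


# The same function can be run on each model's predictions for a side-by-side comparison.
print(depth_metrics([2.0, 4.0], [2.0, 5.0]))
```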
Image Blending using GP-GAN
Mayank Singhal, Sanchit Jain, Chinthareddy Sai Charith Reddy

Given a source image, a destination image, and a mask, our goal is to blend destination into source in a visually appealing manner. We implemented an encoder-decoder network that takes a low-resolution (64×64) composite image (source cropped onto destination) and generates a low-resolution (64×64) image that looks more natural than the composite. Using this low-resolution image and a Laplacian pyramid, we tried to optimize the Gaussian-Poisson Equation (i) by gradient descent and (ii) by pyramid blending.
Image2Depth
Sailor Devangkumar Prashantbhai, Vikrant Nagpure, R Shrinivas

In this project, we attempt to perform depth prediction from monocular images for autonomous driving. We propose to solve this problem using a multiscale regression CNN.
