I am a Research Engineer at Google DeepMind, working on conversation agents that renovate some of the largest Google products.
Previously, I was an ML Research Engineer at Twitter Cortex, where I transformed research ideas like language embeddings and continuous learning to its ranking systems. I also spent a year on vision proptotypes at SenseTime in its early days.
We propose a hybrid hashing method to combine frequency hashing and double hashing techniques for model size reduction of large-scale recommender systems.
To generate high-quality novel view, we design a non-discrete scene representation for 3D transformation and use the edge info in the input image for spatial filtering.
Resemble to the human visual system (HVS), stereo machines collapse under imbalanced stereo inputs. We show that guided by the rough object contour, the corrupted view can be restored, and stereopsis can be regenerated.
As most face images share some common global structures which can be modeled well by sketch information, we propose to learn face sketches first to help the motion blur estimation.
By exploiting the properties of the gradient domain, we establish a selfie-friendly stylization framework that preserved natural skin color and facial structure.
Final Year Thesis (Undergraduate), 2018
pdf /
slides
We bridge the gap of image quality in mobile phones and DSLRs by designing a multi-domain image translation framework, which learns the comprehensive enhancement transformation from heterogeneous real-world datasets.
Based on over 400K posts crawled from League of Legends NA forum, we establish a framework that assesses the player experience from the sentiment polarity of the posts.
Dynamic Searchable Symmetric Encryption (DSSE) aims at making possible queries and updates over an encrypted database on an untrusted server, with minimum exposure about user data to the server.