Ankit Goyal

I am a Ph.D. student in Computer Science at Princeton University. I am a member of Princeton Vision and Learning Lab, advised by Prof. Jia Deng. I completed my Masters in CSE from University of Michigan and Bachelors in EE from IIT Kanpur. Previously, I have interned at Microsoft Research India with Prateek Jain in Summer 2016 and with Shrikanth Narayanan and Tanaya Guha at USC in Summer 2015.

Email  /  CV  /  Google Scholar  /  LinkedIn  /  Twitter


I am interested in understanding various aspects of intelligence, especially reasoning and common sense. In particular, I want to develop computation models for various reasoning skills that humans possess.

PackIt: A Virtual Environment for Geometric Planning
Ankit Goyal, Jia Deng
ICML 2020
[code] [slides]

Simultaneously reasoning about geometry and planning action is crucial for intelligent agents. This ability of geometric planning comes in handy while grocery shopping, rearranging room, warehouse management etc. We create PackIt, a virtual environment that caters to geometric planning.

Think Visually: Question Answering through Virtual Imagery
Ankit Goyal, Jian Wang, Jia Deng
ACL 2018
[code] [poster]

We study geometric reasoning in the context of question-answering. We introduce Dynamic Spatial Memory Network (DSMN), a deep network architecture designed for answering questions that admit latent visual representations.

ProtoNN: Compressed and Accurate kNN for Resource-scarce Devices
C Gupta, AS Suggala, A Goyal, HV Simhadri, BP, AK, SG, RU, MV, P Jain
ICML 2017

Resource-Efficient Machine Learning
Prateek Jain, Chirag Gupta, AS Suggala, Ankit Goyal, HV Simhadri
US Patent Applicaiton

We propose ProtoNN, a novel algorithm that addresses the problem of real-time and accurate prediction on resource-scarce devices.

A Multimodal Mixture-Of-Experts Model for Dynamic Emotion Prediction in Movies
Ankit Goyal, Naveen Kumar, Tanaya Guha, Shrikanth S. Narayanan

We address the problem of continuous emotion prediction in movies. We propose a Mixture of Experts (MoE)-based fusion model that dynamically combines information from the audio and video modalities for predicting the emotion evoked in movies.

Object Matching Using Speeded Up Robust Features
NK Verma, Ankit Goyal, A Harsha Vardhan, Rahul Kumar Sevakula, Al Salour
IES 2016

We propose a robust algorithm which is capable of detecting all the instances of a particular object in a scene image using Speeded Up Robust Features.

Template Matching for Inventory Management using Fuzzy Color Histogram and Spatial Filters
NK Verma, Ankit Goyal, Anadi Chaman, Rahul K Sevakula, Al Salour
ICIEA 2015

We propose a methodology for object counting using color histogram based segmentation and spatial filters.


I have been a Teaching Assistant for the following courses:

  • COS529: Advanced Computer Vision at Princeton University [Winter 2020]
  • COS429: Computer Vision at Princeton University [Fall 2018]
  • EECS442: Computer Vision at University of Michigan [Fall 2017, Winter 2018]

[Web Cite]