Compressive Visual Question Answering
Description
Compressive sensing theory makes it possible to sense and reconstruct signals and images at sampling rates below the Nyquist rate. Applications in resource-constrained environments stand to benefit from this theory, which also opens up possibilities for new applications. The traditional inference pipeline for computer vision first reconstructs the image from compressive measurements. However, reconstruction is computationally expensive and gives poor results at high compression rates. There have been several successful attempts to perform inference tasks, such as activity recognition, directly on compressive measurements. In this thesis, I tackle a more challenging vision problem: visual question answering (VQA) without reconstructing the compressive images. I investigate the feasibility of this problem through a series of experiments, evaluate the proposed methods on a VQA dataset, and discuss promising results and directions for future work.
Date Created
2017
Agent
- Author (aut): Huang, Li-Chin
- Thesis advisor (ths): Turaga, Pavan
- Committee member: Yang, Yezhou
- Committee member: Li, Baoxin
- Publisher (pbl): Arizona State University