Video Question Answering Systems(VQA)

Sumedh Pendurkar

Sumedh Pendurkar

Shivajinagar, Maharashtra

0 0
  • 0 Collaborators

A lot of research has been done on Text-based Question Answering Systems. However, very little work was done on answering questions whose answer lies in the image until 2016. This was mainly, due to the locality of the answer in image lies in a certain part of the image which needs to be focused. After 2016 various attention mechanism based QA were proposed to solve this problem. However, the accuracy still remains low for open-ended questions. Moreover, a complex model needs to be designed for a system that intends to answer questions based on a video. Here, sequence-to-sequence modeling is required for both, CNN features as well as text-based features. Integrating these features is a major complex challenge which I intend to tackle. ...learn more

Project status: Under Development

Artificial Intelligence

Intel Technologies
AI DevCloud / Xeon, Intel Opt ML/DL Framework

Overview / Usage

Under Development

Methodology / Approach

Tentative: Use sequence-to-sequence modeling to get features from video and text.

Technologies Used

python3, keras, pytorch. (Intel version of these softwares and packages optimized for DevCloud)

Comments (0)