Scene Understanding

Combines recent advances in computer vision and machine translation to generate image captions, using a CNN, an RNN, and transfer learning.

Artificial Intelligence

Groups
DeepLearning, Student Developers for AI

Overview / Usage

An implementation of Google's Show and Tell model in TensorFlow. The model generates a caption directly, one word at a time, with each word conditioned on the image and on the words generated so far. Because captions are composed word by word rather than copied from existing sentences, the model can produce novel combinations of words that might never have occurred in the training data.
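The word-by-word generation described above can be illustrated with a minimal greedy decoding sketch. This is not the project's TensorFlow code: `greedy_caption`, `toy_next_word_probs`, the two-number `image_features` vector, and the tiny vocabulary are all hypothetical stand-ins chosen for illustration. In a real system the CNN would supply the image features and the RNN would supply the next-word probabilities.

```python
START, END = "<start>", "<end>"


def toy_next_word_probs(image_features, prefix):
    """Hypothetical stand-in for the CNN + RNN.

    Returns a dict mapping each candidate next word to a probability.
    For simplicity it looks only at the last word of the prefix; a real
    RNN carries a hidden state that summarizes the whole prefix.
    """
    # Assumption: the first feature scores "dog" and the second scores "cat".
    animal = "dog" if image_features[0] > image_features[1] else "cat"
    table = {
        START: {"a": 0.9, END: 0.1},
        "a": {animal: 0.8, "grass": 0.2},
        "dog": {"on": 0.7, END: 0.3},
        "cat": {"on": 0.7, END: 0.3},
        "on": {"grass": 0.9, END: 0.1},
        "grass": {END: 1.0},
    }
    return table[prefix[-1]]


def greedy_caption(image_features, next_word_probs, max_len=20):
    """Generate a caption by repeatedly picking the most probable next word."""
    words = [START]
    for _ in range(max_len):
        # Each step is conditioned on the image and on the words so far.
        probs = next_word_probs(image_features, words)
        best = max(probs, key=probs.get)
        if best == END:
            break
        words.append(best)
    return words[1:]  # drop the start token


if __name__ == "__main__":
    print(" ".join(greedy_caption([0.9, 0.1], toy_next_word_probs)))
    print(" ".join(greedy_caption([0.1, 0.9], toy_next_word_probs)))
```

Greedy selection is the simplest decoding choice. Beam search, which keeps several candidate captions at each step, is a common alternative and could replace the `max` call without changing the overall loop.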
