Image Captioning

Amir Khan

Noida, Uttar Pradesh

1 0

0 Collaborators

an artiﬁcial intelligence problem where a textual description must be generated for a given photograph. ...learn more

Project status: Published/In Market

Artificial Intelligence

Code Samples [1]

Overview / Usage

A challenging artiﬁcial intelligence problem require to generate text descriptions from the given image data. it requires both methods from computer vision to understand the content of the image and a language model from the ﬁeld of natural language processing to turn the understanding of the image into words in the right order.

Given an Image - Model will detect the textual description about the image using computer vision and natural language understanding.

Methodology / Approach

Getting Data - Flickr8k dataset

Preprocess the Image Data, using VGG Network to extract the features from the Image Data

Preprocess the Text Data using NLP text preprocessing (remove punctuations, convert to lowe case, remove all words with numbers in them, Remove all words that are one character or less in length (e.g. ‘a’))

Vectorize text data

build the feature extractor model(cnn) + build sequence model(lstm model)

save the model with low validation loss

evaulate the test image using best saved model.

Technologies Used

keras

nlp

opencv

Repository

https://github.com/Amir22010/NLP_Deep_Learning/tree/master/Image_Captioning_NLP_VISION

Comments (0)

You have disabled JavaScript

We are sorry, but without JavaScript we are currently unable to display the latest activity feed. Please, enable Javascript in your browser.

Image Captioning

Amir Khan

Overview / Usage

Methodology / Approach

Technologies Used

Repository

Login to continue

This action requires you to be logged in.

Thanks for voting. Please leave a comment.

Image Captioning

Amir Khan

Overview / Usage

Methodology / Approach

Technologies Used

Repository

Login to continue

This action requires you to be logged in.