Supporting Image-to-Text (Image Captioning) in Apache Tika for Image MIME Types

0 0
  • 0 Collaborators

The project was done as my Google Summer of Code'17 project. Primary objective of this project was to implement an image captioning parser. ...learn more

Artificial Intelligence

Groups
Student Developers for AI

Overview / Usage

Image captions are a small piece of text, usually of one line, added to the metadata of images to provide a brief summary of the scenery in the image. It helps text based Information Retrieval(IR) systems to "understand" the scenery in images. It is a very useful feature, yet a challenging and interesting problem in the domain of computer vision.

The objective of this project is providing Apache Tika (https://github.com/apache/tika), image captioning capabilities and a scalable architecture to deal with deep learning models in the future.

Comments (0)