Tool Description#

Matrix Data Extractor (MDE) is a web-based application that identifies document table regions on PDF documents using Computer Vision based Deep Learning algorithm and extracts data to text files by applying Optical Character Recognition (OCR). It supports to transfer extracted data to MongoDB database table. A search functionality is also provided to retrieve data on user interface based on Keyword (e.g. Manufacturer Name, Technical Datasheet Name, Keyword for Table Data) search.

Guidelines#

Before you get started, take a look at the link and make yourself familiar with how to use the tool.

Getting Started#

The tool is available at: GitHub Link.