Di-Plast-Wiki: Data Extractor

Tool Description#

Matrix Data Extractor (MDE) is a web-based application that identifies table regions in PDF documents using a computer-vision-based deep learning algorithm and extracts their data to text files by applying Optical Character Recognition (OCR). It supports transferring the extracted data to a MongoDB database. A search function is also provided to retrieve data in the user interface based on keyword matching (e.g. Manufacturer Name, Technical Datasheet Name, Keyword for Table Data).
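As an illustration of the keyword-matching idea, the following is a minimal sketch and not the tool's actual implementation. It assumes extracted records are plain dictionaries; the field names (`manufacturer`, `datasheet`, `table_text`) and the sample records are hypothetical and only mirror the search categories named above.

```python
from typing import Dict, List


def search_records(records: List[Dict[str, str]], keyword: str) -> List[Dict[str, str]]:
    """Return the records in which any field contains the keyword (case-insensitive)."""
    needle = keyword.strip().lower()
    if not needle:
        # An empty or whitespace-only keyword matches nothing.
        return []
    return [
        record
        for record in records
        if any(needle in str(value).lower() for value in record.values())
    ]


# Hypothetical sample records; names and values are made up for illustration.
RECORDS = [
    {
        "manufacturer": "Acme Polymers",
        "datasheet": "PP-100 Technical Datasheet",
        "table_text": "Melt flow rate 12 g/10 min",
    },
    {
        "manufacturer": "Example Plastics",
        "datasheet": "PE-200 Technical Datasheet",
        "table_text": "Density 0.95 g/cm3",
    },
]

if __name__ == "__main__":
    for hit in search_records(RECORDS, "density"):
        print(hit["manufacturer"], "|", hit["datasheet"])
```

A substring match over every field is the simplest possible choice; a real deployment would more likely run the match as a database query.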

Guidelines#

Before getting started, please read Data Extractor/MatrixDataExtractor_UserGuide.pdf and familiarize yourself with how to use the tool.

Getting Started#

The code for the tool is available at https://github.com/cslab-hub/MatrixDataExtractor

Table Detection : Annotated Datasets, Model Weights and Model Inference#

The table detection model weights and annotated datasets are not publicly available, but they can be provided on request. A Jupyter Notebook that shows model inference results on a domain-specific dataset can also be provided on request.

Secret Key for 'backend' Django Web Application:#

Please set the secret key to '!zhn#9$0pvr!+jp5q0f-vhvkfp0w$@tpvy4kf20pb89vf#w1q-' in the mde.env file.
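The exact layout of mde.env and the way the backend reads it are not described on this page. As a rough sketch, assuming the file holds simple `NAME=value` lines and that the variable is called `SECRET_KEY` (both are assumptions), a quoted value could be parsed as shown below. Quoting is worth noting because the key contains `#` and `$`, which some env-file readers treat as comment or substitution characters. The sample value in the code is a placeholder, not the real key.

```python
def parse_env(text: str) -> dict:
    """Parse simple NAME=value lines; full-line comments and blank lines are skipped."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and full-line comments
        if "=" not in line:
            continue  # ignore lines that are not assignments
        name, _, value = line.partition("=")
        value = value.strip()
        # Strip one pair of matching quotes so characters such as '#'
        # inside the value are kept literally.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "'\"":
            value = value[1:-1]
        values[name.strip()] = value
    return values


# Hypothetical file content; SECRET_KEY is an assumed variable name
# and the value is a placeholder.
SAMPLE_ENV = """\
# backend settings
SECRET_KEY='abc#123$xyz'
DEBUG=False
"""

if __name__ == "__main__":
    settings = parse_env(SAMPLE_ENV)
    print(sorted(settings))
```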
