(Please insert an screenshot of the Tool)

Type of tool: Web application

Required skills:

- Elementary User: No programming

- Advanced User: Python, Basic Deep Learning (PyTorch)

Short description of the tool:

- Detailed description: Data Extractor/MatrixDataExtractor_UserGuide.pdf(info)

Disclaimer:

(Disclaimer Text)

How to use/download/access it:

e.g. Get the GitHub https://github.com/cslab-hub/MatrixDataExtractor, copy the code into your computer and start using

_Use case/problem:

Selecting material (recyclate) for specific product requirements

Description of the problem the tools solves:

[General] + [Tool-specific]

Contact person of the tool: A

Related tools:

- Analyse and Visualize your process data with data analytics -> Data Analytics

- Get guidance to set up a working data infrastucture -> Data Infrastructure Wiki

- Find the right sensor to survey your process -> Sensor Tool

- Improve internal information and material flow -> VSM

- Match material requirements with material properties -> Matrix

Tool Description#

Matrix Data Extractor (MDE) is a web-based application that identifies document table regions on PDF documents using Computer Vision based Deep Learning algorithm and extracts data to text files by applying Optical Character Recognition (OCR). It supports to transfer extracted data to MongoDB database tables. A search functionality is also provided to retrieve data on user interface based on Keyword matching (e.g. Manufacturer Name, Technical Datasheet Name, Keyword for Table Data).

Guidelines#

Before getting started, please take a look at Data Extractor/MatrixDataExtractor_UserGuide.pdf(info) and make yourself familiar with how to use the tool.

Getting Started#

The code for the tool is available at https://github.com/cslab-hub/MatrixDataExtractor

Table Detection : Annotated Datasets, Model Weights, Model Inference#

Table detection model weights and datasets can be provided on request. It is not publicly available. Also a Jupyter Notebook can be provided on request to show model inference result on domain specific dataset.

Secret Key for 'backend' Django Web Application:#

Please use Secret Key as 'SECRET_KEY=!zhn#9$0pvr!+jp5q0f-vhvkfp0w$@tpvy4kf20pb89vf#w1q-' in mde.env file without single quotes.