Erdre - Erroneous data repair for Industry 4.0.

UPDATE: Erdre has been forked to d2m, where we continue the development of a more general machine learning pipeline for tabular and time series data, expanding beyond the scope of erroneous data repair.

A machine learning pipeline enabling Responsible AI:

Explainable AI, using SHAP, LIME or both.
Uncertainty estimation, using Bayesian dropout for neural networks.
Carbon emissions tracking and reporting, using CodeCarbon.

Erdre lets you easily create and evaluate machine learning models for tabular and time series data, with built-in data profiling and feature engineering.

Usage

Tested on:

Linux
macOS
Windows with WSL 2

Clone/download this repository.
Place your data files (CSV) in a folder named after your dataset (DATASET) inside assets/data/raw/, so that the path to the files is assets/data/raw/[DATASET]/.
Update params.yaml with the name of your dataset (DATASET), the target variable, and other configuration parameters.
Build the Docker container:

docker build -t d2m -f Dockerfile .

Run the container:

docker run -p 5000:5000 -it -v $(pwd)/assets:/usr/d2m/assets -v $(pwd)/.dvc:/usr/d2m/.dvc d2m

Open the website at localhost:5000 to use the graphical user interface.
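
The data placement step above can be sketched as a small shell helper, run from the repository root. This is only an illustration: the function name place_dataset and the example arguments are hypothetical and not part of Erdre. Only the target path assets/data/raw/[DATASET]/ comes from the instructions above.

```shell
# place_dataset SOURCE_DIR DATASET
# Hypothetical helper: copies all CSV files from SOURCE_DIR into
# assets/data/raw/DATASET/, relative to the current directory.
place_dataset() {
    src="$1"
    dataset="$2"
    dest="assets/data/raw/${dataset}"

    # Stop early if the source folder contains no CSV files.
    if ! ls "${src}"/*.csv >/dev/null 2>&1; then
        echo "No CSV files found in ${src}" >&2
        return 1
    fi

    # Create the dataset folder and copy the files into it.
    mkdir -p "${dest}" || return 1
    cp "${src}"/*.csv "${dest}/"
}
```

For example, place_dataset ~/Downloads/my_csv_files my_dataset would fill assets/data/raw/my_dataset/. The same name (my_dataset here) then goes into params.yaml as the dataset.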

Creating models on the command line

Copy params.yaml from the host to the container (find CONTAINER_NAME by running docker ps):

docker cp params.yaml [CONTAINER_NAME]:/usr/d2m/params.yaml

Then, from a terminal on the host, run the pipeline inside the container:

docker exec [CONTAINER_NAME] dvc repro
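
The two commands above can be combined into one helper, sketched below. The function name run_pipeline and the DOCKER override variable are assumptions added for illustration. The docker cp and docker exec invocations are the ones shown above.

```shell
# Allow overriding the docker binary, e.g. DOCKER=echo for a dry run
# that prints the commands instead of executing them.
DOCKER="${DOCKER:-docker}"

# run_pipeline CONTAINER_NAME
# Hypothetical helper: copies params.yaml into the container and
# then reproduces the DVC pipeline there.
run_pipeline() {
    container="$1"
    if [ -z "${container}" ]; then
        echo "usage: run_pipeline CONTAINER_NAME" >&2
        return 1
    fi

    # Copy the host's params.yaml into the container.
    "${DOCKER}" cp params.yaml "${container}:/usr/d2m/params.yaml" || return 1

    # Run the pipeline stages defined in dvc.yaml.
    "${DOCKER}" exec "${container}" dvc repro
}
```

Call it as run_pipeline [CONTAINER_NAME], using the name reported by docker ps. The copy step is checked first, so the pipeline does not run against stale parameters if the copy fails.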

SINTEF-9012/Erdre

Folders and files

Latest commit

History

Repository files navigation

Erdre - Erroneous data repair for Industry 4.0.

Usage

Creating models on the command line

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages