Next Generation Evidence

This repository contains code for combining and harmonizing primary and secondary evidnce from different sources (Pubmed, ClinicalTrials.gov, CIViC, GGPONC). It consists of a database, an application server with a REST API, and a Vue.js frontend.

An online demo is available at: https://we.analyzegenomes.com/nge/

The code for the web frontend is maintained separately: https://gitlab.hpi.de/florian.borchert/nge_app

Installation

We use poetry as a build tool.. Therefore, the dependencies can be installed by running

poetry install

To also install the dev dependencies, run

poetry install --with dev

On an M1 Mac, the installation might fail due to a bug in pygraphviz, a fix can be found here.

Additionally, pre-commit is used to run a few checks and fixes before commits. In order to use them, run

poetry run pre-commit install

Usage

Environment Variables

The system expects the following environment variables to be set. We used a .env file placed in the root directory of the repository for this purpose:

PUBMED_API_KEY for accessing eUtils (only needed for populating the DB)
UMLS_API_KEY for downloading UMLS (needed for populating the DB and for the API)

Populating the Database

To download the necessary data and to populate the database, run

`poetry run populate`

You may run to populate individual parts of the database individually, e.g.,:

`poetry run populate ggponc`

Guidelines

Get access to the latest GGPONC release and place its contents in data/ggponc/ (or adapt the part in the config.ini.

PubMed

The code to download and process PubMed articles is released separately.

ClinicalTrials.gov

poetry run populate automatically identifies the latest monthly dump from AACT and downloads it if necessary.

CIViC

poetry run populate automatically identifies the latest nightly dump from CIViC and downloads it if necessary.

Starting the application server

To start the application server and REST API, please run

poetry run api

Evaluation

An overview of the systems features and its evaluation can be found it the notebooks in the repository's root directory.

Documentation

To show the documentation, run

poetry shell
pdoc integration

Citation

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
api		api
classification		classification
data		data
evaluation		evaluation
integration		integration
registries		registries
.gitignore		.gitignore
00_Stats.ipynb		00_Stats.ipynb
01_Timelags.ipynb		01_Timelags.ipynb
02a_Eval_Oesophagus.ipynb		02a_Eval_Oesophagus.ipynb
02b_Eval_Hodgkin.ipynb		02b_Eval_Hodgkin.ipynb
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Next Generation Evidence

Installation

Usage

Environment Variables

Populating the Database

Guidelines

PubMed

ClinicalTrials.gov

CIViC

Starting the application server

Evaluation

Documentation

Citation

About

Languages

License

hpi-dhc/nge_db

Folders and files

Latest commit

History

Repository files navigation

Next Generation Evidence

Installation

Usage

Environment Variables

Populating the Database

Guidelines

PubMed

ClinicalTrials.gov

CIViC

Starting the application server

Evaluation

Documentation

Citation

About

Resources

License

Stars

Watchers

Forks

Languages