Skip to content

Latest commit

 

History

History
47 lines (31 loc) · 1.31 KB

README.md

File metadata and controls

47 lines (31 loc) · 1.31 KB

Benchmarking Reference Implementation

Here we define input, intermediate and output formats for the benchmarking reference implementation.

The goal of this reference implementation is to

  • support different benchmark types
  • allow adding new comparison types
  • counts consistently and accurately for different truth sets
  • support common truth datasets
  • allow anyone to contribute

The following image outlines the basic infrastructure:

Infrastructure

Generic File Formats

All file formats are based on common standards in genomics:

Inputs

For the description of truth and query formats, see inputs.md

Stratification Regions

The format for stratification regions is defined in stratification.md

Outputs

The output formats are defined in outputs.md

Intermediate Outputs

Intermediate formats for passing data from the comparison engine to the counting step are described in intermediate.md.