Skip to content

Commit

Permalink
Updated documentation.
Browse files Browse the repository at this point in the history
  • Loading branch information
jfjlaros committed Mar 10, 2021
1 parent 6bc446c commit 49ac1e5
Show file tree
Hide file tree
Showing 7 changed files with 52 additions and 6 deletions.
4 changes: 3 additions & 1 deletion README.rst
Original file line number Diff line number Diff line change
Expand Up @@ -27,9 +27,11 @@ Demultiplex: FASTA/FASTQ demultiplexer
Versatile NGS demultiplexer with the following features:

- Support for FASTA and FASTQ files.
- Support for gzip and bzip2 compressed files.
- Support for multiple reads per fragment, e.g., paired-end.
- Handles barcodes in the header and in the reads.
- Handles barcodes at *unknown* locations in reads (e.g., PacBio or Nanopore barcodes).
- Handles barcodes at *unknown* locations in reads (e.g., PacBio or Nanopore
barcodes).
- Support for selection of part of a barcode.
- Allows for mismatches, insertions and deletions.
- Barcode guessing by frequency or fixed amount.
Expand Down
Binary file added data/demultiplex.fq.bz2
Binary file not shown.
Binary file added data/demultiplex.fq.gz
Binary file not shown.
45 changes: 45 additions & 0 deletions docs/faq.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,45 @@
Frequently asked questions
==========================

Can this program work with dual barcodes / indexes?

Yes, but not directly. Because of the large amount of dual (or more)
indexing approaches, the user interface would become incomprehensible. This
is why we have decided to support only the basic cases. In order to support
an arbitrary amount of barcodes see Section :ref:`multiple_barcodes`.


Can you add support for removing barcodes after demultiplexing?

We used to have this type of functionality (selecting parts of a read) in a
previous version, but because of the large number of complicated barcoding
schemas (multiple barcodes in one read, barcodes in multiple reads, etc.),
we found that this interface was not flexible enough. Instead, we recommend
to use a more generic tool for post processing the demultiplexed files, The
Fastools_ ``select`` command for example.


My sequencing run was pretty bad, can / should I increase the number of allowed
mismatches?

It depends on which barcodes were used. Most barcode sets are designed to
allow for single nucleotide read errors. When multiple errors occur, it may
not be possible to uniquely assign a read to a barcode. You can use the
Barcode_ ``test`` command to see if your barcode set allows for multiple
error correction.


I do not know which / how many barcodes were used. How can I demultiplex my
file?

The best thing to do is to contact your sequencing provider and ask which
barcodes were used. If this is not possible for some reason, you may want
to ``guess`` subcommand described in Section :ref:`illumina`. If the
barcodes are in the read instead of the header, you may want to use a tool
like FastQC_ to find overrepresented sequences. These may be the barcodes
you are looking for.


.. _FastQC: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
.. _Fastools: https://fastools.readthedocs.io/
.. _Barcode: https://barcode.readthedocs.io/
2 changes: 1 addition & 1 deletion docs/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -6,5 +6,5 @@

installation
usage
library
faq
credits
4 changes: 0 additions & 4 deletions docs/library.rst

This file was deleted.

3 changes: 3 additions & 0 deletions docs/usage.rst
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ that particular subcommand. For example:

demultiplex demux -h

.. _illumina:

Illumina FASTQ files
--------------------
Expand Down Expand Up @@ -148,6 +149,8 @@ command:
demultiplex demux -r -e 6 barcodes.csv file.fq


.. _multiple_barcodes:

Multiple barcodes
-----------------

Expand Down

0 comments on commit 49ac1e5

Please sign in to comment.