Sequana documentation

Current version: 0.8.3, Jul 30, 2020

Python version

Python 3.6, 3.7.3; most modules are Python2.7 compatible.

Source

See http://github.com/sequana/sequana.

Issues

Please fill a report on github

How to cite

For Sequana in general including the pipelines, please use

Cokelaer et al, (2017), 'Sequana': a Set of Snakemake NGS pipelines, Journal of Open Source Software, 2(16), 352, JOSS DOI doi:10.21105/joss.00352

For the genome coverage tool (sequana_coverage), please cite:

Dimitri Desvillechabrol, Christiane Bouchier, Sean Kennedy, Thomas Cokelaer Sequana coverage: detection and characterization of genomic variations using running median and mixture models GigaScience, Volume 7, Issue 12, December 2018, giy110, https://doi.org/10.1093/gigascience/giy110

For Sequanix (GUI for Snakemake pipeline), please cite:

Dimitri Desvillechabrol, Rachel Legendre, Claire Rioualen, Christiane Bouchier, Jacques van Helden, Sean Kennedy, Thomas Cokelaer Sequanix: A Dynamic Graphical Interface for Snakemake Workflows Bioinformatics, bty034, https://doi.org/10.1093/bioinformatics/bty034 Also available on bioRxiv(DOI: https://doi.org/10.1101/162701)

Warning

March 2020

Important notes for developers. Major refactoring for version 0.8 now pushed on master branch. Please use this new branch to create your own branches. If you still want to use the previous version, please checkout the branch master_20_03_2020 instead.

What is Sequana ?

Sequana is a versatile tool that provides

  1. A Python library dedicated to NGS analysis (e.g., tools to visualise standard NGS formats).

  2. A set of pipelines dedicated to NGS in the form of Snakefiles (Makefile-like with Python syntax based on snakemake framework) with more than 80 re-usable rules (see Rules).

  3. Original tools to help in the creation of such pipelines including HTML reports.

  4. Standalone applications:
    1. sequana_coverage ease the extraction of genomic regions of interest and genome coverage information

    2. sequana_taxonomy performs a quick taxonomy of your FastQ. This requires dedicated databases to be downloaded.

    3. Sequanix: GUI for snakemake workflows, a GUI for Snakemake workflows (hence Sequana pipelines as well)

Currently, the available pipelines cover quality control (e.g. adapters removal, phix removal, trimming of bad quality bases), variant calling, characterisation of the genome coverage, taxonomic classification, de-novo assembly, RNA-seq. See the Pipelines section for more information.

Sequana can be used by developers to create new pipelines and by users in the form of applications ready for production. Moreover, Sequanix can be used to set the parameters of pipelines and execute them easily with a graphical user interface.

To join the project, please let us know on github.

Installation

conda install sequana

Examples

Visit our example gallery to use the Python library

NGS pipelines

Learn about available Snakemake pipelines

Standalone applications

Standalone applications including Sequanix (GUI for snakemake) and the sequana_coverage tool.

User guide and reference

Indices and tables