16. Glossary
- BAI
The index file for a file generated in the BAM format. (This is a non-standard file type.)
- BAM
Binary version of the Sequence Alignment Map (SAM) format.
- BED
Format that defines the data lines displayed in an annotation track.
- DSRC
A compression tool dedicated to FastQ files
- FASTA
FASTA-formatted sequence files contains either nucleic acid sequence (such as DNA) or protein sequence information. FASTA files store multiple sequences in a single file.
- GFF
General Feature Format, used for describing genes and other features associated with DNA, RNA and Protein sequences.
- JSON
A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/JSON
- Module
A directory that contains a snakemake rule and an associated README file. This is especially relevant for the Sequana pipelines. See Developer guide.
- SAM
Sequence Alignment Map is a generic nucleotide alignment format that describes the alignment of query sequences or sequencing reads to a reference sequence or assembly
- Snakefile
A file that contains one or several Snakemake rules
- VCF
Variant Call Format, for use with the variant calling pipeline
- YAML
A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/YAML