16. Glossary#
- BAI#
The index file for a file generated in the BAM format. (This is a non-standard file type.)
- BAM#
Binary version of the Sequence Alignment Map (SAM) format.
- BED#
Browser Extensible Data, a tab-delimited text format that defines the data lines displayed in an annotation track.
- DSRC#
A compression tool dedicated to FASTQ files.
- FASTA#
FASTA-formatted sequence files contain either nucleic acid sequences (such as DNA) or protein sequences. A single FASTA file can store multiple sequences.
- GFF#
General Feature Format, used for describing genes and other features associated with DNA, RNA and protein sequences.
- JSON#
A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/JSON
- Module#
A directory that contains a Snakemake rule and an associated README file. This is especially relevant for the Sequana pipelines. See Developer guide.
- SAM#
Sequence Alignment Map is a generic nucleotide alignment format that describes the alignment of query sequences or sequencing reads to a reference sequence or assembly.
- Snakefile#
A file that contains one or more Snakemake rules.
- VCF#
Variant Call Format, a text format for storing sequence variants, used with the variant calling pipeline.
- YAML#
A human-readable data serialization language commonly used in configuration files. See https://en.wikipedia.org/wiki/YAML
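
To illustrate the FASTA entry above, here is a minimal Python sketch that reads FASTA text holding several sequences into a dictionary. The function name `parse_fasta` and the two sample records are hypothetical and are not part of Sequana. A real project would more likely rely on an established parser.

```python
def parse_fasta(text):
    """Return a dict mapping each header (without '>') to its sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A '>' line starts a new record; the rest of the line is its header.
            header = line[1:]
            records[header] = []
        elif header is not None:
            # Sequence data may span several lines; collect the pieces.
            records[header].append(line)
    return {name: "".join(chunks) for name, chunks in records.items()}


# Hypothetical sample: one nucleotide record and one protein record.
example = """>seq1 hypothetical DNA record
ACGTACGT
ACGT
>seq2 hypothetical protein record
MKVL
"""

sequences = parse_fasta(example)
```

Each key of `sequences` is a header line and each value is the corresponding sequence joined into a single string, which shows how one file can carry multiple records.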