Fusion calling in DNA¶

See the fusions hydra-genetics module documentation for more details on the softwares for fusion calling. Default hydra-genetics settings/resources are used if no configuration is specified.

dag plot

Pipeline output files:¶

results/dna/{sample}_{type}/fusion/{sample}_{type}.gene_fuse_report.tsv (with UMI option only)
results/dna/{sample}_{type}/fusion/{sample}_{type}.fuseq_wes.report.csv

Fusions calling using GeneFuse¶

DNA fusion calling is performed by GeneFuse v0.6.1 on fastq-files. It uses a gene transcript target file to limit the number of targets to analyze.

Configuration¶

References

Fasta reference genome
Gene transcript file with genomic positions for all exons include in the analysis

Cluster resources

Options	Value
mem_mb	36864
mem_per_cpu	6144
threads	6
time	"8:00:00"

GeneFuse Filtering and report¶

The output from GeneFuse is filtered and then reported into a fusion report using the in-house script report_gene_fuse.py (rule and config). The following filter criteria is used:

Fusions must have at least 6 unique supporting reads.
Very noisy fusion pairs found in almost all samples (defined in filter_fusions_20230214.csv) are removed:
- NPM1::ALK
- CLTC::NTRK3
- MSH2_ALK
- MSH2_HIP1
Noisy fusion pairs found in some samples (defined in filter_fusions_20230214.csv) are filtered individually on the number of uniquely supporting reads:
- LMNA::EZR 9
- ABL1::STRN 7
- EZR::ALK 8
- RSPO2::BRAF 8
- LMNA::HIP1 12
- NPM1::BICC1 11
- RSPO2::ERG 13

Result file¶

results/dna/{sample}_{type}/fusion/{sample}_{type}.gene_fuse_report.tsv

Fusions calling using FuSeq_WES¶

DNA fusion calling is performed by FuSeq_WES v1.0.1 on bam-files. It uses a gene transcript target file to limit the number of targets to analyze.

Configuration¶

References

Software settings

Options	Value	Description
params	`cnv_amplfication_genes.tsv`	Genes and surrounding regions to call small CNVs in

Cluster resources

Options	Value
mem_mb	12288
mem_per_cpu	6144
threads	2
time	"24:00:00"

FuSeq_WES Filtering and report¶

The output from FuSeq_WES is filtered and then reported into a fusion report using the in-house script filter_report_fuseq_wes.py (rule and config). The following filter criteria is used:

Configuration¶

Software settings

Options	Value	Description
filter_on_fusiondb	True	Only keep fusions found in the fusion database
gene_white_list	`fuseq_wes_gene_white_list.txt`	Only keep fusions with at least one gene in the gene white list
gene_fusion_black_list	`false_positive_fusion_pairs.txt`	Remove fusions pairs in fusion pair gene black list
gtf	`hg19.refGene.gtf`	Transcript annotation
min_support	30	Minimal total number of supporting reads
transcript_black_list	`fuseq_wes_transcript_black_list.txt`	Transcripts that should not be used in annotation

Result file¶

results/dna/{sample}_{type}/fusion/{sample}_{type}.fuseq_wes_report.tsv

Fusions calling using JuLI¶

Fusion calling for ctDNA is performed by JuLI. The process involves: 1. Calling: Call fusions in DNA data. 2. Annotation: Annotate fusions called by JuLI. 3. Filtering: Filter fusions using the in-house script filter_juli.py.

Result file¶

results/dna/{sample}_{type}/fusion/{sample}_{type}.juli.filtered.fusions.txt