Fusion calling in DNA

See the fusions hydra-genetics module documentation for more details on the softwares for fusion calling. Default hydra-genetics settings/resources are used if no configuration is specified.


dag plot

Pipeline output files:

  • results/dna/{sample}_{type}/fusion/{sample}_{type}.gene_fuse_report.tsv (with UMI option only)
  • results/dna/{sample}_{type}/fusion/{sample}_{type}.fuseq_wes.report.csv

Fusions calling using GeneFuse

DNA fusion calling is performed by GeneFuse v0.6.1 on fastq-files. It uses a gene transcript target file to limit the number of targets to analyze.

Configuration

References


Cluster resources

Options Value
mem_mb 36864
mem_per_cpu 6144
threads 6
time "8:00:00"

GeneFuse Filtering and report

The output from GeneFuse is filtered and then reported into a fusion report using the in-house script report_gene_fuse.py (rule and config). The following filter criteria is used:

  • Fusions must have at least 6 unique supporting reads.
  • Very noisy fusion pairs found in almost all samples (defined in filter_fusions_20230214.csv) are removed:
    • NPM1::ALK
    • CLTC::NTRK3
    • MSH2_ALK
    • MSH2_HIP1
  • Noisy fusion pairs found in some samples (defined in filter_fusions_20230214.csv) are filtered individually on the number of uniquely supporting reads:
    • LMNA::EZR 9
    • ABL1::STRN 7
    • EZR::ALK 8
    • RSPO2::BRAF 8
    • LMNA::HIP1 12
    • NPM1::BICC1 11
    • RSPO2::ERG 13

Result file

  • results/dna/{sample}_{type}/fusion/{sample}_{type}.gene_fuse_report.tsv


Fusions calling using FuSeq_WES

DNA fusion calling is performed by FuSeq_WES v1.0.1 on bam-files. It uses a gene transcript target file to limit the number of targets to analyze.

Configuration

References


Software settings

Options Value Description
params cnv_amplfication_genes.tsv Genes and surrounding regions to call small CNVs in

Cluster resources

Options Value
mem_mb 12288
mem_per_cpu 6144
threads 2
time "24:00:00"

FuSeq_WES Filtering and report

The output from FuSeq_WES is filtered and then reported into a fusion report using the in-house script filter_report_fuseq_wes.py (rule and config). The following filter criteria is used:

Configuration

Software settings

Options Value Description
filter_on_fusiondb True Only keep fusions found in the fusion database
gene_white_list fuseq_wes_gene_white_list.txt Only keep fusions with at least one gene in the gene white list
gene_fusion_black_list false_positive_fusion_pairs.txt Remove fusions pairs in fusion pair gene black list
gtf hg19.refGene.gtf Transcript annotation
min_support 30 Minimal total number of supporting reads
transcript_black_list fuseq_wes_transcript_black_list.txt Transcripts that should not be used in annotation

Result file

  • results/dna/{sample}_{type}/fusion/{sample}_{type}.fuseq_wes_report.tsv

Fusions calling using JuLI

Fusion calling for ctDNA is performed by JuLI. The process involves: 1. Calling: Call fusions in DNA data. 2. Annotation: Annotate fusions called by JuLI. 3. Filtering: Filter fusions using the in-house script filter_juli.py.

Result file

  • results/dna/{sample}_{type}/fusion/{sample}_{type}.juli.filtered.fusions.txt