The following bioinformatics tools have been developed in the lab
Data Processing
Je is a suite to handle barcoded fastq files with (or without) Unique Molecule Identifiers (UMIs) and filter read duplicates using these UMIs:
If you have barcodes and/or UMIs in your fastq files, you’ll most likely enjoy Je. Je currently offers 4 tools :
- demultiplex to demultiplex multi-samples fastq files which reads contain barcodes and UMIs (or not)
- demultiplex-illu to demultiplex fastq files according to associated index files (contain the sample encoding barcodes). Reads can additionally contain UMIs (inline)
- clip to remove UMIs contained in reads of fastq files that do not need sample demultiplexing
- markdupes to filter BAM files for read duplicates taking UMIs into account
In short, Je demultiplex, demultiplex-illu and clip add extracted barcodes and UMIs to the read headers and reformat read headers to fulfill read mappers requirements. Indeed most read mappers (bowtie, bwa…) expect headers for read_1 and read_2 to be strictly identical. After mapping, markdupes identifies PCR (and optical) read duplicates based on their mapping positions and UMIs found in read headers.
Data Analysis
Implementation of WASP pipeline with inclusion of INDELs
WASP is a suite of tools for unbiased allele-specific read mapping and discovery of molecular QTLs described in: “WASP: allele-specific software for robust discovery of molecular quantitative trait loci“. WASP has two parts, which can be used independently of each other:
- Read filtering tools that correct for biases in allele-specific mapping.
- A Combined Haplotype Test (CHT) that tests for genetic association with a molecular trait using counts of mapped and allele-specific reads.
The original WASP pipeline does not handle INDELs and discards reads overlapping them. WASP-INDEL includes INDELs in both WASP steps, controlling for mapping biases caused by INDELs (read filtering) and testing them for genetic associations (CHT).Author: Adam Raboniwitz, EMBL Heidelberg Maintainer: Adam Raboniwitz adamrabs@hotmail.com