Discover and run vetted analysis pipelines on Galaxy

Ready-to-use, open-source pipelines with sample data and training materials to make progress quickly and reliably.

Get started with one of our most popular workflows, or browse the full library below.

RNA-Seq Analysis: Paired-End Read Processing and Quantification

Complete RNA-Seq analysis for paired-end data: Processes raw FASTQ data through adapter and bad quality removal (fastp), alignment with STAR using ENCODE parameters, gene quantification via multiple methods (STAR and featureCounts), and expression calculation (FPKM with Cufflinks/StringTie, normalized coverage with bedtools). Produces count tables, normalized expression values, and genomic coverage tracks. Supports stranded and unstranded libraries, generating both HTSeq-compatible counts and normalized measures for downstream analysis.

Transcriptomics

1.2

Updated on Jan 27, 2025

ChIP-seq Analysis: Paired-End Read Processing

Complete ChIP-seq analysis for paired-end sequencing data. Processes raw FASTQ files through adapter removal (fastp), alignment to reference genome (Bowtie2), and stringent quality filtering (MAPQ greater than 30, concordant pairs only). Peak calling with MACS2 optimized for paired-end reads identifies protein-DNA binding sites. Generates alignment files, coverage, peak calls, and quality metrics for downstream analysis.

Epigenetics

1.0

Updated on Oct 3, 2025

ATAC-seq Analysis: Chromatin Accessibility Profiling

Complete ATAC-seq analysis pipeline for paired-end reads. Processes raw FASTQ data through adapter and bad quality removal (cutadapt), alignment (Bowtie2 end-to-end), and filtering (removes MT reads, discordant pairs, low mapping quality <30, PCR duplicates). Generates 5' cut site pileups (±100bp), performs peak calling, and quantifies reads in 1kb summit-centered regions. Produces two normalized coverage tracks (per million mapped reads and per million reads in peaks) and fragment length distribution plots for quality assessment.

Epigenetics

1.0

Updated on Nov 28, 2024

Scaffolding with Hi-C data VGP8

This workflow performs the scaffolding of a genome assembly using HiC data with YAHS. Can be used on any assembly with Hi-C data, and the assembly in the gfa format.

Vertebrate Genome Project

3.3

Updated on Dec 10, 2025

Metagenomic Genes Catalogue Analysis

Metagenomic analysis, from raw reads to gene catalog. Uses Megahit to assemble contigs and Prodgial to predict CDSs on contigs to provide the gene catalog. Finally, functional, taxonomic, and antimicrobial resistance information is provided.

Genome Annotation

Metagenomics

Microbiome

1.1

Updated on Dec 8, 2025

Host or Contamination Removal on Short-Reads

This workflow takes paired-end Illumina fastq(.gz) files and runs Bowtie to map the reads against a reference genome (human, by default) and keep only the reads that do not align. MultiQC is used to aggregate the mapping reports.

0.2

Updated on Dec 8, 2025

Host or Contamination Removal on Long-Reads

This workflow takes Nanopore fastq(.gz) files and runs Minimap2 to map the reads against a reference genome (human, by default). It filters the output to keep only the unmapped reads and generates mapping statistics that are aggregated into a MultiQC report.

0.2

Updated on Dec 8, 2025

RNA-Seq Differential Expression Analysis with Visualization

Identifies differentially expressed genes between exactly two experimental conditions from count tables. The workflow performs statistical testing, applies filters based on adjusted p-value and log2 fold change thresholds, and generates publication-quality visualizations including volcano plots, MA plots, and heatmaps. Takes two collections of count tables as input and produces filtered gene lists and interactive plots for interpreting expression differences. Optimal for simple two-condition experimental designs.

Transcriptomics

0.5

Updated on Dec 8, 2025

Short-read quality control and trimming

This workflow performs quality control and trimming on paired-end Illumina fastq(.gz) files using fastp and aggregates the quality control reports with MultiQC

0.2

Updated on Dec 8, 2025

Genome annotation with Braker3

This workflow performs genome annotation using Braker3 and evaluates the quality of the annotation with BUSCO and genome annotation statistics.

0.1

Updated on Dec 5, 2025

Variant calling and consensus construction from paired end short read data of non-segmented viral genomes

Variant calling and consensus sequence generation for batches of Illumina PE sequenced viruses with uncomplicated and stable genome structure (like e.g. Morbilliviruses).

Variant Calling

Virology

0.2

Updated on Dec 4, 2025

Post-Assembly Quality Control and Contamination Check for Bacterial Genomes

This workflow performs quality and contamination control analysis on assembled contigs to assess bacterial genome quality and taxonomic assignment

Metagenomics

Microbiome

1.0.2

Updated on Dec 1, 2025

Bacterial Genome Assembly using Shovill

Assembly of bacterial paired-end short read data

Genome assembly

2.0.1

Updated on Dec 1, 2025

Raw Read Quality and Contamination Control For Genome Assembly

Short paired-end read analysis to provide quality analysis, read cleaning and taxonomy assignation directly from raw reads

1.1.11

Updated on Nov 28, 2025

K-mer profiling and reads statistics VGP1

Evaluation of Pacbio Hifi Reads and genome profiling. Create Meryl Database used for the estimation of assembly parameters and quality control with Merqury. Part of the VGP pipeline.

Vertebrate Genome Project

0.7

Updated on Nov 10, 2025

Genome Assembly from Hifi reads with HiC phasing - VGP4

Assemble Genome using PacBio HiFi and HiC data from the same individual for phasing. Prerequisite: Run k-mer profiling workflow (VGP1). This workflow uses HiFiasm for contigging, and generates assembly statistics, BUSCO reports, Merqury histograms, and the genome assembly contigs in fasta and GFA format.

Vertebrate Genome Project

0.5

Updated on Oct 27, 2025

Assembly decontamination VGP9

Decontamination (foreign contaminants and mitochondrial sequences) of a genome assembly after the final scaffolding step. Uses NCBI FCS GX to identify foreign contaminants and Blast to identify mitochondrial sequences. Part of the VGP Suite.

Vertebrate Genome Project

1.2

Updated on Oct 20, 2025

Pox Virus Illumina Amplicon Workflow from half-genomes

A workflow for the analysis of pox virus genomes sequenced as half-genomes (for ITR resolution) in a tiled-amplicon approach

Virology

0.4

Updated on Oct 8, 2025

Metagenome-Assembled Genomes (MAGs) generation

This workflow produces Metagenome-Assembled Genomes (MAGs) from paired-end metagenomic reads. It includes assembly, binning, refinement, dereplication, annotation, taxonomic classification, quality assessment, and abundance estimation. All results are summarised in a single integrated report and aggregated tables.

Microbiome

0.4

Updated on Oct 7, 2025

ChIP-seq Analysis: Single-End Read Processing

Complete ChIP-seq analysis for single-end sequencing data. Processes raw FASTQ files through adapter removal (fastp), alignment to reference genome (Bowtie2), and quality filtering (MAPQ greater than 30). Peak calling with MACS2 uses a fixed extension of 200bp to identify protein-DNA binding sites. Generates alignment files, coverage, peak calls, and quality metrics for downstream analysis.

Epigenetics

1.0

Updated on Oct 3, 2025

ChIP-seq Analysis: Paired-End Read Processing

Epigenetics

1.0

Updated on Oct 3, 2025

Purge duplicate contigs from a diploid assembly VGP6

Purge contigs marked as duplicates by purge_dups (could be haplotypic duplication or overlap duplication). The contigs are purged from the first assembly (hap1, pri...), added to the second assembly (hp2, alt... ), then the 2nd assembly is purged as well. If you think only one of the assemblies needs purging, use the VGP6b workflow. This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5).

Vertebrate Genome Project

0.10.4

Updated on Sep 29, 2025

Genome Assembly with Pacbio Hifi reads and Trio data for phasing - VGP5

Generate phased assembly based on PacBio HiFi reads and parental Illumina data for phasing. Part of the VGP workflow suite, it needs to be run after the Trio k-mer Profiling workflow VGP2. This workflow uses HiFiasm for contigging, and generates assembly statistics, BUSCO reports, Merqury plots, and the genome assembly contigs in fasta and GFA format.

Vertebrate Genome Project

0.9.9

Updated on Sep 29, 2025

Purging duplicates in one haplotype VGP6b

Purge contigs marked as duplicates by purge_dups in a single haplotype (could be haplotypic duplication or overlap duplication). If you think the purged contigs might belong to the other haplotype, use the workflow VGP6 instead. This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5).

Vertebrate Genome Project

0.8.4

Updated on Sep 29, 2025

Genome Assembly from Hifi reads - VGP3

Generate a genome assembly based on PacBio HiFi reads. Part of the VGP suite, it needs to be run after the VGP1 k-mer profiling workflow. The assembly contigs are built using HiFiasm, and the workflow generates assembly statistics, BUSCO reports, Merqury plots, and the contigs in fasta and GFA formats.

Vertebrate Genome Project

0.3.4

Updated on Sep 29, 2025

Mitogenome Assembly VGP0

Generate mitochondrial assembly based on PacBio HiFi reads. Part of the VGP suite, it can be run at any time independently of the other workflows. This workflow uses MitoHiFi and a mitochondrial reference to assemble the mitochondrial genome from PacBio reads. You do not need to provide the reference yourself, only the Latin name of the species.

Vertebrate Genome Project

0.2.2

Updated on Aug 25, 2025

Generate Nx and Size plots for multiple assemblies

Generate Nx and size plots for multiple assemblies to compare the evolution of assembly quality through the scaffolding process. Inputs are the fasta files for each assembly to compare.

Vertebrate Genome Project

0.1.8

Updated on Aug 25, 2025

Bacterial Genome Annotation

Annotation of an assembled bacterial genomes to detect genes, potential plasmids, integrons and Insertion sequence (IS) elements.

Genome Annotation

1.2.0

Updated on Jul 21, 2025

Multiplex Tissue Microarray Analysis

Perform background subtraction, nuclear segmentation, feature quantification, cellular phenotyping, spatial analysis, and interactive visualization of registered TMA core multiplex tissue images

Imaging

0.1.1

Updated on Jun 22, 2025

End-to-End Tissue Microarray Analysis

Complete multiplex tissue image (MTI) analysis pipeline for tissue microarray (TMA) data imaged using cyclic immunofluorescence: Performs illumination correction, stitching and registration, and tissue microarray segmentation. Tissue-segmented images undergo nuclear segmentation, cell/nuclei feature quantification (mean marker intensities, cell coordinates, and morphological features), and cell phenotyping. Produces outputs that are compatible with downstream single-cell/spatial analysis and interactive image viewers including: Pyramidal OME-TIFF images, nuclear segmentation masks (TIFF), quantified feature tables (CSV, h5ad) with cell type annotations, and an interactive Vitessce dashboard that combines image viewing with linked single-cell data visualizations.

Imaging

0.1.1

Updated on Jun 21, 2025

PretextMap Generation from 1 or 2 haplotypes

This workflow generates Hi-C contact maps for genome assemblies in the Pretext format. It is compatible with one or 2 haplotypes. It includes tracks for PacBio read coverage, Gaps, and telomeres. The Pretext files can be open in PretextView for the manual curation of genome assemblies.

Vertebrate Genome Project

1.0beta6

Updated on Jun 9, 2025

Single-Cell RNA-seq Preprocessing: 10X Genomics CellPlex Multiplexed Samples

Comprehensive preprocessing for 10X Genomics CellPlex multiplexed single-cell RNA-seq data. Processes Cell Multiplexing Oligo (CMO) FASTQ files with CITE-seq-Count including required CellPlex-specific translation steps. Simultaneously processes gene expression FASTQ files with STARsolo and DropletUtils for alignment and cell filtering, and formats outputs for seamless import into Seurat/Scanpy (Read10X function).

Transcriptomics

Single Cell

0.6.3

Updated on Jun 9, 2025

Single-Cell RNA-seq Preprocessing: 10X Genomics v3 to Seurat and Scanpy Compatible Format

Complete preprocessing pipeline for 10X Genomics v3 single-cell RNA-seq data. Aligns raw FASTQ files using STARsolo, performs cell calling and quality filtering with DropletUtils, and formats outputs for seamless import into Seurat/Scanpy (Read10X function).

Transcriptomics

Single Cell

0.6.3

Updated on Jun 9, 2025

Genome assembly with Flye

Assemble long reads with Flye, then view assembly statistics and assembly graph

Genome assembly

0.3

Updated on May 19, 2025

Mass spectrometry: LC-MS preprocessing with XCMS

This workflow is composed with the XCMS tool R package (Smith, C.A. 2006) able to extract, filter, align and fill gapand the possibility to annotate isotopes, adducts and fragments using the CAMERA R package (Kuhl, C 2012). https://training.galaxyproject.org/training-material/topics/metabolomics/tutorials/lcms-preprocessing/tutorial.html

Computational Chemistry

Metabolomics

1.1

Updated on May 5, 2025

Mass spectrometry: GCMS with metaMS

This workflow is composed with the XCMS tool R package (Smith, C.A. 2006) able to extract and the metaMS R package (Wehrens, R 2014) for the field of untargeted metabolomics. https://training.galaxyproject.org/training-material/topics/metabolomics/tutorials/gcms/tutorial.html

Computational Chemistry

Metabolomics

0.2

Updated on May 5, 2025

MGnify's amplicon pipeline v5.0

MGnify's amplicon pipeline v5.0. Including the Quality control for single-end and paired-end reads, rRNA-prediction, and ITS sub-WFs.

Microbiome

0.3

Updated on May 5, 2025

MGnify's amplicon pipeline v5.0 - Quality control PE

Quality control subworkflow for paired-end reads.

Microbiome

0.3

Updated on May 5, 2025

Influenza A isolate subtyping and consensus sequence generation

This workflow performs subtyping and consensus sequence generation for batches of Illumina PE sequenced Influenza A isolates.

Virology

0.2

Updated on May 2, 2025

kmer-profiling-hifi-trio-VGP2

Create Meryl Database used for the estimation of assembly parameters and quality control with Merqury. Part of the VGP pipeline.

Vertebrate Genome Project

0.1.6

Updated on Apr 30, 2025

QCxMS Spectra Prediction from SDF

Workflow to predict EI mass spectra using QCxMS starting from a single SDF file, containing the 3D coordinates of all atoms in the molecule. These files can typically be obtained from PubChem.

Computational Chemistry

Metabolomics

0.1

Updated on Apr 28, 2025

SARS-CoV-2 Illumina Amplicon pipeline - iVar based

Find and annotate variants in ampliconic SARS-CoV-2 Illumina sequencing data and classify samples with pangolin and nextclade

SARS-COV-2

Variant Calling

0.3.1

Updated on Apr 28, 2025

Molecular formula assignment and recalibration with MFAssignR package.

This workflow can be used to assign multi-element molecular formulas to ultrahigh resolution mass spectra.

Metabolomics

0.1

Updated on Apr 16, 2025

COVID-19: variation analysis on ARTIC PE data

The workflow for Illumina-sequenced ARTIC data builds on the RNASeq workflow for paired-end data using the same steps for mapping and variant calling, but adds extra logic for trimming ARTIC primer sequences off reads with the ivar package. In addition, this workflow uses ivar also to identify amplicons affected by ARTIC primer-binding site mutations and tries to exclude reads derived from such tainted amplicons when calculating allele-frequencies of other variants.

SARS-COV-2

Variant Calling

0.5.4

Updated on Mar 17, 2025

Allele-based Pathogen Identification

Microbiome - Variant calling and Consensus Building

Microbiome

0.1.4

Updated on Mar 10, 2025

COVID-19: consensus construction

Build a consensus sequence from FILTER PASS variants with intrasample allele-frequency above a configurable consensus threshold. Hard-mask regions with low coverage (but not consensus variants within them) and ambiguous sites.

SARS-COV-2

0.4.3

Updated on Mar 10, 2025

COVID-19: variation analysis on WGS SE data

This workflows performs single end read mapping with bowtie2 followed by sensitive variant calling across a wide range of AFs with lofreq

SARS-COV-2

Variant Calling

0.1.6

Updated on Mar 10, 2025

MGnify's amplicon pipeline v5.0 - Quality control SE

Quality control subworkflow for single-end reads.

Microbiome

0.3

Updated on Mar 10, 2025

MAPseq to ampvis2

The MAPseq to Ampvis workflow processes MAPseq OTU tables and associated metadata for analysis in Ampvis2. This workflow involves reformatting MAPseq output datasets to produce structured output files suitable for Ampvis2.

Microbiome

0.2

Updated on Mar 10, 2025

Taxonomic abundance summary tables for a specified taxonomic rank

This workflow creates taxonomic summary tables for a specified taxonomic rank out of MAPseq's OTU tables output collection.

Microbiome

0.2

Updated on Mar 10, 2025

MGnify's amplicon pipeline v5.0 - rRNA prediction

Classification and visualization of SSU, LSU sequences.

Microbiome

0.2

Updated on Mar 10, 2025

MGnify amplicon summary tables

This workflow creates taxonomic summary tables out of the amplicon pipeline results.

Microbiome

0.2

Updated on Mar 10, 2025

MGnify's amplicon pipeline v5.0 - ITS

Classification and visualization of ITS regions.

Microbiome

0.2

Updated on Mar 10, 2025

Single-Cell Mixture Analysis: baredSC 1D Log-Normalized Models

Applies baredSC algorithm to fit and combine one-dimensional Gaussian mixture models (from 1 to N components) on log-normalized single-cell gene expression data. Enables identification of subpopulations based on expression of genes of interest and provides statistical assessment of the optimal number of components in heterogeneous cell populations.

Transcriptomics

Single Cell

0.6

Updated on Mar 10, 2025

Single-Cell Mixture Analysis: baredSC 2D Log-Normalized Models

Applies baredSC algorithm to fit and combine two-dimensional Gaussian mixture models (from 1 to N components) on log-normalized single-cell gene expression data. Enables identification of subpopulations based on co-expression patterns of two genes of interest and provides statistical assessment of the optimal number of components in heterogeneous cell populations.

Transcriptomics

Single Cell

0.6

Updated on Mar 10, 2025

Genome annotation with Maker

This workflow allows for genome annotation using Maker and evaluates the quality of the annotation.

Genome Annotation

0.1

Updated on Mar 6, 2025

lncRNAs annotation workflow

This workflow runs the FEELnc tool to annotate long non-coding RNAs. Before annotating these long non-coding RNAs, StringTie will be used to assemble the RNA-seq alignments into potential trancriptions. The gffread tool provides a genome annotation file in GTF format.

Genome Annotation

0.1

Updated on Mar 5, 2025

dada2 amplicon analysis pipeline - for paired end data

dada2 amplicon analysis for paired end data The workflow has three main outputs: - the sequence table (output of makeSequenceTable) - the taxonomy (output of assignTaxonomy) - the counts which allow to track the number of sequences in the samples through the steps (output of sequence counts)

Metabarcoding

0.3

Updated on Feb 17, 2025

RNA-Seq Analysis: Paired-End Read Processing and Quantification

Transcriptomics

1.2

Updated on Jan 27, 2025

RNA-Seq Analysis: Single-End Read Processing and Quantification

Complete RNA-Seq analysis for single-end data: Processes raw FASTQ data through adapter and bad quality removal (fastp), alignment with STAR using ENCODE parameters, gene quantification via multiple methods (STAR and featureCounts), and expression calculation (FPKM with Cufflinks/StringTie, normalized coverage with bedtools). Produces count tables, normalized expression values, and genomic coverage tracks. Supports stranded and unstranded libraries, generating both HTSeq-compatible counts and normalized measures for downstream analysis.

Transcriptomics

1.2

Updated on Jan 27, 2025

Consensus Peak Calling for ChIP-seq Single-End Replicates

Identifies high-confidence consensus peaks from ChIP-seq single-end replicate experiments. The workflow calls peaks on individual replicates and identifies their intersection. To control for sequencing depth differences, it subsamples all replicates to the smallest library size, performs peak calling on the combined normalized data, and retains only peaks whose summits overlap with intersections from a user-defined minimum number of replicates.

Epigenetics

1.3

Updated on Jan 27, 2025

Consensus Peak Calling for ChIP-seq Paired-End Replicates

Identifies high-confidence consensus peaks from ChIP-seq paired-end replicate experiments. The workflow calls peaks on individual replicates and identifies their intersection. To control for sequencing depth differences, it subsamples all replicates to the smallest library size, performs peak calling on the combined normalized data, and retains only peaks whose summits overlap with intersections from a user-defined minimum number of replicates.

Epigenetics

1.3

Updated on Jan 27, 2025

Consensus Peak Calling for ATAC-seq and CUT&RUN Replicates

Identifies high-confidence consensus peaks from ATAC-seq or CUT&RUN replicate experiments. The workflow calls peaks on individual replicates and identifies their intersection. To control for sequencing depth differences, it subsamples all replicates to the smallest library size, performs peak calling on the combined normalized data, and retains only peaks whose summits overlap with intersections from a user-defined minimum number of replicates.

Epigenetics

1.3

Updated on Jan 27, 2025

CUT&RUN/CUT&TAG Analysis: Protein-DNA Interaction Mapping

Complete CUT&RUN/CUT&TAG analysis workflow for paired-end sequencing data. Processes raw FASTQ files through adapter removal (cutadapt) and alignment (Bowtie2 with dovetail option enabled). Applies quality filtering (MAPQ ≥ 30, concordant pairs only), converts BAM to BED format, and performs peak calling using MACS2 with parameters optimized for the punctate signal profile characteristic of CUT&RUN/CUT&TAG experiments.

Epigenetics

0.14

Updated on Jan 27, 2025

Clinical Metaproteomics Verification Workflow

In proteomics research, verifying detected peptides is essential for ensuring data accuracy and biological relevance. This tutorial continues from the clinical metaproteomics discovery workflow, focusing on verifying identified microbial peptides using the PepQuery tool.

Proteomics

Metaproteomics

0.2

Updated on Dec 16, 2024

Functional annotation of protein sequences

This workflow uses eggNOG mapper and InterProScan for functional annotation of protein sequences.

Genome Annotation

0.1

Updated on Dec 4, 2024

ATAC-seq Analysis: Chromatin Accessibility Profiling

Epigenetics

1.0

Updated on Nov 28, 2024

Clinical Metaproteomics Data Interpretation

This workflow will perform taxonomic and functional annotations using Unipept and statistical analysis using MSstatsTMT.

Proteomics

Metaproteomics

0.1

Updated on Nov 19, 2024

Clinical Metaproteomics Discovery Workflow

Workflow for clinical metaproteomics database searching

Proteomics

Metaproteomics

0.1

Updated on Nov 18, 2024

Generate a Clinical Metaproteomics Database

The workflow begins with the Database Generation process. The Galaxy-P team has developed a workflow that collects protein sequences from known disease-causing microorganisms to build a comprehensive database. This extensive database is then refined into a smaller, more relevant dataset using the Metanovo tool.

Proteomics

Metaproteomics

0.1

Updated on Nov 18, 2024

Single-Cell Pseudobulk Differential Expression Analysis with edgeR

Performs differential gene expression analysis on single-cell data using a pseudobulk approach. Aggregates cell-level counts from an annotated AnnData object by cell type or other metadata using Decoupler, then applies edgeR for robust statistical testing between conditions. Includes data preprocessing steps for compatibility with edgeR and generates interactive volcano plots to visualize significantly differentially expressed genes.

Transcriptomics

Single Cell

0.1.1

Updated on Nov 18, 2024

Segmentation and counting of cell nuclei in fluorescence microscopy images

This workflow performs segmentation and counting of cell nuclei using fluorescence microscopy images. The segmentation step is performed using Otsu thresholding (Otsu, 1979). The workflow is based on the tutorial: https://training.galaxyproject.org/training-material/topics/imaging/tutorials/imaging-introduction/tutorial.html

Imaging

0.2

Updated on Nov 7, 2024

QIIME2 Ia: multiplexed data (single-end)

Importing single-end multiplexed data (not demultiplexed yet)

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2 Ib: multiplexed data (paired-end)

Importing paired-end multiplexed data (not demultiplexed yet)

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2 Ic: Demultiplexed data (single-end)

Importing demultiplexed data (single-end)

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2 Id: Demultiplexed data (paired-end)

Importing demultiplexed data (paired-end)

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2 IIa: Denoising (sequence quality control) and feature table creation (single-end)

Use DADA2 for sequence quality control. DADA2 is a pipeline for detecting and correcting (where possible) Illumina amplicon sequence data. As implemented in the q2-dada2 plugin, this quality control process will additionally filter any phiX reads (commonly present in marker gene Illumina sequence data) that are identified in the sequencing data, and will filter chimeric sequences.

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2 IIb: Denoising (sequence quality control) and feature table creation (paired-end)

Metabarcoding

0.3

Updated on Nov 4, 2024

QIIME2-III-V-Phylogeny-Rarefaction-Taxonomic-Analysis

This workflow - Reconstruct phylogeny (insert fragments in a reference) - Alpha rarefaction analysis - Taxonomic analysis

Metabarcoding

0.2

Updated on Nov 4, 2024

QIIME2 VI: Diversity metrics and estimations

The first step in hypothesis testing in microbial ecology is typically to look at within- (alpha) and between-sample (beta) diversity. We can calculate diversity metrics, apply appropriate statistical tests, and visualize the data using the q2-diversity plugin.

Metabarcoding

0.2

Updated on Nov 4, 2024

Gene Ontology and KEGG Pathway Enrichment Analysis

Performs functional enrichment analysis of gene sets using GOseq, identifying over-represented Gene Ontology terms and KEGG pathways. This workflow accounts for gene length bias in RNA-seq data when calculating enrichment statistics. Input requires differentially expressed genes and gene lengths. Generates comprehensive tables and visualizations of enriched GO terms across all three ontologies (Biological Process, Molecular Function, Cellular Component) as well as KEGG pathway enrichment results.

Transcriptomics

0.1

Updated on Nov 3, 2024

Paired end variant calling in haploid system

Workflow for variant analysis against a reference genome in GenBank format

Variant Calling

0.1

Updated on Oct 29, 2024

AMR Gene Detection

Antimicrobial resistance gene detection from assembled bacterial genomes

Genome Annotation

1.1.7

Updated on Oct 11, 2024

Single-Cell RNA-seq Analysis: Scanpy Preprocessing and Clustering

End-to-end analysis of single-cell RNA-seq data using the Scanpy/AnnData ecosystem. Imports count matrices, applies quality control filtering of low-quality cells and genes, normalizes and scales data, performs dimension reduction (PCA), and identifies cell clusters using the Louvain algorithm. Generates publication-ready visualizations including UMAP plots colored by clusters and marker genes.

Single Cell

0.1

Updated on Oct 9, 2024

BREW3R

Extends 3' ends of gene annotations using BAM files (from STAR alignments) and a reference GTF. Specifically designed for 3'-biased sequencing techniques like 10X scRNA-seq or BRB-seq that primarily capture transcript 3' ends. The BREW3R tool enhances annotations by using evidence from RNA-seq data to improve 3' UTR definitions, which is particularly important for accurate quantification in single-cell and bulk RNA-seq experiments.

Transcriptomics

0.2

Updated on Oct 7, 2024

COVID-19: variation analysis reporting

This workflow takes a VCF dataset of variants produced by any of the *-variant-calling workflows in https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling and generates tabular lists of variants by Samples and by Variant, and an overview plot of variants and their allele-frequencies.

SARS-COV-2

Variant Calling

0.3.4

Updated on Sep 24, 2024

Clinical Metaproteomics Quantitation

Clinical Metaproteomics 4: Quantitation

Proteomics

Metaproteomics

0.1

Updated on Aug 14, 2024

Scaffolding-BioNano-VGP7

Scaffold a genome assembly using Bionano optical maps data. Part of the VGP suite, it can be run on the assembly in GFA format generated by the contigging step in workflow 3,4, or 5. If you used a different method to generate your assembly, you can use the tools gfastats in Galaxy to generate a GFA from your assembly fasta, and create a simple text file containing the estimated genome size.

Vertebrate Genome Project

0.1.4

Updated on Aug 13, 2024

sra_manifest_to_concatenated_fastqs_parallel

This workflow takes as input a SRA_manifest from SRA Run Selector and will generate one fastq file or fastq pair of file for each experiment (concatenated multiple runs if necessary). Output will be relabelled to match the column specified by the user.

Data Fetching

0.7

Updated on Jun 17, 2024

MetaProSIP OpenMS 2.8

Automated inference of stable isotope incorporation rates in proteins for functional metaproteomics

Proteomics

0.2

Updated on Jun 17, 2024

Parallel Accession Download

Downloads fastq files for sequencing run accessions provided in a text file using fasterq-dump. Creates one job per listed run accession.

Data Fetching

0.1.14

Updated on May 27, 2024

Taxonomy Profiling and Visualization with Krona

Microbiome - Taxonomy Profiling

Microbiome

0.1

Updated on Apr 25, 2024

Nanopore Preprocessing

Microbiome - QC and Contamination Filtering

Microbiome

0.1

Updated on Apr 25, 2024

Pathogen Detection PathoGFAIR Samples Aggregation and Visualisation

Pathogens of all samples report generation and visualization

Microbiome

0.1

Updated on Apr 24, 2024

Gene-based Pathogen Identification

Nanopore datasets analysis - Phylogenetic Identification - antibiotic resistance genes detection and contigs building

Microbiome

0.1

Updated on Apr 18, 2024

RNA Velocity Analysis: Velocyto for 10X Data from Bundled Output

Processes 10X Genomics single-cell RNA-seq data using Velocyto to quantify spliced and unspliced transcript counts for RNA velocity analysis. Automatically extracts cell barcodes from standard 10X bundled output and generates a loom file containing separate counts for spliced exons, unspliced introns, and ambiguous regions. Enables downstream trajectory inference and cellular dynamics analysis.

Single Cell

0.2

Updated on Feb 5, 2024

RNA Velocity Analysis: Velocyto for 10X Data with Filtered Barcodes

Processes 10X Genomics single-cell RNA-seq data using Velocyto to quantify spliced and unspliced transcript counts for RNA velocity analysis. Takes pre-filtered cell barcodes as input and generates a loom file containing separate counts for spliced exons, unspliced introns, and ambiguous regions. Enables downstream trajectory inference and cellular dynamics analysis in tools like scVelo.

Single Cell

0.2

Updated on Feb 5, 2024

MMGBSA calculations with GROMACS

MMGBSA simulation and calculation

Computational Chemistry

0.1.5

Updated on Nov 27, 2023

Create GRO and TOP complex files

Computational Chemistry

0.1.4

Updated on Nov 20, 2023

Fragment-based virtual screening using rDock for docking and SuCOS for pose scoring

Virtual screening of the SARS-CoV-2 main protease with rDock and pose scoring

Computational Chemistry

0.1.5

Updated on Nov 20, 2023

dcTMD calculations with GROMACS

Perform dcTMD free energy simulations and calculations

Computational Chemistry

0.1.5

Updated on Nov 20, 2023

COVID-19: variation analysis on WGS PE data

This workflows performs paired end read mapping with bwa-mem followed by sensitive variant calling across a wide range of AFs with lofreq

SARS-COV-2

Variant Calling

0.2.4

Updated on Nov 20, 2023

COVID-19: variation analysis of ARTIC ONT data

This workflow for ONT-sequenced ARTIC data is modeled after the alignment/variant-calling steps of the [ARTIC pipeline](https://artic.readthedocs.io/en/latest/). It performs, essentially, the same steps as that pipeline’s minion command, i.e. read mapping with minimap2 and variant calling with medaka. Like the Illumina ARTIC workflow it uses ivar for primer trimming. Since ONT-sequenced reads have a much higher error rate than Illumina-sequenced reads and are therefor plagued more by false-positive variant calls, this workflow does make no attempt to handle amplicons affected by potential primer-binding site mutations.

SARS-COV-2

Variant Calling

0.3.2

Updated on Nov 20, 2023

Generic variation analysis reporting

This workflow takes a VCF dataset of variants produced by any of the variant calling workflows in https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling and generates tabular lists of variants by Samples and by Variant, and an overview plot of variants and their allele-frequencies.

Variant Calling

0.1.1

Updated on Nov 20, 2023

Generic variation analysis on WGS PE data

Workflow for variant analysis against a reference genome in GenBank format

Variant Calling

0.1.1

Updated on Nov 20, 2023

BigWig Replicates Averaging Workflow

Calculates average signal values across replicate BigWig files to generate a single consolidated track. Requires input file identifiers formatted as "sample_name_replicateID". The tool automatically groups files by sample name, computes the mean signal at each position, and outputs consolidated BigWig files named after the common sample prefix (without the replicate ID suffix).

Epigenetics

Transcriptomics

0.2

Updated on Sep 27, 2023

Repeat masking with RepeatModeler and RepeatMasker

Genome Annotation

0.1

Updated on Sep 21, 2023

Hi-C Processing: FASTQ to Balanced Cool Files

Comprehensive Hi-C data processing workflow that converts paired-end FASTQ files to balanced contact matrices. Uses HiCUP for pre-processing (adapter trimming, mapping, filtering) with fragment midpoint coordinate assignment, filters reads by mapping quality, and generates sorted tabix files. The final output is a balanced cool file at user-specified resolution created with cooler, ready for multi-resolution contact matrix analysis.

Epigenetics

0.3

Updated on Sep 8, 2023

Capture Hi-C Processing: FASTQ to Balanced Cool Files

End-to-end processing of Capture Hi-C paired-end sequencing data. Transforms raw FASTQ files into balanced contact matrices using HiCUP for processing Hi-C reads (adapter trimming, mapping, filtering for valid pairs) with additional filtering for MAPQ and captured regions. The workflow generates sorted tabix files via cooler and produces balanced cool files at user-specified resolution for downstream analysis and visualization.

Epigenetics

0.3

Updated on Sep 8, 2023

Hi-C Format Conversion: Juicer Medium to Cooler Files

Converts Hi-C interaction data from Juicer medium format (tabix files) to balanced cooler format. This streamlined workflow takes a collection of Juicer medium tabix files and a reference genome name as input and generates balanced cool files at user-specified resolution. Enables seamless transition between different Hi-C analysis ecosystems while maintaining data integrity.

Epigenetics

0.3

Updated on Sep 8, 2023

Hi-C Data Processing: FASTQ to Valid Interaction Pairs

Processes Hi-C paired-end FASTQ files to generate validated interaction pairs using HiCUP. The workflow truncates reads at ligation junctions, maps to reference genome, assigns to restriction fragments, and filters out experimental artifacts (self-ligated, dangling ends, internal fragments, and size outliers). Removes duplicates and converts outputs to formats compatible with Juicebox/cooler using fragment midpoints as coordinates. Final filtering by mapping quality ensures high-confidence interaction data.

Epigenetics

0.3

Updated on Sep 8, 2023

Assembly polishing with long reads

Racon polish with long reads, x4

Genome assembly

0.1

Updated on Jul 15, 2023