Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

Overview

SARS-CoV-2 processing requests

Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

Prerequisites

This automation system is set up to work with ARTIC-amplified paired-end Illumina sequence data, the most common type of SARS-CoV-2 sequencing data today.

Usage

  • Fork the repo and create a new file in the file_requests/ directory.
    • The file should contain a header line, followed by a list of web links to the files you want to analyze. See the example file provided.
    • Links need to be formatted as follows: <base_url>/<sample ID>_[12].<file_extension> (1 representing the forward strand and 2 the reverse strand of paired-end data). If your data is not accessible in this way, or unpublished, it's not a problem - just create an issue and describe what you need.
  • Create a PR with your changes. We will review and merge it as soon as possible.

Analysis of your data

  • After merging, the data will be uploaded to Galaxy Europe and processed by our collection of SARS-CoV-2 genomic sequence analysis workflows, which will produce highly-sensitive per-sample variant calls, per-batch variant reports and reliable consensus sequences for all your samples.

  • Depending on the amount of other jobs running on our server and on the size of your data batch, processing may take between a few hours and a day.

  • Once ready, the complete analysis will become available as a set of published histories on the server.

    💡 Hint: Your histories will carry the filename from your pull request in their name.

  • Key result files - BAM, VCF and consensus sequence FASTA files for each sample in your batch - will also be pushed automatically to a publicly readable FTP server hosted by BSC.

  • After a few days your results will also be included in the viral Beacon project dashboard.

Links


The analyses will be performed using the Galaxy platform and open source tools from BioConda and BioContainers. The workflows will run on the de.NBI-cloud and form part of the Galaxy COVID-19 efforts with partners around the world. For more information please visit https://github.com/galaxyproject/SARS-CoV-2.

Galaxy Project   European Galaxy Project   Australian Galaxy Project   bioconda   XSEDE   TACC   de.NBI   ELIXIR   PSC   Indiana University   Galaxy Training Network   Bio Platforms Australia   Australian Research Data Commons   VIB   ELIXIR Belgium   Vlaams Supercomputer Center   EOSC-Life   Datamonkey   IFB   CRG   BSC  

You might also like...
Compute execution plan: A DAG representation of work that you want to get done. Individual nodes of the DAG could be simple python or shell tasks or complex deeply nested parallel branches or embedded DAGs themselves.

Hello from magnus Magnus provides four capabilities for data teams: Compute execution plan: A DAG representation of work that you want to get done. In

Pose Detection and Machine Learning for real-time body posture analysis during exercise to provide audiovisual feedback on improvement of form.

Posture: Pose Tracking and Machine Learning for prescribing corrective suggestions to improve posture and form while exercising. This repository conta

Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.
Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.

Meet Fidesops: Privacy as Code for DSAR Orchestration A part of the greater Fides ecosystem. ⚡ Overview Fidesops (fee-dez-äps, combination of the Lati

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

Habitat: A Runtime-Based Computational Performance Predictor for Deep Neural Network Training Habitat is a tool that predicts a deep neural network's

Bayesian algorithm execution (BAX)

Bayesian Algorithm Execution (BAX) Code for the paper: Bayesian Algorithm Execution: Estimating Computable Properties of Black-box Functions Using Mut

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing Environment Tested on Ubuntu 14.04 64bit and 16.04 64bit Installation # disabl

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

Angora Angora is a mutation-based coverage guided fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without s

Driller: augmenting AFL with symbolic execution!

Driller Driller is an implementation of the driller paper. This implementation was built on top of AFL with angr being used as a symbolic tracer. Dril

A torch.Tensor-like DataFrame library supporting multiple execution runtimes and Arrow as a common memory format

TorchArrow (Warning: Unstable Prototype) This is a prototype library currently under heavy development. It does not currently have stable releases, an

Comments
  • Submitting / Updating samples on a weekly basis

    Submitting / Updating samples on a weekly basis

    Hi all,

    (INAB-CERTH) is sequencing roughly 100-200 samples on a weekly basis, as part of our collaboration with the National Health Organization of Greece and in the context of a flagship project on genomic surveillance.

    We deposit the raw sequences (fastq files, sequenced using Illumina and based on a slightly adapted ARCTIC protocol) to ENA under the Project ID: PRJEB44141.

    Although we are already analyzing these samples using the computational infrastructure of INAB-CERTH, I would be very keen in having these samples analyzed also through the Galaxy workflow - ideally also performing a re-analysis of older samples in order to assess the impact of changes in the pipeline to the final sequence / lineage assignments.

    My question is this: the direct URLs of all ~2000 paired-end reads is already here. Should I make a copy of the links using the provided template file, which I will be changing every week, or is there a different process in place?

    Thank you again for setting this up, and for your time and efforts here!

    opened by fpsom 19
  • File for the ENA Project PRJEB44141 samples

    File for the ENA Project PRJEB44141 samples

    This file contains 2142 samples uploaded to ENA by INAB-CERTH in the context of the Greek-vs-Corona Flagship project (https://greecevscorona.gr/), as well as the Greek Genomic Surveillance Network.

    opened by fpsom 1
  • Strip file extension from element identifier

    Strip file extension from element identifier

    Currently filename extensions make it into the collection element identifer in Galaxy and from there into history names. It's too easy to forget this during review.

    opened by wm75 0
Owner
useGalaxy.eu
useGalaxy.eu
Multi-modal co-attention for drug-target interaction annotation and Its Application to SARS-CoV-2

CoaDTI Multi-modal co-attention for drug-target interaction annotation and Its Application to SARS-CoV-2 Abstract Environment The test was conducted i

Layne_Huang 7 Nov 14, 2022
SARS-Cov-2 Recombinant Finder for fasta sequences

Sc2rf - SARS-Cov-2 Recombinant Finder Pronounced: Scarf What's this? Sc2rf can search genome sequences of SARS-CoV-2 for potential recombinants - new

Lena Schimmel 41 Oct 3, 2022
BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Overview BisQue is a web-based platform specifically designed to provide researchers with organizational and quantitative analysis tools for up to 5D

Vision Research Lab @ UCSB 26 Nov 29, 2022
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Couler What is Couler? Couler aims to provide a unified interface for constructing and managing workflows on different workflow engines, such as Argo

Couler Project 781 Jan 3, 2023
PyTorch implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation.

PyTorch implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation. Warning: the master branch might collapse. To ob

null 559 Dec 14, 2022
wlad 2 Dec 19, 2022
Very simple NCHW and NHWC conversion tool for ONNX. Change to the specified input order for each and every input OP. Also, change the channel order of RGB and BGR. Simple Channel Converter for ONNX.

scc4onnx Very simple NCHW and NHWC conversion tool for ONNX. Change to the specified input order for each and every input OP. Also, change the channel

Katsuya Hyodo 16 Dec 22, 2022
Toontown: Galaxy, a new Toontown game based on Disney's Toontown Online

Toontown: Galaxy The official archive repo for Toontown: Galaxy, a new Toontown

null 1 Feb 15, 2022
Object Database for Super Mario Galaxy 1/2.

Super Mario Galaxy Object Database Welcome to the public object database for Super Mario Galaxy and Super Mario Galaxy 2. Here, we document all object

Aurum 9 Dec 4, 2022
Nb workflows - A workflow platform which allows you to run parameterized notebooks programmatically

NB Workflows Description If SQL is a lingua franca for querying data, Jupyter sh

Xavier Petit 6 Aug 18, 2022