Working with Apps on CAVATICA

We now will explore how to use your own and public apps in your project workspace

Head Straight to CAVATICA

In your browser navigate to "https://cavatica.sbgenomics.com"

Login here:

You are then in an environment where you can see a number of things including public and developer tabs

Developer Authentication Token

As in all things these days, to push or at times use (if your application is private) applications you have in the repository - you will need to authenticate with your personal authentication token. You should not share your token with others.

If you haven't generated your own authentication token, you can see how it is done here with the instructions How to generate your authentication token

Using Public Apps to your Project

You can use a public app without copying the app into your project.

After you login to CAVATICA, you can Browse the public apps. At the time of this writing, there are [703] on the CAVATICA platform.

You can browse to find the app you are interested in.

And you can search, let's search for the Fastqc Analysis application.

And you can run this application (note that I have selected the workflow and not just a single application:

Once Run is selected, you will be prompted to specify your project:

Next, you see that you need to provide files for the application.

Let's see how to get files into your project.

Copying Files to your Project

There are many ways to get files into your project, while you are developing your workflows or doing analysis, it is useful to have test data.

As I mentioned, I use Zenodo for testdata, derivative data products, that is matrices, that are typical for input to interactive analysis as we demonstrated on day 1 with the JupyterLab Notebook and the Volcano plot

And also derivative data products such as what was done for the analysis work, The Impact of Biological Sex on Alternative Splicing. In this case, a Nextflow rMATS workflow was run on controlled access data, the GTEx data, and these aggregate matrices were made and released on Zenodo and were input to all the downstream analyses whose notebooks are on GitHub

To work with Apps on Cavatica, you need to make your files available to us.

It is possible to bring this data in very easily in CAVATICA, and the data can come from many sources. Just a caution, you pay for storage. So working with small test data sets as you learn and develop your workflows, applications and notebooks, is good practice.

Also getting in the habit of removing everything at the end of the day and making those steps reproducible.

Starting from scratch where it is reasonable will ensure that you have the understanding of how to proceed that you think you have. Much in the same way where i have asked for you to run the class, to share your screen and execute the steps I say will work, is a kind of testing, and a way to ensure what I am saying is true and reproducible.

By getting into the habit of daily saving your work on GitHub, and saving those sharable derivative data products on Zenodo, you start to create durable, successful habits that will ensure when you publish that what you say and do are correct and accurate to the best of your ability.

Navigate to Files

Navigate to Files and select Add Files

Select FTP/HTTP

In your browser, navigate to the Zenodo site

https://doi.org/10.5281/zenodo.7025773

Where our testdata files now reside.

Right click and copy link address on test.10k_reads_1.fastq.gz

Paste in the window what you copied:

https://zenodo.org/record/7025773/files/test.20k_reads_1.fastq.gz?download=1

You will see that unfortunately it includes ?download=1. Delete this ending so your pasted copy looks like this:

https://zenodo.org/record/7025773/files/test.20k_reads_1.fastq.gz

Now copy this file in the window and change the second one to have the _2 instead of _1.

Now in the window it should have these two files

https://zenodo.org/record/7025773/files/test.20k_reads_1.fastq.gz
https://zenodo.org/record/7025773/files/test.20k_reads_2.fastq.gz

And your screen should look like this:

Press Done and you will see that now your files are there.

Running the App

Now we have our app in our project, and we have our files -- now we can run the app.

Navigate back to our app, select the copy we have made.

Select run

It says we need files.

Now we need to Select our files

Now we select run and we see that it is executing.

You will see that machines are initializing

App Completion

Then the analysis was run and we can view the resulting files in the same way that you can view the results of the execution example ran on the Google Shell Cloud -- but we are in a workspace now where we can have a large number of machines running in parallel. There are limits of course, and depending upon the analysis these limits can be discussed -- because the important things is to get the Science done properly and efficiently.

Adding your own containers

We showed in previous steps how to build your own Docker Images and push them onto CAVATICA

Return to the Agenda

Main Agenda

Additional resources:

CAVATICA documentation for the Apps interface: https://docs.cavatica.org/docs/public-apps
CAVATICA documentation for Docker Basics: http://docs.cavatica.org/docs/docker-basics

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WorkingWithAppsOnCAVATICA.md

WorkingWithAppsOnCAVATICA.md

Working with Apps on CAVATICA

Head Straight to CAVATICA

Developer Authentication Token

Using Public Apps to your Project

Copying Files to your Project

Navigate to Files

Running the App

Adding your own containers

Return to the Agenda

Additional resources:

Files

WorkingWithAppsOnCAVATICA.md

Latest commit

History

WorkingWithAppsOnCAVATICA.md

File metadata and controls

Working with Apps on CAVATICA

Head Straight to CAVATICA

Developer Authentication Token

Using Public Apps to your Project

Copying Files to your Project

Navigate to Files

Running the App

Adding your own containers

Return to the Agenda

Additional resources: