The only input required to run CREMA is a set of raw .fastq files containing raw sequencing reads and a description of these files in TSV format (see below). Note that, as CREMA models changes in chromatin state across samples, it requires at least 2 input files. To add files for upload please click the "Add files" button and add files using the file selection dialog window. Before starting the upload process please select your data type (ChIP-Seq of histone marks or ATAC/DNase-Seq) and the organism from which the data derive. To start upload click the "Start upload" button. We strongly recommend to upload compressed (gzipped) fastq files in order to reduce upload time and minimize the probability of network errors. We also strongly suggest you provide an email address so that we can notify you when CREMA has finished analyzing the data, or contact you in case of problems with the submitted data. We will NOT share your email with any third parties. Finally, we recommend that you provide a name for your current submission in the project name field.
When the data upload has finished you will be automaticly redirected to the status page of the job. NOTE: if you have not provided your email address, please make sure to save the link to this page since it will redirect to the results when the CREMA analysis has finished.
Instead of web-interface you can use CREMA uploader script which could be run from a command line on a remote server in headless mode.
In order to properly process the data, CREMA requires some information about the submitted data files to be submitted as a tab-separated text file. In particular, CREMA needs to know which files correspond to replicates of the same condition, and for paired-end reads it needs to know which files correspond to the paired ends of the same sample. In addition, for ChIP-seq we strongly recommend submitting corresponding background/input samples, and CREMA needs to know which samples are IP and which samples are background.
The "samples.tsv" file should be an ASCII, tab separated table with four columns. The file should have one line per sample and provide the following fields: The name of the condition (IMPORTANT: replicates should have the exact same condition name), the type of the sample (foreground/IP or background/input) indicated as 'fg' or 'bg', and the name(s) of the corresponding sequence read files. In case of paired-end sequences, two filenames corresponding to the first and second ends of the sequences should be given, whereas for single-end sequence data a single file name should be given. Note that mixtures of paired-end and single-end sequences are allowed. Finally, please use informative condition names since these will be used in the display of the results. Below is an example TSV file.
liver fg Sample1_1.fastq.gz Sample1_2.fastq.gz liver fg Sample2.fastq.gz liver bg Sample3.fastq.gz lung fg EXDDF098.fastq.gz EXDDF103.fastq.gz lung fg EXDDF099.fastq.gz lung bg FGD_90KL01.fastq.gz
Download example samples.tsv file.
Instead of local .fastq files you can provide links to fastq files which are publicly available in internet, or provide SRR Ids of samples from SRA archive. In such case you need to upload only the "samples.tsv" file. The CREMA server takes care of dowloading data from the internet according to links/database ids you provided. Please see example below:
liver fg SRR123456 liver bg SRR123457 lung fg http://files.pub/EXDDF098.fastq.gz http://files.pub/EXDDF103.fastq.gz lung fg ftp://files.pub/EXDDF099.fastq.gz lung bg ftp://files.pub/FGD_90KL01.fastq.gz
Currently you can upload your data in two ways.
If you experience any problems with the methods above please do not hesitate to contact us! There are other posibilities how you can share your data with us and we will find the most appropriate way for you.
The developers of CREMA give permission to you and your institution to use the CREMA webserver for internal, research purposes, on the following conditions:
Commercial users should contact us for licensing arrangements.