Input data location considerations

Input data for your workflow may be on your local system or in the cloud, either in an S3 bucket or in Illumina BaseSpace.

See the Amazon documentation for more on S3 pricing (https://aws.amazon.com/s3/pricing/). At time of writing, AWS does not charge for uploading data to S3, while storage in S3 and download from S3 are chargeable.

Additional considerations relating to reference data

Often, data is needed for the analysis that is not itself being acted upon by the analysis. for example, reference sequences to be mapped against, or target regions to limit the focus of the analysis. Such reference data flows into parameter input channels in workflows.

Reference data transfer costs differ depending on the data source:



Footnotes

... bucket4.1
The cache bucket is configured by your GCE administrator. It is a cloud-based location for the temporary storage of input data that you selected from a local system when launching a workflow. By default, files in the cache bucket are retained for 30 days after their last use. Your GCE admin can adjust this period, so please check with them if in doubt.