Let's look at what we have so far:

We started with:

Raw Data (unprofiled samples)

Profiled the raw data to get:

Profiled Samples (profiled but undescribed samples)

Grouped the profiled samples into a study:

Study (collection of profiled samples)

Combined the profiled sample metadata with the SRA metadata:

Associated Metadata (description of samples)


We want to combine the profiled samples and their associated metadata to create a dataset.

A dataset contains samples that are both profiled and described, so we can group them into cohorts and look for statistical patterns. Datasets are necessary for bioinformatic analyses.
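As a rough illustration of the idea, the sketch below joins profiles and metadata on a shared sample ID and then groups the resulting samples into cohorts. It is not the platform's actual mechanism. The sample IDs, taxon names, abundance values, the `condition` field, and the helper functions `build_dataset` and `group_into_cohorts` are all hypothetical and only serve the example.

```python
from collections import defaultdict

# Hypothetical profiled samples: sample ID -> profile (made-up abundances).
profiles = {
    "S1": {"taxon_a": 0.6, "taxon_b": 0.4},
    "S2": {"taxon_a": 0.1, "taxon_b": 0.9},
    "S3": {"taxon_a": 0.5, "taxon_b": 0.5},  # profiled but not described
}

# Hypothetical associated metadata: sample ID -> description of the sample.
metadata = {
    "S1": {"condition": "case"},
    "S2": {"condition": "control"},
    "S4": {"condition": "case"},  # described but not profiled
}


def build_dataset(profiles, metadata):
    """Keep only samples that are both profiled and described."""
    shared_ids = sorted(profiles.keys() & metadata.keys())
    return {
        sample_id: {
            "profile": profiles[sample_id],
            "metadata": metadata[sample_id],
        }
        for sample_id in shared_ids
    }


def group_into_cohorts(dataset, field):
    """Group sample IDs by the value of one metadata field."""
    cohorts = defaultdict(list)
    for sample_id, record in dataset.items():
        cohorts[record["metadata"][field]].append(sample_id)
    return dict(cohorts)


dataset = build_dataset(profiles, metadata)
cohorts = group_into_cohorts(dataset, "condition")
```

In this sketch, S3 and S4 are left out of the dataset because each is missing either a description or a profile, which mirrors the requirement that dataset samples be both profiled and described.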


In the next step, we will upload the metadata to the profiles.

