# Create Datasets

Now that we have profiled samples as part of a study and associated metadata, we want to create a dataset for analyses. To create a dataset, we attach the metadata to the matching profiles.&#x20;

## Navigate to the 'Dataset' Tab

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2FBDaTGSBbWbAS63OPLnoq%2Fimage.png?alt=media&#x26;token=0cc3ff95-b27d-4ee9-b0ff-d0b8768f7c98" alt=""><figcaption></figcaption></figure>

## Create Dataset

Select 'Create a New Dataset' and fill in the basic information parameters.

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2FYBa3uXJmQ9CLAx717fcw%2Fimage.png?alt=media&#x26;token=13dc741f-68da-4398-bc58-6852629e038d" alt=""><figcaption></figcaption></figure>

## Select Profiles for a Dataset

There are multiple ways to select profiles for a dataset. We will use the default upload type method. Either, click or drag and drop the concatenated .xlsx file from the metadata steps into the upload area.&#x20;

{% file src="<https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2Fa529o8ATDP864wG9UszA%2FPRJNA834801_dataset.xlsx?alt=media&token=7b3820b3-913b-4408-ac1e-0cd44bbcee08>" %}
Example, concatenated metadata .xlsx file for dataset preparation.
{% endfile %}

{% hint style="info" %}
Datasets can be made up of any profiled samples in your account i.e. any sample with a profile\_id. These can be profiles from an entire study, a combination of studies, or individually selected profiles. The choice is up to you.&#x20;
{% endhint %}

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2FsW2H3QzX2sLd6UzlBLbf%2Fimage.png?alt=media&#x26;token=a5507116-c83a-4093-85ca-f64ff17bff34" alt=""><figcaption></figcaption></figure>

{% hint style="warning" %}
Double-check the sample name matches the run name, ensuring the metadata is correctly assigned to the right samples.
{% endhint %}

## Select 'Upload'

Upload the selected profiles to the dataset.&#x20;

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2FxS9fIAafPRf731Iq1ELL%2Fimage.png?alt=media&#x26;token=457b59b5-d66a-4290-8af6-d5c0db311e10" alt=""><figcaption><p>Processing the dataset</p></figcaption></figure>

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2F7nKDesBjoTaw9HmDZe4p%2Fimage.png?alt=media&#x26;token=76f98362-3e9e-4f20-b649-6cc7b629be9b" alt=""><figcaption><p>Successful dataset upload</p></figcaption></figure>

## Explore Dataset

Clicking on the dataset will show you any analyses we have run on it and the metadata associated with each sample/run.&#x20;

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2FkxT36ScMAUBfr5ynsvRf%2Fimage.png?alt=media&#x26;token=010de8a2-8c79-434f-b4b0-fa37290b69d0" alt=""><figcaption></figcaption></figure>

{% hint style="info" %}
Fields like Run, age\_at\_collection, etc. have all been added from the SRA. It’s good to confirm these are ready for the next step, analysis.
{% endhint %}

<figure><img src="https://820779907-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FDWKOAVP0eaMhg1acSkor%2Fuploads%2F1TRFr3kCk4oI1sozijNN%2Fimage.png?alt=media&#x26;token=2e48eb61-4e81-4bed-bb12-3aa5bfba8fa3" alt=""><figcaption></figcaption></figure>

Now let’s analyze this dataset!
