# Create Datasets

Now that we have profiled the samples as part of a study and prepared their metadata, we want to create a dataset for analysis. To create a dataset, we attach the metadata to the matching profiles.

## Navigate to the 'Dataset' Tab

<figure><img src="/files/MoU7PF4iFwLCxsyGlBnB" alt=""><figcaption></figcaption></figure>

## Create Dataset

Select 'Create a New Dataset' and fill in the basic information parameters.

<figure><img src="/files/y7I79OlxCEOEOqGNyHY6" alt=""><figcaption></figcaption></figure>

## Select Profiles for a Dataset

There are multiple ways to select profiles for a dataset. We will use the default upload method. Either click the upload area and select the concatenated .xlsx file from the metadata steps, or drag and drop the file into it.

{% file src="/files/fAtG4EnggndZd16uI7Ug" %}
Example concatenated metadata .xlsx file for dataset preparation.
{% endfile %}

{% hint style="info" %}
Datasets can be made up of any profiled samples in your account, i.e., any sample with a profile\_id. These can be profiles from an entire study, a combination of studies, or individually selected profiles. The choice is up to you.
{% endhint %}

<figure><img src="/files/Sp02oQDYpotbtMD9WyM9" alt=""><figcaption></figcaption></figure>

{% hint style="warning" %}
Double-check that each sample name matches its run name so that the metadata is assigned to the correct samples.
{% endhint %}
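
For larger metadata files, the name check above can also be done outside the interface before uploading. The following is a minimal sketch, assuming the metadata rows have already been loaded as a list of dictionaries (for example, exported from the .xlsx file). The column name `sample_name` and the accession values are hypothetical placeholders; `Run` is the run column shown in the metadata.

```python
def find_name_mismatches(rows, sample_key="sample_name", run_key="Run"):
    """Return (row_number, sample, run) for every row whose names differ.

    `sample_key` is a hypothetical column name; adjust it to match
    the header used in your own metadata file.
    """
    mismatches = []
    for row_number, row in enumerate(rows, start=1):
        # Treat missing values as empty strings and ignore stray whitespace.
        sample = str(row.get(sample_key) or "").strip()
        run = str(row.get(run_key) or "").strip()
        if sample != run:
            mismatches.append((row_number, sample, run))
    return mismatches


# Illustrative rows only; these accessions are made up.
rows = [
    {"sample_name": "SRR0000001", "Run": "SRR0000001"},
    {"sample_name": "SRR0000002", "Run": "SRR0000003"},
]

mismatches = find_name_mismatches(rows)
for row_number, sample, run in mismatches:
    print(f"Row {row_number}: sample '{sample}' does not match run '{run}'")
```

Any row reported here is worth correcting in the metadata file before it is uploaded.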

## Select 'Upload'

Upload the selected profiles to the dataset.

<figure><img src="/files/kZV9T5wZAw1TneaHadZe" alt=""><figcaption><p>Processing the dataset</p></figcaption></figure>

<figure><img src="/files/Oy2ye51P5mVIemxMYEU5" alt=""><figcaption><p>Successful dataset upload</p></figcaption></figure>

## Explore Dataset

Clicking on the dataset shows any analyses that have been run on it and the metadata associated with each sample/run.

<figure><img src="/files/TSpckjAmRnB4dp8Xitf1" alt=""><figcaption></figcaption></figure>

{% hint style="info" %}
Fields like Run, age\_at\_collection, etc. have all been added from the SRA. It’s good to confirm these fields are present and filled in before moving on to the next step, analysis.
{% endhint %}
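
If you keep a local copy of the metadata, the same confirmation can be sketched in a few lines of Python. This is an illustrative check only, not part of the platform: it assumes the rows are available as a list of dictionaries, and the helper name `find_missing_fields` and the sample values are hypothetical. The field names `Run` and `age_at_collection` are taken from the hint above.

```python
def find_missing_fields(rows, required=("Run", "age_at_collection")):
    """Map each 1-based row number to the required fields that are empty."""
    problems = {}
    for row_number, row in enumerate(rows, start=1):
        empty = []
        for field in required:
            value = row.get(field)
            # A field counts as missing if it is absent, None, or blank.
            if value is None or str(value).strip() == "":
                empty.append(field)
        if empty:
            problems[row_number] = empty
    return problems


# Illustrative rows only; these values are made up.
rows = [
    {"Run": "SRR0000001", "age_at_collection": "34"},
    {"Run": "SRR0000002", "age_at_collection": ""},
]

problems = find_missing_fields(rows)
for row_number, fields in problems.items():
    print(f"Row {row_number} is missing: {', '.join(fields)}")
```

An empty result suggests the listed fields are filled in for every row; extend `required` with any other fields your analysis depends on.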

<figure><img src="/files/fFvbjEKLfwvE9mutkkNn" alt=""><figcaption></figcaption></figure>

Now let’s analyze this dataset!

