LogoLogo
EzBioCloud
  • 📄Overview
  • 🔆Highlights
  • 🔬Science Blogs
    • Basics
      • Species
      • Species Taxonomy
      • Chimeras
      • Average Nucleotide Identity
      • OrthoANI
      • Genetic Resolution
    • Identify
      • 16S rRNA
      • Identification with 16S rRNA
      • 16S rRNA Resolution
      • 16S rRNA Database
      • Genome Identification
      • Genome Identification Process
      • Multi-Locus Sequence Typing
      • 16S vs Genome Identification
      • Subspecies
      • Phylogenomic Trees
      • Genome Database
      • Quality Control
    • Profile
      • Tetra-Nucleotide Frequencies
      • 16S Copy Number
      • Up-to-date Bacterial Core Genes
      • UBCG Technical Guide
      • UBCG Set
      • Depth of Sequencing
      • Metagenome-Assembled Genomes Suitability
      • 16S Versus Metagenomic Sequencing
      • Microbiomes
    • Detect
      • Clinical Metagenomics
      • Inferring with Amplicons
      • Pathogenicity Markers
      • Antimicrobial Resistance
      • Clinical Report Process
      • Defining a Pathogen
      • Human Pathogens
      • in silico Serotyping
    • Analyze
      • Alpha Diversity
      • Beta Diversity
      • Co-occurrence
      • Enterotyping
      • Taxonomic Composition
    • reAnalyze
      • reAnalyze #1 - Skin Disease
      • reAnalyze #2 - Skin Ageing
      • PreAnalyze #3 - Scalp Dandruff
  • ⚗️Protocols
    • 16S Identification
      • Get Started
      • Prepare Samples
        • Private Samples
        • Public Samples
      • Navigate Menu
      • Upload Data
        • Single Upload
        • Batch Upload
      • Download Report
    • Genome Identification
      • Get Started
      • Prepare Samples
        • Private Samples
        • Public Samples
        • SRA Samples
      • Navigate Menu
      • Upload Data
        • Whole Genome
        • Illumina
        • Nanopore
      • Download Report
    • Shotgun Microbiome
      • Get Started
      • Download Samples
        • NCBI Route
        • Linux Route
      • Navigate Menus
      • Create Studies
      • Profile Samples
      • Describe Profiles
        • Retrieve Metadata
        • Organize Metadata
        • Upload Metadata
      • Create Datasets
      • Analyze Datasets
        • Quality Check
        • Pie Chart Composition
        • Summary Statistics
        • Group Composition
        • Alpha Diversity
        • Beta Diversity
        • Differential Abundance
        • Enterotype
        • Co-occurrence
        • Co-occurrence Spearman
        • Statistical Matching
        • LEfSe
        • Metadata EDA
        • Profile EDA
    • Clinical Metagenomics
  • 🏛️Dr. Chun's Lectures
  • 🔧Tools
  • 🧫Taxonomy
  • ❔FAQs
    • Identification
    • Clinical Metagenomics
    • Privacy Policy
    • Terms of Service
Powered by GitBook
LogoLogo

Legal

  • Terms of Service
  • Privacy Policy

EzBioCloud© 2024. All Rights Reserved

On this page
  • Select ‘Differential Abundance’
  • Select your dataset
  • Fill in details for future referencing
  • Input the arguments according to your desired output
  • Select ‘Run Analysis’
  • Wait for the analysis to complete
  • Discover Differential Analysis
  1. Protocols
  2. Shotgun Microbiome
  3. Analyze Datasets

Differential Abundance

PreviousBeta DiversityNextEnterotype

Last updated 1 year ago

Let’s run a Differential Abundance analysis of the profiles in this dataset from the Parkinson’s disease study.

Select ‘Differential Abundance’

Select your dataset

Fill in details for future referencing

Input the arguments according to your desired output

Select ‘Run Analysis’

Wait for the analysis to complete

Discover Differential Analysis

Differential abundance analysis (DAA) finds candidate biomarkers for the group of interest. For example, several questions can be addressed, e.g. ‘Which microbes are more abundant in the IBD group compared to the healthy group?’. Tools for DAA can find taxa enriched in IBD or healthy groups (We can find candidate biomarkers based on (1) the fold change value between groups and (2) the p-value). However, results may vary depending on the DAA tool used. Therefore, seven DAA tools, including ANCOM-BC, ALDEx2, DESeq2, LEfSe, MaAsLin2, SIAMCAT and LinDA, were implemented to find candidate biomarkers, and the results of each DAA tool were compared to find consensus biomarkers.

Input

taxonomic profiles and metadata with group information (this engine is currently for the two groups: e.g., case VS control)

Output

ANCOM-BC/, ALDEx2/, DESeq2/, LEfSe/, MaAsLin2/, SIAMCAT/, LinDA/: Candidate biomarkers for each tool. *_raw.csv is the first output when people run that tool (not processed, columns are different between each tool). *.csv is the processed output from the *_raw.csv (Columns are matched between each tool).

Columns in the *.csv:

  • effect: effect size (for example, fold change values between the case and the control groups)

  • se: standard error of the effect size

  • pval: p-value

  • qval: q-value (p-value after multiple testing correction, we often use this value for selecting significant features)

  • direct: ‘+’ means enriched in the case group.

  • sig: ‘1’ means significant feature based on the q-value threshold (like 0.05), ‘0’ means ‘not significant’.

  • score: if direct is ‘+’, sig1. if direct is ‘-‘, sig-1.

Consensus/: a consensus table and a consensus plot (upsetPlot). ‘total_result.csv’ is a concatenated table from *.csv files of each tool. From ‘total_result.csv’, significant features selected from one or more DAA tools were collected (upsetPlot.csv). upsetPlot.csv was visualized (upsetPlot.png). In the plot, ‘red’ means case-enriched, and ‘blue’ means control-enriched.

⚗️