LogoLogo
EzBioCloud
  • 📄Overview
  • 🔆Highlights
  • 🔬Science Blogs
    • Basics
      • Species
      • Species Taxonomy
      • Chimeras
      • Average Nucleotide Identity
      • OrthoANI
      • Genetic Resolution
    • Identify
      • 16S rRNA
      • Identification with 16S rRNA
      • 16S rRNA Resolution
      • 16S rRNA Database
      • Genome Identification
      • Genome Identification Process
      • Multi-Locus Sequence Typing
      • 16S vs Genome Identification
      • Subspecies
      • Phylogenomic Trees
      • Genome Database
      • Quality Control
    • Profile
      • Tetra-Nucleotide Frequencies
      • 16S Copy Number
      • Up-to-date Bacterial Core Genes
      • UBCG Technical Guide
      • UBCG Set
      • Depth of Sequencing
      • Metagenome-Assembled Genomes Suitability
      • 16S Versus Metagenomic Sequencing
      • Microbiomes
    • Detect
      • Clinical Metagenomics
      • Inferring with Amplicons
      • Pathogenicity Markers
      • Antimicrobial Resistance
      • Clinical Report Process
      • Defining a Pathogen
      • Human Pathogens
      • in silico Serotyping
    • Analyze
      • Alpha Diversity
      • Beta Diversity
      • Co-occurrence
      • Enterotyping
      • Taxonomic Composition
    • reAnalyze
      • reAnalyze #1 - Skin Disease
      • reAnalyze #2 - Skin Ageing
      • PreAnalyze #3 - Scalp Dandruff
  • ⚗️Protocols
    • 16S Identification
      • Get Started
      • Prepare Samples
        • Private Samples
        • Public Samples
      • Navigate Menu
      • Upload Data
        • Single Upload
        • Batch Upload
      • Download Report
    • Genome Identification
      • Get Started
      • Prepare Samples
        • Private Samples
        • Public Samples
        • SRA Samples
      • Navigate Menu
      • Upload Data
        • Whole Genome
        • Illumina
        • Nanopore
      • Download Report
    • Shotgun Microbiome
      • Get Started
      • Download Samples
        • NCBI Route
        • Linux Route
      • Navigate Menus
      • Create Studies
      • Profile Samples
      • Describe Profiles
        • Retrieve Metadata
        • Organize Metadata
        • Upload Metadata
      • Create Datasets
      • Analyze Datasets
        • Quality Check
        • Pie Chart Composition
        • Summary Statistics
        • Group Composition
        • Alpha Diversity
        • Beta Diversity
        • Differential Abundance
        • Enterotype
        • Co-occurrence
        • Co-occurrence Spearman
        • Statistical Matching
        • LEfSe
        • Metadata EDA
        • Profile EDA
    • Clinical Metagenomics
  • 🏛️Dr. Chun's Lectures
  • 🔧Tools
  • 🧫Taxonomy
  • ❔FAQs
    • Identification
    • Clinical Metagenomics
    • Privacy Policy
    • Terms of Service
Powered by GitBook
LogoLogo

Legal

  • Terms of Service
  • Privacy Policy

EzBioCloud© 2024. All Rights Reserved

On this page
  • Tetra-Nucleotide Frequency Patterns
  • Algorithm
  • References
  1. Science Blogs
  2. Profile

Tetra-Nucleotide Frequencies

How are Tetra-Nucleotide Frequencies used in microbiome profiling?

PreviousProfileNext16S Copy Number

Last updated 1 year ago

Tetra-Nucleotide Frequency Patterns

A tetra-nucleotide is a fragment of DNA sequence with 4 bases (e.g. AGTC or TTGG). Pride et al. (2003) showed that the frequency of tetra-nucleotides in bacterial genomes contain useful, albeit weak, phylogenetic signals. Even though tetra-nucleotide analysis (TNA) utilizes the information of whole genome, it is evident that it cannot replace other alignment-based phylogenetic methods such as or 16S rRNA phylogeny. However, TNA can be useful for phylogenetic characterization when whole genome or 16S rRNA gene information is not available. For example, a partial genomic fragment obtained from a metagenome can be identified by TNA (Teeling et al., 2004). TNA is also fast enough that it can be used as a search engine against a large genome database.

Algorithm

Information contained in a genome sequence can be transformed into an array of tetra-nucleotide frequencies (See the below figure).

Information of each genome sequence is now stored as counts of 256 tetra-nucleotides. When two genome sequences are similar, the more correlated these tetra-nucleotide patterns are. Therefore, statistical measures of tetra-nucleotide frequency correlation between two genome sequences can be roughly used to determine the genome-relatedness of two genomes.

Tetra-nucleotide correlation coefficient ranges from 0 to 1, and two identical genomes would produce 1.0.

References

Pride, D. T., Meinersmann, R. J., Wassenaar, T. M. & Blaser, M. J. Evolutionary implications of microbial genome tetranucleotide frequency biases. Genome Res .

Teeling, H., Meyerdierks, A., Bauer, M., Amann, R. & Glockner, F. O. Application of tetranucleotide frequencies for the assignment of genomic fragments. Environ Microbiol .

🔬
13, 145-158 (2003)
6, 938-947 (2004)
OrthoANI