Microbial DNA barcoding

[2] Only a few years later, a new tree of life with three domains was proposed by again Woese and colleagues, who were the first to use the small subunit of the ribosomal RNA (SSU rRNA) gene to distinguish between bacteria, archaea and eukaryotes.

With the development of HTS (High-Throughput-Sequencing) in the early 2000s and the ability to deal with this massive data using modern bioinformatics and cluster algorithms, investigating microbial life got much easier.

[4] Taxonomic identification of microbial eukaryotes requires exceedingly skillful expertise and is often difficult due to small sizes of the organisms, fragmented individuals, hidden diversity and cryptic species.

[10] These mentioned qualities make DNA barcoding a cost-effective, reliable and less time-consuming method, compared to the traditional ones, to meet the increasing need for large-scale environmental assessments.

While it is difficult to define species within prokaryotic organisms, 16S marker can be used for determining individual operational taxonomic units (OTUs).

In some cases, these OTUs can also be linked to traditionally defined species and can therefore be considered a reliable representation of the evolutionary relationships.

[17] However, when analyzing a taxonomic structure or biodiversity of a whole cyanobacterial community (see DNA metabarcoding), it is more informative to use markers specific for cyanobacteria.

[21] A few sets of such primers have been tested for barcoding or metabarcoding of environmental samples and gave good results, screening out majority of non-photosynthetic or non-cyanobacterial organisms.

[25] Besides 16S marker, phylogenetic studies could therefore include also more variable sequences, such as sequences of protein-coding genes (gyrB, rpoC, rpoD,[26] rbcL, hetR,[27] psbA,[28][29] rnpB,[30] nifH,[31] nifD[32]), internal transcribed spacer of ribosomal RNA genes (16S-23S rRNA-ITS)[33][25] or phycocyanin intergenic spacer (PC-IGS).

General databases like the NCBI platform include all kind of sequences, either whole genomes or specific marker genes of all organisms.

Cyanobacteria from genus Dolichospermum, as seen under a microscope.