Skip to contents

Database DateGithub Downloads(total)

Latest news on BlueSky

SSU rRNA gene database

The PR2 database was initiated in 2010 in the frame of the BioMarks project from work that had developed in the previous ten years in the Plankton Group of the Station Biologique of Roscoff. Its aim is to provide a reference database of carefully annotated 18S rRNA sequences using eight unique taxonomic fields (from domain to species). At present it contains about 240,000 sequences. A number of metadata fields are available for many sequences, including geo-localisation, whether it originates from a culture or a natural sample, host type etc… The annotation of PR2 is performed by experts from each taxonomic groups. One very important project in this respect is EukRef which has recently decided to merge its effort with PR2. EukRef has built bioinformatics pipelines that have been used during three workshops dedicated to specific taxonomic groups.

The web interface now includes links to the sequences of the Ribosomal Operon Database (ROD) published in Krabberød, A.K., Stokke, E., Thoen, E., Skrede, I. & Kauserud, H. 2025. The Ribosomal Operon Database: A Full-Length rDNA Operon Database Derived From Genome Assemblies. Molecular Ecology Resources. 25:e14031.

Current version

  • Version: 5.1.1

  • Released: 2025-10-10

  • DOI: DOI

Core Team

  • Daniel VAULOT, Station Biologique de Roscoff, CNRS, France and University of Oslo, Norway
  • Javier del CAMPO, Institut de Biologia Evolutiva, Barcelona, Spain

Scientific committee and contributors

Please cite

18S rRNA primer database

The PR2 primer database is a compilation of primers found in the litterature with an in silico analysis against the PR2 database.

metaPR2

The metaPR2 metabarcode database is a compilation of metabarcode datasets processed by the dada2 R package and assigned against PR2.

Ribosomal Operon Database (ROD) - version 1.2

The ROD database contains full-length eukaryotic ribosomal operons. The database is based on the genome assemblies from NCBI, and the operons are extracted from the assemblies. The database currently contains 69,480 operon variants from more than 11,935 genomes.

Eukaryotic mitochondrial cytochrome oxidase (coi) Database (eKOI) - version 1.0

The eKOI database is a curated COI gene database designed to enhance the taxonomic annotation for protists that can be used for COI-based metabarcoding. eKOI integrates data from GenBank and mitochondrial genomes, followed by extensive manual curation to eliminate redundancies and contaminants, recovering 15 947 sequences within 80 eukaryotic phyla.

Questions ?

Report issues

  • Please report any issue on GitHub