PR2 version 4.14.0
Cafeteriales from Alex Schoenle

1 Reference

  • Schoenle, A., Hohlfeld, M., Rosse, M., Filz, P., Wylezich, C., Nitsche, F., & Arndt, H. (2020). Global comparison of bicosoecid Cafeteria-like flagellates from the deep ocean and surface waters, with reorganization of the family Cafeteriaceae. European Journal of Protistology, 73, 125665.


  • Several sequences deposited as Pseudobodo are actually Cafeteria species.
  • I know that Cavalier-Smith has renamed several Pseudobodo species as a new species, “Boroka karpovii” (Borokaceae). In my stramenopile 18S analysis (Schoenle et al. 2020), the Pseudobodo sequences fell far from the Cafeteriaceae, which is why I used the name Boroka karpovii for them. However, I do not know whether it would be more suitable to change only the family level rather than the name. I marked these sequences in green.
  • I ran BLAST on sequence KX602057 again (it was not in my 18S analysis of stramenopiles), and it is not a Cafeteria species. However, I was not sure what to change it to (marked red).
  • AY665996 is also not a Cafeteria species. I have changed it to Bicosoecida sp., but I am not sure whether this is the best solution (marked red).

2 Init

3 Set up the files

4 Read the original data and reformat

4.1 Read the data

  • Number of sequences = 72

4.2 Add to PR2 missing sequences from Genbank

  • Run the script script_genbank_xml.R on the server

  • Run the second part, PR2-update-GenBank.R

  • Sequences in target group in PR2: 42

  • Sequences in target group in PR2 that need update: 41

  • Updated sequences that are not active in PR2: 4

  • Sequences duplicated (e.g. with and without introns): 0
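
The counts above come from comparing the accession numbers in the contributed file with those already in PR2. The pipeline itself uses R scripts that are not shown here, so the following is only a minimal Python sketch of that kind of comparison. The function name `summarize_accessions`, the accession numbers, and the idea of an "active" flag stored per PR2 entry are all assumptions made for illustration.

```python
def summarize_accessions(update, pr2_active):
    """Summarize how the update accessions relate to PR2.

    update: list of accession strings from the contributed file
    pr2_active: dict mapping PR2 accession -> True if the entry is active
                (hypothetical structure, not the real PR2 schema)
    """
    unique = set(update)
    in_pr2 = unique & set(pr2_active)
    return {
        # accessions present in both the update and PR2
        "in_pr2": sorted(in_pr2),
        # accessions that would have to be fetched from GenBank
        "missing_from_pr2": sorted(unique - in_pr2),
        # accessions present in PR2 but flagged as not active
        "inactive_in_pr2": sorted(a for a in in_pr2 if not pr2_active[a]),
    }


# Hypothetical accession numbers, for illustration only
update_accessions = ["ACC0001", "ACC0002", "ACC0003", "ACC0004"]
pr2_entries = {"ACC0002": True, "ACC0004": False, "ACC0009": True}

summary = summarize_accessions(update_accessions, pr2_entries)
for key, accessions in summary.items():
    print(f"{key}: {len(accessions)}")
```

Sets keep the comparison independent of row order, and sorting the results makes the report reproducible between runs.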

5 Taxonomy

5.1 Build and check

5.2 Find taxa in PR2 that are not included in the update

6 Finalization

6.1 Sequences that need updating

6.3 Sequences without species name or with different species

  • Sequences added: 31
  • Sequences updated: 30
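
One way to arrive at tallies like these is to compare the species assigned in the update with the species currently stored in PR2, accession by accession. The Python sketch below illustrates that idea under stated assumptions: the report does not say how "added" and "updated" are defined, so here "added" means the accession is absent from PR2 and "updated" means it is present with a different species name. The function `classify_updates`, the accessions, and the taxon labels are hypothetical.

```python
def classify_updates(update_species, pr2_species):
    """Classify each update accession as added, updated, or unchanged.

    Both arguments map accession -> species name (hypothetical structure).
    """
    result = {"added": [], "updated": [], "unchanged": []}
    for accession, species in sorted(update_species.items()):
        if accession not in pr2_species:
            result["added"].append(accession)      # new to PR2
        elif pr2_species[accession] != species:
            result["updated"].append(accession)    # taxonomy differs
        else:
            result["unchanged"].append(accession)  # nothing to do
    return result


# Placeholder data, for illustration only
update_species = {"ACC0001": "Taxon A", "ACC0002": "Taxon B", "ACC0003": "Taxon C"}
pr2_species = {"ACC0002": "Taxon X", "ACC0003": "Taxon C"}

status = classify_updates(update_species, pr2_species)
print("Sequences added:", len(status["added"]))
print("Sequences updated:", len(status["updated"]))
```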

7 Save everything to an Excel file

