MSA Server Database History
Milot Mirdita edited this page Aug 3, 2025 · 5 revisions
Date | UniRef30 | PDB | Environmental DB |
---|---|---|---|
Release | 2103 | PDB70 200916 | BFD/MGnify |
2022-03-04 | 2103 | PDB70 200916 | ColabFoldDB 202108 |
2022-07-13 | 2202 | PDB70 220313 | ColabFoldDB 202108 |
2023-06-12 | 2302 | PDB100 230517 | ColabFoldDB 202108 |
2023-07-27 | 2202 | PDB70 220313 | ColabFoldDB 202108 |
2023-07-31 | 2302 | PDB100 230517 | ColabFoldDB 202108 |
2025-08-04 | 2302* | PDB100 230517 | ColabFoldDB 202108 |
\* We updated the taxonomy/pairing files for the UniRef100. Previously, the pairing was done on the UniRef LCA (lowest common ancestor). We switched this to the taxon identifier of the representative sequence. We also updated the `_taxonomy` file, which is not directly used in the ColabFold search. About 8000 taxa were missing from it, so some hits were labeled as unclassified instead of receiving a more specific label.
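
To illustrate the pairing change described above, here is a minimal sketch of how an LCA label can differ from the taxon of a representative sequence. The taxonomy, the taxon names, and the cluster layout (representative first) are all hypothetical and chosen only for illustration; this is not the actual code or file format used by the server.

```python
# Toy taxonomy: child taxon -> parent taxon.
# All identifiers are hypothetical placeholders, not real NCBI taxon IDs.
PARENT = {
    "root": None,
    "bacteria": "root",
    "proteobacteria": "bacteria",
    "e_coli": "proteobacteria",
    "salmonella": "proteobacteria",
}


def lineage(taxon):
    """Return the path from a taxon up to the root, starting with the taxon itself."""
    path = []
    while taxon is not None:
        path.append(taxon)
        taxon = PARENT[taxon]
    return path


def lowest_common_ancestor(taxa):
    """Return the deepest taxon shared by the lineages of all given taxa."""
    common = set(lineage(taxa[0]))
    for taxon in taxa[1:]:
        common &= set(lineage(taxon))
    # Walking up from the first taxon, the first shared entry is the deepest one.
    return next(t for t in lineage(taxa[0]) if t in common)


# Hypothetical cluster: the representative sequence's taxon comes first.
cluster_members = ["e_coli", "salmonella"]

lca_label = lowest_common_ancestor(cluster_members)  # previous approach: cluster LCA
representative_label = cluster_members[0]            # current approach: representative's taxon

print(lca_label, representative_label)
```

In this toy case the LCA collapses the cluster to a higher-level taxon, while the representative keeps a species-level identifier, which is the kind of difference that can matter when sequences are paired by taxon.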
We updated the `setup_databases.sh` script to first download the base files with the old pairing/taxonomy files and then download a second archive that replaces only these files. If you don't want to re-download all databases, you can download only the two changed files in this archive.