Welcome to CATH
CATH is a manually curated classification of protein domain structures. Each protein has been chopped into structural domains and assigned into homologous superfamilies (groups of domains that are related by evolution). This classification procedure uses a combination of automated and manual techniques which include computational algorithms, empirical and statistical evidence, literature review and expert analysis.
New in CATH v3.3
CATH v3.3 is built from 97,625 PDB chains. We have added the following data since v3.2:
- 124 folds (total 1,288)
- 226 superfamilies (total 2,593)
- 1,148 sequence families (total 10,019)
- 14,473 domains (total 128,688)
