CATH / Gene3D v4.1

53 million protein domains classified into 2,737 superfamilies

3D Structure

Find out what 3D structure your protein adopts

Protein Evolution

Learn about a particular protein family and how it evolved

Protein Function

Investigate the function of your protein

Conserved Sites

Look at protein sites that are highly conserved and implicated in function

Download Data

Download data files and query CATH via webservices

Learn more

Find out how CATH is created and maintained, how to link to CATH and more

What is CATH-Gene3D?

CATH is a classification of protein structures downloaded from the Protein Data Bank. We group protein domains into superfamilies when there is sufficient evidence they have diverged from a common ancestor.

Gene3D uses the information in CATH to predict the locations of structural domains on millions of protein sequences available in public databases. This allows us to add further annotations to the CATH-Gene3D database, such as functional information and active site residues.

If you have any questions, comments or suggestions, please get in touch via Twitter, ask a question in our online forum or visit our support page.


ELIXIR-UK Resource

CATH-Gene3D is a proud member of ELIXIR, the Europe-wide project providing high-quality, sustainable infrastructure for biological information. Following a strict selection procedure, CATH-Gene3D has been identified as a "Tier 1" resource by ELIXIR-UK (Protein Structure and Function).

Latest Release Statistics Info

PDB Release 01-01-2015
Domains 308999 440012
Superfamilies 2737 6892
Annotated PDBs 108378 131091

Gene3D v14
Cellular Genomes 19,471
Protein Sequences 43,387,462
CATH Domain Predictions 53,479,436

Citing this resource

If you find the information in this resource useful, please consider using the following citations:

CATH: comprehensive structural and functional annotations for genome sequences.
Sillitoe I, Lewis TE, Cuff AL, Das S, Ashford P, Dawson NL, Furnham N, Laskowski RA, Lee D, Lees J, Lehtinen S, Studer R, Thornton JM, Orengo CA.
Nucleic Acids Res. 2015 Jan

Gene3D: expanding the utility of domain assignments.
Lam SD, Dawson NL, Das S, Sillitoe I, Ashford P, Lee D, Lehtinen S, Orengo CA, Lees JG.
Nucleic Acids Res. 2016 Jan


The CATH and Gene3D resources have enjoyed generous funding from a number of research councils.

BBSRC, MRC, NIH, Wellcome, ERC