Dr. Alison Cuff, Dr. Ian Sillitoe, Dr. Mark Dibley, Mr. Tony Lewis, Mr. Oliver Redfern, Dr. Frances M.G. Pearl
Contributors to the CATH Version 3.1.0 Release
Ms. Sarah Addou,
Mr. Tim Dallman,
Mr. Benoit Dessailly,
Dr. Lesley Greene,
Dr. David Lee,
Dr. Jon Lees,
Dr. Russell L. Marsden,
Mr. Adam Reid,
Mr. Stathis Sideris,
Dr. Corin Yeats,
Prof. Janet Thornton,
Prof. Christine A. Orengo
Full HMM Library (right-click link and select "Save as...")
Concatenated file of 7794 models representing all sequence families in CATH v3.1.0 (gzipped HMMER2.0 format, 83MB)
Class, derived from secondary structure content, is
assigned for more than 90% of protein structures automatically.
Architecture, which describes the gross orientation of secondary
structures, independent of connectivities, is currently assigned
manually. The topology level clusters structures into fold groups
according to their topological connections and numbers of secondary
structures. The homologous superfamilies cluster proteins with highly
similar structures and functions. The assignments of structures to
fold groups and homologous superfamilies are made by sequence and
structure comparisons.
The boundaries and assignments for each protein domain
are determined using a combination of automated and manual
procedures. These include computational techniques, empirical and
statistical evidence, literature review and expert analysis.
Assessing strategies for improved superfamily recognition.Sillitoe I, Dibley M, Bray J, Addou S. and Orengo C. (2005) Protein Science. Vol 14. No 7. p. 1800-1810.
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis.Pearl F, Todd A, Sillitoe I, Dibley M, Redfern O, Lewis T, Bennett C, Marsden R, Grant A, Lee D, Akpor A, Maibaum M, Harrison A, Dallman T, Reeves G, Diboun I, Addou S, Lise S, Johnston C, Sillero A, Thornton J, Orengo C. (2005) Nucleic Acids Research. Vol. 33 Database Issue D247-D251