Publications
- similar pairs (same SCOP fold and same CATH topology, green cells in the tables): 129436 domain pairs
- non-similar pairs (different folds in SCOP and different topologies in CATH, red cells in the tables): 1740476 domain pairs
Mapping of SCOP => CATH classified sets
The following two tables display the mapping of inner nodes from SCOP onto CATH nodes via the mapping method discussed in the paper. Only the best node from CATH according to the F-measure is mapped onto the query node in SCOP. The first table displays all mappable nodes, i.e. all nodes which have a F-measure > 0.0. In the second table we require a higher quality for a mapping, i.e. a F-measure > 0.8. While the mapped nodes now fit very well and their domain sets have a large "overlap", many nodes do not find a mapping partner in CATH which fulfils the quality criterion.
Mapping of CATH => SCOP
Those two tables display detailed data on the consistency checks of domain pairs from CATH via SCOP. The first table displays the cases (number of pairs and percentages summing up to 100% per column) being observed in the consistency checks. E.g. 474.399 pairs (2.604%) two domains from the same CATH superfamily are classified to be in different folds of the same class in SCOP.
The second table displays the same type of information but this time shows how many distinct folds and superfamilies from SCOP and topologies / superfamiles from CATH account for the cases observed. E.g. we find that the 474.399 pairs accounting for the example described above origin from 69 fold and 91 superfamilies in SCOP (29 topologies in and 35 superfamilies in CATH respectively).
Green (positive pairs) and Red (negative pairs) cells in the tables correspond to the pairs of domains selected for the benchmark sets extracted from our mapping between SCOP and CATH.
| outer class | class | architecture | topology | superfamily | |
|---|---|---|---|---|---|
| outer class (SCOP) outer class (CATH) | 745/1258 736/1462 | 711/1202 682/1402 | 724/1189 708/1407 | 174/432 48/512 | 60/79 42/54 |
| class (SCOP) class (CATH) | 745/1258 736/1462 | 706/1194 676/1393 | 698/1148 693/1357 | 238/558 57/618 | 69/91 29/35 |
| fold (SCOP) fold (CATH) | 23/159 88/180 | 39/263 107/331 | 52/286 132/346 | 86/481 71/532 | 32/118 29/42 |
| supfam (SCOP) supfam (CATH) | 21/22 70/87 | 39/42 101/150 | 41/43 106/138 | 65/91 57/253 | 188/253 165/250 |
| family (SCOP) family (CATH) | 14/14 53/62 | 25/26 54/77 | 20/21 55/71 | 36/60 28/130 | 745/1258 736/1462 |
Mapping of SCOP => CATH
Those two tables display detailed data on the consistency checks of domain pairs from SCOP via CATH. The first table displays the cases (number of pairs and percentages summing up to 100% per column) being observed in the consistency checks. E.g. for 70.188 pairs (0.866%%) two domains from the same SCOP family are classified to be in different folds of the same class in CATH. The second table displays the same type of information but this time shows how many distinct folds and superfamilies from SCOP and topologies / superfamiles from CATH account for the cases observed. Therefore, we find that the pairs accounting for the example described above origin from 25 fold and 26 superfamilies in SCOP (54 topologies in and 77 superfamilies in CATH respectively).
Green (positive pairs) and Red (negative pairs) cells in the tables correspond to the pairs of domains selected for the benchmark sets extracted from the mapping.
| outer class | class | fold | superfamily | family | |
|---|---|---|---|---|---|
| outer class (SCOP) outer class (CATH) | 745/1258 736/1462 | 745/1258 736/1462 | 23/159 88/180 | 21/22 70/87 | 14/14 53/62 |
| class (SCOP) class (CATH) | 711/1202 682/1402 | 706/1194 676/1393 | 39/263 107/331 | 39/42 101/150 | 25/26 54/77 |
| arch (SCOP) arch (CATH) | 724/1189 708/1407 | 698/1148 693/1357 | 52/286 132/346 | 41/43 106/138 | 20/21 55/71 |
| top (SCOP) top (CATH) | 174/432 48/512 | 238/558 57/618 | 86/481 71/532 | 65/91 57/253 | 36/60 28/130 |
| hom (SCOP) hom (CATH) | 60/79 42/54 | 69/91 29/35 | 32/118 29/42 | 188/253 165/250 | 745/1258 736/1462 |
In order to allow for an interactive use of the mapping of SCOP and CATH computed we have implemented an interactive browser. You can choose any SCOP or CATH node as entry point and explore the relationships between SCOP and CATH in detail by clicking on a node (a SCOP or CATH set) in the displayed graphs to display its respective mapping in the other classification (CATH or SCOP, respectively).
The following two tables display information about the most current database versions of SCOP and CATH. In particular they display their protein and domain content, their number of families, superfamilies, folds and classes (homologous superfamilies, topologies, architectures and classes respectively). The second table displays the content of the two hierarchies after mapping the domain definitions as described in the paper with a minimal overlap of 0.8.
Data on the content of the most current SCOP and CATH version
| CATH 3.1.0 (January 2007) | SCOP 1.73 (November 2007) | ||
|---|---|---|---|
| class | 4 | 11 | class |
| arch | 40 | -- | -- |
| topology | 1084 | 1283 | fold |
| superfamily | 2091 | 2034 | superfamily |
| -- | -- | 3751 | family |
| domains | 93851 | 97178 | domains |
| pdbs | 30028 | 34495 | pdbs |
| pdbs in both | 27553 | pdbs in both | |
Data content after mapping domains (overlap > 0.8)
| CATH 3.1.0 (January 2007) | SCOP 1.73 (November 2007) | ||
|---|---|---|---|
| class | 4 | 11 | class |
| arch | 38 | -- | -- |
| topology | 736 | 754 | fold |
| superfamily | 1462 | 1258 | superfamily |
| -- | -- | 2228 | family |
| domains | 56104 | domains | |
| pdbs | 19266 | pdbs | |
Mapping of SCOP => CATH classified sets
The following two tables display the mapping of inner nodes from SCOP onto CATH nodes via the mapping method discussed in the paper. Only the best node from CATH according to the F-measure is mapped onto the query node in SCOP. The first table displays all mappable nodes, i.e. all nodes which have a F-measure > 0.0. In the second table we require a higher quality for a mapping, i.e. a F-measure > 0.8. While the mapped nodes now fit very well and their domain sets have a large "overlap", many nodes do not find a mapping partner in CATH which fulfils the quality criterion.
Mapping of CATH => SCOP
Those two tables display detailed data on the consistency checks of domain pairs from CATH via SCOP. The first table displays the cases (number of pairs and percentages summing up to 100% per column) being observed in the consistency checks. E.g. 474.399 pairs (2.604%) two domains from the same CATH superfamily are classified to be in different folds of the same class in SCOP.
The second table displays the same type of information but this time shows how many distinct folds and superfamilies from SCOP and topologies / superfamiles from CATH account for the cases observed. E.g. we find that the 474.399 pairs accounting for the example described above origin from 69 fold and 91 superfamilies in SCOP (29 topologies in and 35 superfamilies in CATH respectively).
Green (positive pairs) and Red (negative pairs) cells in the tables correspond to the pairs of domains selected for the benchmark sets extracted from our mapping between SCOP and CATH.
| outer class | class | architecture | topology | superfamily | |
|---|---|---|---|---|---|
| outer class (SCOP) outer class (CATH) | 745/1258 736/1462 | 711/1202 682/1402 | 724/1189 708/1407 | 174/432 48/512 | 60/79 42/54 |
| class (SCOP) class (CATH) | 745/1258 736/1462 | 706/1194 676/1393 | 698/1148 693/1357 | 238/558 57/618 | 69/91 29/35 |
| fold (SCOP) fold (CATH) | 23/159 88/180 | 39/263 107/331 | 52/286 132/346 | 86/481 71/532 | 32/118 29/42 |
| supfam (SCOP) supfam (CATH) | 21/22 70/87 | 39/42 101/150 | 41/43 106/138 | 65/91 57/253 | 188/253 165/250 |
| family (SCOP) family (CATH) | 14/14 53/62 | 25/26 54/77 | 20/21 55/71 | 36/60 28/130 | 745/1258 736/1462 |
Mapping of SCOP => CATH
Those two tables display detailed data on the consistency checks of domain pairs from SCOP via CATH. The first table displays the cases (number of pairs and percentages summing up to 100% per column) being observed in the consistency checks. E.g. for 70.188 pairs (0.866%%) two domains from the same SCOP family are classified to be in different folds of the same class in CATH. The second table displays the same type of information but this time shows how many distinct folds and superfamilies from SCOP and topologies / superfamiles from CATH account for the cases observed. Therefore, we find that the pairs accounting for the example described above origin from 25 fold and 26 superfamilies in SCOP (54 topologies in and 77 superfamilies in CATH respectively).
Green (positive pairs) and Red (negative pairs) cells in the tables correspond to the pairs of domains selected for the benchmark sets extracted from the mapping.
| outer class | class | fold | superfamily | family | |
|---|---|---|---|---|---|
| outer class | 962.011.672 79.380% | 27.559.663 8.311% | 131.175 0.989% | 35.834 0.402% | 2.326 0.029% |
| class | 220.082.756 18.1100% | 186.177.961 56.146% | 338.810 2.553% | 167.921 1.882% | 70.188 0.866% |
| arch | 29.352.140 2.422% | 82.570.027 24.901% | 371.882 2.803% | 113.038 1.267% | 7.316 0.090% |
| top | 444.171 0.037% | 34.815.630 10.499% | 10.879.564 81.994% | 396.388 4.443% | 53.505 0.6100% |
| hom | 18.286 0.002% | 474.399 0.143% | 1.547.324 11.661% | 8.208.965 92.007% | 7.970.415 98.355% |
| sum | 1.211.908.992 77.005% | 331.597.664 21.070% | 13.268.755 0.843% | 8.922.146 0.567% | 8.103.750 0.515% |
| outer class | class | fold | superfamily | family | |
|---|---|---|---|---|---|
| outer class (SCOP) outer class (CATH) | 745/1258 736/1462 | 745/1258 736/1462 | 23/159 88/180 | 21/22 70/87 | 14/14 53/62 |
| class (SCOP) class (CATH) | 711/1202 682/1402 | 706/1194 676/1393 | 39/263 107/331 | 39/42 101/150 | 25/26 54/77 |
| arch (SCOP) arch (CATH) | 724/1189 708/1407 | 698/1148 693/1357 | 52/286 132/346 | 41/43 106/138 | 20/21 55/71 |
| top (SCOP) top (CATH) | 174/432 48/512 | 238/558 57/618 | 86/481 71/532 | 65/91 57/253 | 36/60 28/130 |
| hom (SCOP) hom (CATH) | 60/79 42/54 | 69/91 29/35 | 32/118 29/42 | 188/253 165/250 | 745/1258 736/1462 |
In the table below we show all pairs of domains calculated on the mappable subset of all CATH domains sharing less than 50% sequence identity, which are inconsistently defined in SCOP and CATH. Only pairs where both SCOP domains belong to the classes 'a'-'d' are shown. The rows correspond to CATH levels, the columns to levels of the SCOP hierarchy. Clicking e.g. on the class row and the family column shows all pairs of domains which are classified to be in the same family in SCOP but to be in different classes in CATH.
| outer class | class | fold | superfamily | family | |
|---|---|---|---|---|---|
| outer class | 523 pairs | 297 pairs | 11 pairs | ||
| class | 2647 pairs | 764 pairs | 114 pairs | ||
| arch | 1863 pairs | 92 pairs | |||
| top | 2908 pairs | 294631 pairs | 3190 pairs | 332 pairs | |
| hom | 70 pairs | 2462 pairs | 9873 pairs |
Here we display the subgraphs of similar folds in SCOP which have been identified by a consistency check in CATH as described in the paper. We show the SCOP fold identifiers as well as the number of connections / links per fold in brackets. Click on the image link to view the fold graph in detail.
- image c.120 (37) c.60 (37) c.26 (37) c.108 (37) c.25 (37) c.15 (37) c.33 (37) c.91 (37) c.41 (37) c.48 (37) c.69 (37) c.34 (37) c.36 (37) c.30 (37) c.2 (37) c.78 (37) c.17 (37) c.4 (37) c.31 (37) c.56 (38) c.32 (37) c.44 (37) c.129 (37) c.66 (37) c.23 (37) c.5 (37) d.108 (1) c.72 (37) c.52 (37) c.80 (37) c.125 (37) c.24 (37) c.16 (37) c.62 (37) c.51 (37) c.61 (37) c.65 (37) c.37 (37) c.124 (37)
- image b.3 (8) b.94 (9) b.71 (8) d.273 (9) b.19 (9) b.82 (17) b.2 (8) b.115 (9) b.15 (8) b.121 (9) b.16 (8) b.22 (9) b.1 (8) b.6 (8) b.29 (9) b.7 (8) b.18 (9) b.23 (9)
- image b.34 (7) d.12 (2) b.136 (3) d.58 (2) d.17 (2) d.1 (2) d.9 (4) b.38 (3) i.1 (6) b.40 (4) b.84 (4) g.41 (5)
- image h.3 (8) f.14 (8) f.28 (8) a.214 (8) a.30 (8) a.16 (8) a.2 (8) f.17 (8) a.32 (8)
- image a.56 (3) a.4 (5) a.51 (2) d.52 (3) d.227 (3) a.61 (3) d.130 (3) j.84 (2) a.60 (6)
- image a.238 (1) a.8 (6) f.23 (3) f.3 (3) h.1 (3) a.156 (2) a.5 (2)
- image a.118 (2) d.110 (1) i.23 (2) d.211 (1)
- image d.67 (2) d.74 (2) d.185 (2)
- image d.230 (2) d.240 (1) d.68 (1)
- image g.3 (2) g.69 (2) g.68 (2)
- image b.49 (2) b.43 (2) b.44 (2)
- image a.1 (1) f.1 (1)
- image d.78 (1) a.143 (1)
- image a.19 (1) a.69 (1)
- image a.47 (1) a.7 (1)
- image a.6 (1) d.201 (1)
- image b.107 (1) b.4 (1)
- image b.122 (1) b.53 (1)
- image b.60 (1) b.61 (1)
- image c.1 (1) c.6 (1)
- image c.59 (1) c.45 (1)
- image d.81 (1) c.47 (1)
- image d.129 (1) d.105 (1)
- image k.45 (1) d.15 (1)
- image d.55 (1) d.150 (1)
- image g.37 (1) d.50 (1)
- image d.87 (1) d.54 (1)
- image g.27 (1) g.18 (1)
- image g.50 (1) g.44 (1)
Here we display the subgraphs of similar topologies in CATH which have been identified by a consistency check in SCOP as described in the paper. We show the CATH topology ids as well as the number of connections / links per fold in brackets. Click on the image link to view the topology graph in detail.
- image 2.70.70 (1) 2.20.25 (4) 3.90.80 (2) 2.40.200 (2) 3.30.1060 (3) 2.40.50 (5) 3.40.462 (3) 2.30.30 (5) 2.60.11 (4) 3.30.70 (5) 3.10.450 (4) 3.40.1120 (1) 2.20.28 (4) 3.90.320 (3)
- image 3.40.580 (6) 3.40.1190 (1) 3.40.810 (2) 3.40.1350 (6) 3.90.950 (1) 3.40.850 (1) 3.40.1340 (6) 3.40.630 (1) 3.40.50 (12) 3.40.91 (6) 3.40.210 (6) 3.40.600 (6) 3.40.1080 (2)
- image 2.60.130 (1) 2.60.40 (4) 2.70.9 (4) 3.60.130 (2) 2.170.30 (4) 2.70.50 (1) 2.70.100 (1) 2.60.169 (4) 3.90.209 (1) 2.60.120 (8) 2.60.175 (4)
- image 4.10.95 (7) 4.10.220 (1) 1.10.8 (2) 1.10.442 (7) 4.10.49 (7) 1.20.1270 (2) 1.20.5 (10) 4.10.93 (7) 4.10.51 (7) 4.10.81 (7) 4.10.91 (7)
- image 4.10.640 (5) 1.10.10 (5) 1.10.1270 (5) 1.10.1680 (5) 1.10.1750 (5) 1.10.150 (6) 3.30.300 (1)
- image 3.90.1410 (6) 2.10.150 (6) 3.90.1210 (6) 2.40.340 (6) 2.40.300 (6) 2.70.40 (6) 2.170.270 (6)
- image 2.10.69 (4) 2.10.22 (4) 2.10.25 (4) 3.30.30 (4) 3.30.60 (4)
- image 1.25.40 (3) 1.25.10 (4) 1.10.2080 (3) 1.20.190 (3) 3.30.450 (1)
- image 3.40.367 (3) 2.30.37 (3) 3.40.350 (3) 3.30.420 (3)
- image 2.40.10 (3) 3.20.14 (2) 2.40.30 (3) 2.40.37 (2)
- image 3.30.1660 (2) 3.30.1490 (2) 3.30.110 (3) 3.65.10 (1)
- image 3.30.310 (3) 3.90.380 (3) 3.30.1400 (3) 3.30.530 (3)
- image 3.90.176 (3) 3.90.228 (3) 3.90.210 (3) 3.90.175 (3)
- image 1.10.490 (2) 1.10.1060 (1) 1.10.437 (1)
- image 4.10.160 (2) 4.10.260 (2) 1.10.1620 (2)
- image 3.90.20 (1) 1.10.287 (2) 1.20.20 (1)
- image 2.40.240 (2) 2.30.130 (2) 3.10.400 (2)
- image 3.30.50 (2) 4.10.830 (2) 2.30.170 (2)
- image 2.40.170 (2) 2.40.230 (2) 2.40.160 (2)
- image 3.90.1580 (2) 3.10.40 (2) 3.10.100 (2)
- image 3.30.379 (2) 3.90.1240 (2) 3.40.390 (2)
- image 3.90.540 (2) 3.40.570 (2) 3.90.75 (2)
- image 1.10.120 (1) 1.10.110 (1)
- image 1.10.1130 (1) 3.90.10 (1)
- image 1.10.1140 (1) 1.20.150 (1)
- image 1.10.1220 (1) 1.10.140 (1)
- image 1.10.3210 (1) 1.10.1300 (1)
- image 1.10.1480 (1) 3.90.940 (1)
- image 1.10.1660 (1) 3.30.56 (1)
- image 3.30.240 (1) 1.10.260 (1)
- image 1.10.3130 (1) 2.160.10 (1)
- image 1.10.510 (1) 3.30.200 (1)
- image 1.20.144 (1) 1.10.606 (1)
- image 1.20.1260 (1) 1.10.620 (1)
- image 1.20.950 (1) 1.20.1300 (1)
- image 1.20.140 (1) 1.20.920 (1)
- image 1.50.10 (1) 1.50.30 (1)
- image 2.40.20 (1) 2.10.10 (1)
- image 2.150.10 (1) 2.160.20 (1)
- image 3.90.1590 (1) 2.170.150 (1)
- image 2.20.20 (1) 3.10.360 (1)
- image 2.30.33 (1) 3.90.180 (1)
- image 2.40.310 (1) 2.40.128 (1)
- image 3.30.500 (1) 3.10.320 (1)
- image 3.20.110 (1) 3.20.20 (1)
- image 3.30.479 (1) 3.30.1130 (1)
- image 3.30.1360 (1) 3.30.830 (1)
- image 3.30.1700 (1) 3.30.230 (1)
- image 3.30.360 (1) 3.40.30 (1)
- image 3.50.30 (1) 3.50.7 (1)
- image 3.60.60 (1) 3.60.20 (1)
- image 3.90.70 (1) 3.90.260 (1)