Welcome to the Utrecht Biomolecular Interaction Web Portal >>

Dataset

The Prodigy datasets site includes the following sections:

The predictive model developed in PRODIGY was trained using a subset of bound structures from the structure-based protein–protein binding affinity benchmark of Kastritis et al. (2011). It contains 144 non-redundant protein–protein complexes with known 3D structures (of both the unbound and bound components) and associated experimental binding affinity (ΔG) values.

From this dataset we obtained our “reliable” dataset on which we trained our model according to the following cases:
  • Removing cases with not unique values of ΔG
  • Removing the complexes that showed gaps or unresolved fragments at the binding interface (considering a gap to be a missing segment longer than two residues)
  • Removing the complexes for which the ΔG has been measured with a less reliable experimental technique according to the results we reported in Vangone and Bonvin, eLife (2015).

In the following table experimental data, number of ICs and NIS properties have been reported for our 'cleaned' dataset made of 81 protein-protein complexes.

Download the table here (.csv files)
PDB Chains Type ΔG_exp ΔG_pred Method i-RMSD BSA ICs_CC ICs_CP ICs_CA ICs_PP ICs_PA ICs_AA NIS_P NIS_A NIS_C
1A2K.pdb C:AB OG -9.3 -9.0 ITC 1.1 1603 5 4 20 2 11 25 31.6 41.8 26.6
1ACB.pdb E:I EI -13.1 -11.9 spectroscopy 1.1 1544 3 2 19 4 19 25 42.7 38.5 18.8
1AK4.pdb A:D OX -6.4 -7.3 ITC 1.3 1029 0 4 14 1 10 20 27.9 45.4 26.6
1AKJ.pdb AB:DE OX -5.3 -7.3 SPR 1.1 1995 8 20 7 18 18 2 35.2 35.2 29.6
1ATN.pdb A:D OX -12.1 -10.9 spectroscopy 3.3 1774 9 8 20 4 15 28 36.3 34.9 28.7
1AVZ.pdb B:C NC -6.5 -8.4 ITC 0.7 1259 5 8 9 2 14 10 30.6 42.5 26.9
1B6C.pdb A:B OX -8.9 -9.4 SPR 2.0 1752 6 11 18 1 11 21 31.5 35.4 33.1
1BJ1.pdb HL:VW AB -11.6 -12.2 SPR 0.5 1731 3 8 21 7 22 19 41.7 35.7 22.6
1BUH.pdb A:B EI -9.7 -8.4 SPR 0.8 1324 12 4 20 0 5 17 28.8 41.2 29.9
1BVK.pdb DE:F A -10.5 -11.3 stopped-flow 1.2 1321 2 8 18 2 14 14 43.8 33.6 22.7
1BVN.pdb P:T EI -15.1 -13.3 SPR 0.9 2222 3 12 22 0 23 33 38.2 36.8 25
1CBW.pdb ABC:D NC -10.7 -11.3 spectroscopy 0.7 1457 3 10 26 1 13 28 39.7 39.7 20.6
1DE4.pdb AB:CF OX -9.8 -8.9 SPR 2.6 2066 5 13 17 9 14 9 35.2 33.9 30.9
1DQJ.pdb AB:C A -11.7 -12.7 SPR 0.8 1765 2 20 18 11 25 15 47.8 31.4 20.8
1E4K.pdb AB:C OR -7.9 -9.4 SPR 2.6 1634 3 8 26 2 5 21 38.9 36.0 25.2
1E6E.pdb A:B ES -8.3 -8.6 SPR 1.3 2315 18 15 17 5 11 11 26.1 42.8 31.2
1E6J.pdb HL:P A -10.3 -9.1 SPR 1.1 1245 0 7 8 5 15 26 40.8 38.8 20.5
1E96.pdb A:B OG -7.4 -8.0 ITC 0.7 1179 4 7 16 4 11 8 28.7 37.2 34.1
1EER.pdb A:BC OR -15.6 -13.6 SPR 2.4 3347 30 19 25 12 31 19 26.1 43.8 30.1
1EFN.pdb B:A OX -10.1 -8.0 SPR 0.9 1253 5 5 12 1 12 16 27.5 44.3 28.2
1EMV.pdb A:B EI -18.6 -11.0 stopped-flow 1.3 1535 4 12 11 4 24 13 29.7 29.1 41.2
1EWY.pdb A:C ES -7.4 -8.3 spectroscopy 0.8 1502 9 10 15 3 9 12 29.8 37.0 33.2
1EZU.pdb C:AB EI -13.8 -13.1 spectroscopy 1.2 2751 7 9 33 8 28 42 30.6 40.4 29.0
1FC2.pdb C:D OX -10.4 -8.5 stopped-flow 1.7 1307 4 4 9 8 14 11 37.6 35.6 26.8
1FFW.pdb A:B OX -8.1 -5.7 ITC 1.4 1170 7 2 16 0 2 15 21.9 44.4 33.8
1FSK.pdb BC:A AB -13.1 -11.7 SPR 0.5 1623 7 10 22 3 15 25 40.6 35.8 23.6
1GCQ.pdb B:C OX -6.5 -9.0 SPR 0.9 1208 5 7 11 0 12 21 33.3 41.2 25.5
1GLA.pdb G:F ER -6.8 -8.1 spectroscopy 1.0 1304 5 8 13 0 11 9 26.4 38.8 34.8
1GXD.pdb A:C EI -11.7 -12.6 SPR 1.4 2445 16 6 33 0 15 35 30.3 40 29.7
1HE8.pdb B:A OG -7.4 -7.3 stopped-flow 0.9 1305 8 15 6 5 8 6 33.2 33.3 33.5
1I2M.pdb A:B OG -15.8 -14.6 stopped-flow 2.1 2779 20 17 37 7 26 12 30.4 38.6 31.0
1I4D.pdb D:AB OG -7.5 -9.3 ITC 1.4 1657 3 9 9 7 21 18 31.2 35.1 33.6
1J2J.pdb A:B OG -8.1 -7.0 SPR 0.6 1209 3 4 12 1 3 20 31.8 30.5 37.8
1JPS.pdb HL:T A -13.6 -12.9 SPR 0.5 1852 13 16 18 4 20 12 40.8 34.8 24.4
1JTG.pdb B:A EI -12.8 -11.6 SPR 0.5 2600 11 19 34 4 15 25 32.2 40.8 27.0
1JWH.pdb CD:A ER -11.1 -9.5 ITC 1.3 1451 2 5 20 0 10 20 33.3 34.2 32.5
1K5D.pdb AB:C OG -12.8 -11.6 stopped-flow 1.2 2527 16 15 24 6 21 27 25.9 33.6 40.5
1KAC.pdb A:B OR -10.7 -11.9 SPR 0.9 1456 7 5 13 2 18 12 42.9 37.7 19.5
1KKL.pdb ABC:H ES -10 -9.1 SPR 2.2 1641 6 9 16 4 14 22 31.2 38.1 30.7
1KTZ.pdb A:B OR -8.9 -8.1 SPR 0.4 989 5 6 7 0 5 12 38.2 33.9 27.9
1KXQ.pdb H:A AB -11.5 -11.8 SPR 0.7 2172 7 9 18 12 25 23 40.5 34.5 25
1LFD.pdb B:A OG -7.8 -8.4 stopped-flow 1.8 1167 9 8 9 3 9 11 32.5 29.3 38.2
1M10.pdb A:B ER -11.2 -10.5 SPR 2.1 2097 16 24 24 2 8 12 35.1 36.3 28.6
1MLC.pdb AB:E NC -9.6 -10.3 SPR 0.6 1392 5 8 15 13 18 8 45.6 33.0 21.4
1MQ8.pdb A:B OX -7.5 -8.2 SPR 1.8 1241 3 11 10 6 15 11 31.8 37.1 31.1
1P2C.pdb AB:C A -13.6 -11.4 SPR 0.5 1456 5 8 17 13 22 10 45.1 32.7 22.3
1PPE.pdb E:I EI -15.6 -14.7 spectroscopy 0.3 1688 5 10 28 0 20 24 47.1 34.9 18.0
1PVH.pdb A:B OR -9.5 -10.9 ITC 0.3 1403 5 8 8 4 21 11 38.1 35.1 26.8
1QA9.pdb A:B OX -7.2 -9.6 SPR 0.7 1353 21 10 14 0 2 3 35.2 25.8 39.0
1R0R.pdb E:I EI -14.2 -11.3 spectroscopy 0.5 1409 1 13 8 6 21 31 46.7 37.1 16.2
1S1Q.pdb A:B OX -4.3 -9.0 SPR 1.0 1288 1 4 11 6 16 16 35.7 35.7 28.6
1US7.pdb A:B ER -8.1 -6.8 ITC 1.1 1106 6 6 10 1 5 14 27.6 33.8 38.6
1VFB.pdb AB:C A -11.5 -11.4 ITC 1.0 1383 2 8 20 3 13 14 45.7 31.9 22.4
1WEJ.pdb HL:F A -12.5 -9.4 spectroscopy 0.3 1177 10 9 18 2 4 8 41.3 34.5 24.3
1XQS.pdb A:C OX -7.1 -11.2 SPR 1.8 2350 24 18 24 7 15 3 29.7 35.3 35.0
1XU1.pdb ABD:T OR -11.2 -9.1 SPR 1.3 1700 8 12 21 3 10 16 31.8 39.2 29.1
1Z0K.pdb A:B OG -7 -11.7 SPR 0.5 1787 5 7 14 2 21 27 34.8 30.4 34.8
1ZHI.pdb A:B OX -9.1 -8.7 ITC 0.7 1322 5 4 18 2 12 12 28.0 38.3 33.7
2A9K.pdb A:B ES -10.2 -11.5 ITC 0.8 1751 4 15 23 6 23 14 30.1 31.9 38.0
2ABZ.pdb B:E EI -11.7 -12.0 spectroscopy 0.9 1447 4 2 18 4 18 25 44.0 34.6 21.4
2AJF.pdb A:E OR -10.6 -10.8 SPR 0.7 1704 1 8 18 5 19 10 37.7 36.0 26.3
2AQ3.pdb A:B NC -6.7 -7.4 ITC 1.8 1105 2 7 7 6 7 15 39.4 30.7 29.9
2C0L.pdb A:B OX -9.8 -11.2 ITC 2.6 2013 9 9 13 3 25 28 27.8 40.3 31.9
2FJU.pdb B:A OG -7.2 -5.9 SPR 1.0 1245 3 4 6 3 6 18 29.0 39.3 31.8
2GOX.pdb A:B OX -12.1 -11.3 SPR 0.6 1631 13 6 17 3 19 6 30.5 35.6 33.9
2HQS.pdb A:H OX -10.2 -12.1 ITC 1.1 2333 10 21 37 5 17 19 33.2 43.9 22.9
2HRK.pdb A:B OX -11 -8.7 SPR 2.0 1595 8 11 14 1 9 13 32.5 40.3 27.3
2I25.pdb N:L A -12.3 -12.0 SPR 1.2 1425 7 7 17 2 18 13 39.1 31.8 29.1
2MTA.pdb HL:A ES -7.4 -7.8 spectroscopy 0.4 1461 3 3 21 2 11 24 24.5 45.8 29.7
2NYZ.pdb AB:D OR -12.7 -12.7 SPR 2.1 2160 6 10 18 5 25 20 38.5 36.4 25.1
2OOB.pdb A:B ES -5.7 -6.2 ITC 0.8 808 3 3 13 0 2 15 27.2 34.8 38.0
2PCB.pdb A:B NC -6.8 -7.4 ITC 0.5 1029 11 4 7 3 7 1 30.9 35.4 33.8
2PCC.pdb A:B ES -7.9 -8.1 ITC 0.4 1141 8 6 5 2 10 8 32.8 34.7 32.5
2TGP.pdb Z:I NC -7.5 -12.3 spectroscopy 0.6 1432 4 9 28 1 14 20 43.0 38.6 18.4
2VIR.pdb AB:C A -12.3 -10.9 SPR 0.8 1263 0 5 8 9 25 12 43.2 38.0 18.8
2VIS.pdb AB:C NC -7.4 -11.0 SPR 0.8 1296 0 5 8 7 23 16 43.7 37.9 18.4
2WPT.pdb A:B NC -10.7 -11.9 stopped-flow 1.6 1581 8 14 12 2 28 10 23.1 27.2 49.7
3BZD.pdb A:B OX -9.6 -9.4 ITC 1.1 1312 4 7 6 7 15 12 41.0 29.5 29.5
3CPH.pdb G:A OG -8.8 -9.3 ITC 2.1 1685 6 7 14 2 15 19 28.8 35.7 35.5
3SGB.pdb E:I EI -14.5 -13.1 spectroscopy 0.4 1268 3 5 15 2 20 20 49.4 36.0 14.5
4CPA.pdb A:I EI -11.3 -10.1 spectroscopy 1.5 1177 0 3 16 5 10 19 48.0 31.7 20.4

Table information:
(a) PDB= PDB entry; Chains=Interacting chains (AB:C to represent a complex where chains A and B make up component 1, chain C, component 2); Type=Functional class of the complex; ΔG_exp= Experimental measured ΔG Method=Experimental technique used to measure the ΔG i_RMDS=root-mean-square displacement of the C-alpha atoms of interface residues of the two partners after the unbound and the bound interfaces have been superimposed by least-square; BSABuried Surface Area upon complex formation; ICs_XX=Number of ICs divided by their charged(C)/polar(P)/apolar(A) character; NIS_X=Percentage of the charged(C), polar(P) and apolar(A) NIS (Non-Interacting Surface) residues; ΔG_pred=Values of ΔG predicted by PRODIGY.
(b) Functional classes: A antigen/antibody; AB antigen/antibody with bound antibody; EI enzyme/inhibitor; ES enzyme/substrate; ER enzyme complex with a regulatory or accessory chain ; OG G-protein containing ; OR receptor containing ; OX miscellaneous ; NC non-cognate.
(c)Experimental methods: ITC Isotermal Titration Calorimetry; SPR Surface Plasmon Resonance.

For further information please refer to Vangone and Bonvin, eLife (2015).

Download the dataset at this link (.pdb files compressed in a .tgz file)

Please note that the provided PDB files have been 'cleaned' according to the folllwing points:
  • All the HETATM entries have been removed from the .pdb file
  • The chains have been renamed as A for the receptor (i.e. the biggest protein in the complex) and B for the ligand. In the case of multiple chains for either the receptor and/or the ligand in the original file, a shift of +500 for each chain has been applied to the numbering to avoid overlap in the residues numbering.
  • Residue with double occupancy were removed
  • The hydrogens have been removed

Back to PRODIGY home page

Go to PRODIGY prediction model

In the following table experimental binding affinity (reported in kcal/mol) and the PRODIGY-LIG predictions have been reported for 124 cases with known experimental Ki in the PDBbind dataset (http://www.pdbbind.org.cn/).

Download the table here (.csv files)
PDB ΔG_pred ΔG_score ΔG_noelec ΔG_exp
1e66 -9.0 87.8 -9.3 -13.5
1f8b -8.5 139.8 -7.7 -7.4
1f8c -9.4 103.4 -7.1 -10.1
1f8d -8.9 117.6 -7.0 -4.6
1gpk -9.0 90.7 -8.6 -7.3
1h23 -12.2 12.0 -11.1 -11.4
1hfs -12.8 10.1 -13.8 -11.9
1hnn -9.8 69.0 -7.2 -8.5
1lol -8.8 125.3 -6.3 -8.7
1loq -9.6 102.0 -6.5 -5.0
1lor -10.1 108.4 -7.0 -15.1
1mq6 -8.9 102.1 -9.1 -15.2
1n1m -8.9 83.7 -6.8 -7.8
1n2v -6.4 165.1 -6.0 -5.6
1nvq -10.8 57.5 -10.4 -11.3
1o3f -8.1 114.0 -8.2 -10.9
1o5b -7.6 119.5 -7.6 -7.9
1oyt -10.3 56.5 -8.9 -9.9
1ps3 -7.1 167.7 -6.7 -3.1
1qi0 -6.4 192.7 -6.8 -3.2
1r5y -6.4 158.7 -6.2 -8.8
1sln -9.7 80.9 -8.9 -9.1
1sqa -9.2 76.8 -8.1 -12.6
1u33 -7.8 183.0 -8.5 -6.3
1vso -9.8 85.4 -7.2 -6.4
1w3k -6.5 198.4 -7.0 -5.9
1w3l -7.1 200.8 -7.7 -8.6
1w4o -8.8 113.1 -6.5 -7.1
1xd0 -11.2 126.8 -11.4 -9.7
1z95 -13.5 -6.2 -12.9 -9.7
2brb -7.8 130.9 -8.5 -6.6
2cbj -7.0 175.1 -7.3 -11.3
2d1o -9.1 95.5 -9.0 -10.5
2d3u -8.6 109.3 -8.9 -9.4
2fvd -8.5 113.8 -8.3 -11.6
2g70 -9.0 100.1 -7.3 -10.6
2gss -7.9 130.5 -7.8 -6.7
2hb1 -10.5 63.8 -7.2 -5.2
2j62 -9.0 113.7 -9.3 -15.5
2obf -9.8 82.3 -9.2 -12.1
2ole -11.6 14.9 -9.0 -9.9
2p4y -13.0 11.3 -12.5 -12.3
2pq9 -22.8 -235.7 -9.4 -11.1
2qbp -13.3 -1.6 -10.3 -11.5
2qbr -12.0 24.8 -9.4 -8.6
2qft -10.3 71.3 -5.7 -7.2
2qmj -9.9 138.0 -9.4 -5.7
2vl4 -7.8 133.7 -6.6 -8.2
2vo5 -9.5 91.1 -8.5 -6.7
2vot -7.7 143.9 -8.0 -9.7
2w66 -7.3 157.2 -7.0 -5.5
2wbg -8.7 122.0 -8.0 -6.1
2wca -7.7 151.6 -7.7 -7.6
2wtv -9.7 72.2 -10.1 -11.9
2x0y -6.8 167.7 -7.0 -6.3
2x8z -8.8 99.5 -7.2 -10.9
2x97 -9.6 107.2 -9.2 -7.7
2xb8 -9.2 123.0 -10.1 -10.3
2xhm -9.9 101.9 -10.0 -9.3
2xnb -8.6 98.1 -8.0 -9.3
2xy9 -13.7 10.1 -13.5 -12.5
2xys -9.2 93.3 -9.3 -10.1
2y5h -9.4 90.1 -9.5 -7.9
2yfe -9.9 81.1 -9.6 -9.0
2zcq -12.6 13.2 -9.2 -12.0
2zcr -12.2 23.9 -9.9 -9.4
2zjw -8.0 159.9 -9.0 -10.5
2zx6 -9.5 87.6 -8.4 -14.5
2zxd -7.6 136.8 -7.1 -7.1
3acw -9.1 90.1 -9.4 -6.5
3b3s -6.3 160.0 -6.4 -3.5
3b3w -7.3 136.7 -6.5 -5.7
3b68 -11.9 47.3 -11.9 -11.5
3bfu -9.3 85.5 -6.2 -8.6
3bkk -12.2 29.2 -11.7 -8.3
3cyx -11.5 42.7 -11.4 -10.9
3d4z -6.5 175.1 -6.8 -6.7
3dd0 -9.4 81.2 -6.3 -12.3
3dxg -7.5 148.7 -6.4 -3.3
3E+93 -10.4 66.3 -10.9 -12.1
3ebp -8.5 118.5 -9.2 -8.1
3ehy -7.3 154.2 -7.3 -8.0
3ejr -10.0 78.7 -8.7 -11.7
3f3a -7.4 134.4 -7.6 -5.7
3f3c -8.8 101.7 -7.7 -8.2
3fcq -6.7 170.3 -7.5 -3.8
3fk1 -10.1 76.2 -5.6 -3.6
3fv1 -11.7 49.0 -8.0 -12.7
3g0w -10.9 47.7 -10.1 -13.0
3g2n -11.7 46.3 -8.6 -5.6
3g2z -6.0 162.9 -4.2 -3.2
3gbb -10.4 80.0 -7.8 -9.4
3ge7 -8.5 99.1 -7.6 -11.9
3i3b -6.7 189.1 -7.0 -3.0
3kgp -8.3 108.7 -7.1 -3.5
3kwa -8.9 78.1 -6.6 -5.6
3l3n -12.7 2.1 -10.7 -11.2
3l4u -9.4 117.1 -7.8 -10.3
3l4w -8.2 132.7 -7.1 -8.2
3l7b -6.6 181.8 -6.3 -3.3
3mfv -14.5 -79.0 -5.4 -3.4
3muz -7.0 174.4 -7.6 -4.7
3n7a -6.6 180.6 -7.4 -5.0
3n86 -9.2 115.2 -9.5 -7.7
3nox -9.1 87.0 -8.5 -11.8
3nw9 -9.2 113.4 -9.1 -12.3
3oe5 -9.1 120.6 -9.0 -9.4
3owj -7.7 129.9 -8.8 -8.3
3ozt -9.3 118.4 -8.6 -5.6
3pe2 -8.8 99.2 -8.5 -13.3
3pww -11.5 44.9 -11.4 -10.0
3su2 -9.0 116.1 -9.7 -10.0
3su3 -8.9 117.2 -9.7 -12.5
3su5 -8.9 115.7 -9.8 -7.6
3u9q -8.4 108.1 -7.3 -6.0
3utu -9.9 80.2 -9.1 -14.9
3vd4 -7.0 173.1 -7.7 -6.6
3vh9 -6.8 149.7 -7.2 -8.5
4de1 -7.1 135.2 -5.4 -8.1
4de2 -7.9 115.2 -6.1 -5.6
4djr -9.8 97.1 -10.1 -15.7
4djv -10.3 58.5 -9.9 -9.2
4gid -11.9 42.4 -11.5 -14.7
4gqq -6.6 166.6 -6.9 -3.9
10gs -11.1 43.3 -8.8 -8.7
1a30 -6.9 157.4 -6.3 -5.9
1jyq -15.8 -77.2 -9.0 -11.9
1os0 -10.3 74.4 -10.1 -8.2
1zea -5.6 174.4 -5.6 -7.1
3ag9 -10.3 71.9 -8.7 -11.0
3ov1 -5.4 180.9 -5.6 -7.1
3s8o -11.4 35.8 -7.6 -9.3
3uri -13.3 3.1 -11.2 -12.3
4tmn -9.3 116.2 -9.7 -13.9

Table information:
PDB: PDB code entry; ΔG_pred/ΔG_score and ΔG_noelec are the predicted values according to the PRODIGY-LIG protocols (see “Method” page for further information). ΔG_exp: is the experimentally-measured binding affinity (originally measured as Ki). All the ΔG values reported in the table (pred/score/noelec/exp) are reported in kcal/mol.

All the HADDOCK-refined models used for testing and training are available for free download on the SBGrid Data repository:

https://data.sbgrid.org/dataset/566/

Proudly powered by:
EOSC-hub
EOSC
BioExcel
EGI
EGI-ACE
NWO
This work is co-funded by the Horizon 2020 projects EOSC-hub and EGI-ACE (grant numbers 777536 and 101017567), BioExcel (grant numbers 823830, 675728 and 101093290)
and by a computing grant from NWO-ENW (project number 2019.053).