Benchmarking predictive methods for small-angle X-ray scattering from atomic coordinates of proteins using maximum likelihood consensus data

Trewhella J, Vachette P, Larsen A. IUCrJ 11(5) (2024)

SASDUE4 – Lysozyme Updated Consensus SAXS Data

Lysozyme C
MW (experimental)   14 kDa
MW (expected)   14 kDa
[Figure panels for Lysozyme C: small-angle scattering data, log I(s) versus s (nm^-1); Guinier plot, ln I(s) versus s^2, Rg: 1.5 nm; Kratky plot versus sRg; pair distance distribution function, Rg: 1.5 nm, Dmax: 4.7 nm]

Fits and models

[Four model-fit panels for Lysozyme C, each showing log I(s) versus s (nm^-1) for a PDB (Protein Data Bank) model]

Updated consensus SAXS profiles for lysozyme in 50 mM sodium citrate, pH 4.5, 150 mM NaCl were generated using the ML-SAScombine tool (with log-s binning). A total of 12 independent batch SAXS profiles contributed from 5 SAXS beamlines were combined. Protein concentrations for the batch measurements ranged from 1 to 6 mg/mL. The lysozyme atomistic model used for the CRYSOL, Pepsi-SAXS and FoXS calculations is PDB ID 2VB1 with small-molecule crystallisation agents removed. Custom WAXSiS calculations used the same coordinates, with explicit waters and ions added for the MD simulations to match the experimental conditions. WAXSiS calculations include statistical errors, so the error weighting for residual differences is the square root of the sum of the squares of the experimental and WAXSiS statistical errors. Model fits are shown in order (top to bottom): CRYSOL (classic directional hydration layer), FoXS, Pepsi-SAXS and custom WAXSiS. The three model fits with an implicit hydration layer (CRYSOL, FoXS and Pepsi-SAXS) are to data on a log s-scale, while for custom WAXSiS the data are on a linear s-scale. The unusually good statistics of the consensus SAXS data generally give rise to large χ-square values for the model fits.
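The error weighting described for the WAXSiS fits can be illustrated with a minimal sketch. It assumes that the model intensities are already scaled to the data and sampled on the same s-grid, and that the reduced χ-square is normalised by the number of points minus the number of fitted parameters. The function names and the normalisation convention are illustrative assumptions, not taken from this entry or from any of the tools named above.

```python
import math

def combined_sigma(sigma_exp, sigma_model):
    """Combine experimental and model statistical errors in quadrature,
    point by point: sqrt(sigma_exp^2 + sigma_model^2)."""
    return [math.sqrt(se ** 2 + sm ** 2) for se, sm in zip(sigma_exp, sigma_model)]

def weighted_residuals(i_exp, i_model, sigma):
    """Error-weighted residuals (I_exp - I_model) / sigma for each s point."""
    return [(ie - im) / s for ie, im, s in zip(i_exp, i_model, sigma)]

def reduced_chi_square(residuals, n_params=0):
    """Sum of squared weighted residuals divided by (N - n_params).
    The degrees-of-freedom convention here is an assumption."""
    return sum(r * r for r in residuals) / (len(residuals) - n_params)
```

For a model without statistical errors (the implicit hydration layer case), the experimental errors alone would be passed as `sigma`. Because the residuals are divided by the errors, halving every error while keeping the same intensity differences multiplies χ-square by four, which is consistent with the remark that very precise consensus data give large χ-square values.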

Additional data and information are available in the full-entry zip archive and include: i) the input data for ML-SAScombine; ii) the runscripts used with ML-SAScombine; iii) output files for the updated consensus profiles from ML-SAScombine with log and linear s-binning; iv) output files for the combined SEC-SAS data from ML-SAScombine with log s-binning; and v) the original custom WAXSiS model fits, with errors, together with the consensus data on the same s-grid.

Lysozyme C
Mol. type   Protein
Organism   Gallus gallus
Olig. state   Monomer
Mon. MW   14.3 kDa
UniProt   P00698 (19-147)