On the relationship between cumulative correlation coefficients and the quality of crystallographic data sets

Protein Sci. 2017 Dec;26(12):2410-2416. doi: 10.1002/pro.3314. Epub 2017 Oct 27.

Abstract

In 2012, Karplus and Diederichs demonstrated that the Pearson correlation coefficient CC1/2 is a far better indicator of the quality and resolution of crystallographic data sets than more traditional measures like merging R-factor or signal-to-noise ratio. More specifically, they proposed that CC1/2 be computed for data sets in thin shells of increasing resolution so that the resolution dependence of that quantity can be examined. Recently, however, the CC1/2 values of entire data sets, i.e., cumulative correlation coefficients, have been used as a measure of data quality. Here, we show that the difference in cumulative CC1/2 value between a data set that has been accurately measured and a data set that has not is likely to be small. Furthermore, structures obtained by molecular replacement from poorly measured data sets are likely to suffer from extreme model bias.

Keywords: CC1/2; PSII; X-ray free-electron laser; cumulative correlation coefficients; femtosecond serial crystallography; model bias; photosystem II.

MeSH terms

  • Crystallography, X-Ray / methods*
  • Databases, Factual*
  • Models, Molecular
  • Photosystem II Protein Complex / chemistry
  • Signal-To-Noise Ratio

Substances

  • Photosystem II Protein Complex