Prediction of sensitivity to gefitinib/erlotinib for EGFR mutations in NSCLC based on structural interaction fingerprints and multilinear principal component analysis

BMC Bioinformatics. 2018 Mar 7;19(1):88. doi: 10.1186/s12859-018-2093-6.

Abstract

Background: Non-small cell lung cancer (NSCLC) with activating EGFR mutations, especially exon 19 deletions and the L858R point mutation, is particularly responsive to gefitinib and erlotinib. However, the sensitivity varies for less common and rare EGFR mutations. There are various explanations for the low sensitivity of EGFR exon 20 insertions and the exon 20 T790 M point mutation to gefitinib/erlotinib. However, few studies discuss, from a structural perspective, why less common mutations, like G719X and L861Q, have moderate sensitivity to gefitinib/erlotinib.

Results: To decode the drug sensitivity/selectivity of EGFR mutants, it is important to analyze the interaction between EGFR mutants and EGFR inhibitors. In this paper, the 30 most common EGFR mutants were selected and the technique of protein-ligand interaction fingerprint (IFP) was applied to analyze and compare the binding modes of EGFR mutant-gefitinib/erlotinib complexes. Molecular dynamics simulations were employed to obtain the dynamic trajectory and a matrix of IFPs for each EGFR mutant-inhibitor complex. Multilinear Principal Component Analysis (MPCA) was applied for dimensionality reduction and feature selection. The selected features were further analyzed for use as a drug sensitivity predictor. The results showed that the accuracy of prediction of drug sensitivity was very high for both gefitinib and erlotinib. Targeted Projection Pursuit (TPP) was used to show that the data points can be easily separated based on their sensitivities to gefetinib/erlotinib.

Conclusions: We can conclude that the IFP features of EGFR mutant-TKI complexes and the MPCA-based tensor object feature extraction are useful to predict the drug sensitivity of EGFR mutants. The findings provide new insights for studying and predicting drug resistance/sensitivity of EGFR mutations in NSCLC and can be beneficial to the design of future targeted therapies and innovative drug discovery.

Keywords: Epidermal growth factor receptor mutation; Interaction fingerprints; Molecular dynamics simulations; Multilinear principal component analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Carcinoma, Non-Small-Cell Lung / drug therapy*
  • Carcinoma, Non-Small-Cell Lung / genetics*
  • Computer Simulation
  • ErbB Receptors / genetics*
  • Erlotinib Hydrochloride / therapeutic use*
  • Exons / genetics
  • Gefitinib
  • Humans
  • Lung Neoplasms / drug therapy
  • Lung Neoplasms / genetics*
  • Mutation / genetics*
  • Principal Component Analysis*
  • Protein Kinase Inhibitors / pharmacology
  • Protein Kinase Inhibitors / therapeutic use
  • Quinazolines / therapeutic use*

Substances

  • Protein Kinase Inhibitors
  • Quinazolines
  • Erlotinib Hydrochloride
  • EGFR protein, human
  • ErbB Receptors
  • Gefitinib