Advanced gastrointestinal stromal tumor: reliable classification of imatinib plasma trough concentration via machine learning

Ran, Pan; Tan, Tao; Li, Jinjin; Yang, Hao; Li, Juan; Zhang, Jun

doi:10.1186/s12885-024-11930-6

Research
Open access
Published: 24 February 2024

Advanced gastrointestinal stromal tumor: reliable classification of imatinib plasma trough concentration via machine learning

Pan Ran¹,
Tao Tan¹,
Jinjin Li¹,
Hao Yang²,
Juan Li³ &
…
Jun Zhang¹

BMC Cancer volume 24, Article number: 264 (2024) Cite this article

481 Accesses
Metrics details

Abstract

Aim

Patients with advanced gastrointestinal stromal tumors (GISTs) exhibiting an imatinib plasma trough concentration (IM C_min) under 1100 ng/ml may show a reduced drug response rate, leading to the suggestion of monitoring for IM C_min. Consequently, the objective of this research was to create a customized IM C_min classification model for patients with advanced GISTs from China.

Methods

Initial data and laboratory indicators from patients with advanced GISTs were gathered, and the above information was segmented into a training set, validation set, and testing set in a 6:2:2 ratio. Key variables associated with IM C_min were identified to construct the classification model using the least absolute shrinkage and selection operator (LASSO) regression and forward stepwise binary logistic regression. Within the training and validation sets, nine ML classification models were constructed via the resampling method and underwent comparison through the Brier scores, the areas under the receiver-operating characteristic curve (AUROC), the decision curve, and the precision-recall (AUPR) curve to determine the most suitable model for this dataset. Two methods of internal validation were used to assess the most suitable model's classification performance: tenfold cross-validation and random split-sample validation (test set), and the value of the test set AUROC was used to evaluate the model's classification performance.

Results

Six key variables (gender, daily IM dose, metastatic site, red blood cell count, platelet count, and percentage of neutrophils) were ultimately selected to construct the classification model. In the validation set, it is found by comparison that the Extreme Gradient Boosting (XGBoost) model has the largest AUROC, the lowest Brier score, the largest area under the decision curve, and the largest AUPR value. Furthermore, as evaluated via internal verification, it also performed well in the test set (AUROC = 0.725).

Conclusion

For patients with advanced GISTs who receive IM, initial data and laboratory indicators could be used to accurately estimate whether the IM C_min is below 1100 ng/ml. The XGBoost model may stand a chance to assist clinicians in directing the administration of IM.

Peer Review reports

Introduction

Gastrointestinal stromal tumors (GISTs) are the most common mesenchymal tumors of the digestive tract [1]. Acquired functional mutations in the tyrosine-protein kinase growth factor receptor proto-oncogene (KIT) and platelet-derived growth factor-alpha gene lead to increased tyrosine kinase activity, which is considered a key factor in the pathogenesis of GIST [1,2,3]. Imatinib (IM), a tyrosine kinase inhibitor (TKI), blocks KIT receptor activity and has become the conventional first-line therapy for patients with advanced GISTs [4], which inhibits proliferation and promotes apoptosis of GIST cells [4,5,6]. Therefore, the IM plasma trough concentration (C_min) is intimately linked to the effectiveness of treatment [7].

The IM C_min of patients with advanced GISTs below 1100 ng/mL showed a shorter time to progression, according to a prior study by Demetri et al. [8]. Meanwhile, marked inter-individual variability in IM pharmacokinetics between subjects has been observed [9,10,11], leading to the suggestion of monitoring for IM C_min [12]. However, the absence of a therapeutic drug monitoring (TDM) platform in certain hospitals is due to restricted health conditions, making the sampling and examination of TDM for IM expensive both temporally and financially. Thus, there is a need for more convenient concentration classification tools than TDM, such as rapidly developing machine learning (ML) methods [13, 14], which can provide a reference for clinicians to make clinical decisions, thus reducing the cost of time and money for patients.

ML has an irreplaceable position in data analysis and can help promote data-driven estimation when predicting from multiple variables and capturing non-linear variable relations to construct a model with high classification performance [15, 16]. Therefore, this study aimed to streamline the process of IM C_min monitoring using the ML model based on patients’ initial data (demographic, treatment, and clinical information) and laboratory indicators.

Materials and methods

Patients and data

Demographic information of patients with advanced GISTs who were followed up at the First Affiliated Hospital of Chongqing Medical University (CMU) between January 2000 and August 2023 was gathered retrospectively. Meanwhile, IM C_min data, treatment information, clinical information, and laboratory indicators were collected in the same patient with advanced GIST from April 2017 to August 2023. For patients with advanced GISTs, our team generally recommends that patients go to the GIST specialist clinic for follow-up every 3 or 6 months or so for an abdomen ultrasound or CT examination, to observe the tumor situation and monitor the IM C_min simultaneously. It is worth noting that blood samples were collected and separated for routine blood, liver, and kidney function examinations from patients with GISTs on the same day the venous blood samples were collected to determine IM C_min. The inclusion criteria were as follows: (1) verification of GIST through biopsy or postoperative pathology, (2) age over 18 years, (3) good medication adherence with IM, (4) less than 8% missing data, and (5) had been taking IM ≥ 1 month. The exclusion criteria were as follows: (1) patients with GISTs who had undergone complete tumor resection and had no recurrence of the tumor at the end of follow-up, (2) history or existence of other malignancies, (3) patients with missing IM C_min data, and (4) patients lost to follow-up. The case screening flowchart and the schematic representation of the study design are displayed in Fig. 1.

We established a database called “Weinichangzai”, which included initial data for each patient, and all patients’ initial data was obtained through the GIST specialist outpatient clinic, telephone calls, WeChat, and other interaction tools. Initial data included demographic information (including age at diagnosis, age at blood sampling, gender, and residence); treatment information (including surgical procedures (1, gastrectomy; 2, non-gastric operation) [17, 18] and daily IM dose); and clinical information (including expression of DOG-1/CD117/CD34, metastatic site (1, liver; 2, non-liver) [17, 18], and primary tumor site).

Determination of IM C_min

All patients with advanced GISTs were advised to take IM at lunchtime, and a 3 ml venous blood sample was collected in a heparinized vial on the second day (generally 24 ± 3 h following the previous dose) before taking an IM dose. Blood samples were centrifuged at room temperature for 5 min at 3000 g. A protocol was established for determining IM C_min according to the method described by Tan et al. [19] and Roth et al. [20]. Plasma samples were pretreated by protein precipitation. We added 100 μl of acetonitrile and 50 μl (50%) of perchloric acid successively to plasma (0.5 ml), swirled the solution for 20 s, and centrifuged it at 10,800 rpm for 15 min. We added 50 μl of neutralizing solution (containing 1.4 g potassium carbonate and 0.65 g potassium chloride dissolved in 5 ml of purified water) to 400 μl of the supernatant, and the mixture was vortexed thoroughly before submerging for 30 min at 4 °C. Thirty microliters of the supernatant was injected into a high-performance liquid chromatography system. The lower limit of quantification was set at 50 ng/ml.

Laboratory indicators

Blood samples were collected from patients with advanced GISTs, and separated for routine blood, liver, and kidney function examinations. Routine blood examinations included the white blood cell count (WBC), platelet count (PLT), percentage of neutrophils (NEU%), red blood cell count (RBC), hemoglobin (HB), and percentage of lymphocytes (LYM%). Liver function examinations included alanine aminotransferase (ALT), aspartate aminotransferase (AST), total bilirubin (TBIL), direct bilirubin (DBIL), indirect bilirubin (IBIL), gamma-glutamyl transpeptidase (GGT), and alkaline phosphatase (AKP). Kidney function examinations included creatinine (Cr), urea nitrogen (BUN), and the estimated glomerular filtration rate (eGFR).

Method of feature selection

We collected 26 candidates from demographic information, treatment information, clinical information, and laboratory indicators. To obtain the best predictive performance, the variable selection was performed on 26 candidates using the LASSO (least absolute shrinkage and selection operator) regression with tenfold cross-validation, which could compress the variable coefficients to prevent overfitting and solve severe collinearity problems [21, 22]. LASSO regression analyses were performed using “Extreme Smart Analysis” (www.xsmartanalysis.com). To further control the influence of confounding factors, variables selected by LASSO regression were analyzed by forward stepwise binary logistic regression (LR) to obtain key variables. LR analyses were performed using SPSS version 27.0 (IBM Corp, Armonk, NY, USA).

Selection method of ML algorithm

In this study, we randomly divided the dataset into three sets: the training (60%) and validation sets (20%) for ML model development and the test set (20%) for performance evaluation. The randomization's success was determined by comparing baseline characteristics in each group. Nine types of ML algorithms were used to construct the classification models in this study: Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Random Forest (RF), Gaussian Naive Bayes (GNB), Complement Naive Bayes (CNB), Multilayer Perceptron (MLP), Support Vector Machine (SVM), K-Nearest Neighbour (KNN), and Adaptive Boost (AdaBoost). All analyses were performed using “Extreme Smart Analysis”, which can also select the best-performing hyper-parameters using the grid-search method.

Within the training and validation sets, nine ML classification models were constructed via the resampling method and underwent comparison through the Brier scores, the areas under the receiver-operating characteristic curve (AUROC), the decision curve, and the precision-recall (AUPR) curve to determine the most suitable model for this dataset, which were important indicators that can be used to evaluate classification models. Two methods of internal validation were used to assess the most suitable model's classification performance: tenfold cross-validation and random split-sample validation (test set). To explain the model predictions, we used Shapley Additive Explanations (SHAP) of “Extreme Smart Analysis” to calculate the Shapley values of the test set. SHAP values are based on Shapley values in cooperative game theory to proceed with the best explanation of the output of our machine-learning model [23].

Statistical analysis

Continuous variables (non-normal distribution) are described using median and interquartile range (IQR) values, and categorical variables are presented as frequencies (percentages). The Mann–Whitney U-test (non-normal distribution) was used to assess the differences in continuous variables between the training, validation, and test sets. Categorical variables were compared between the training, validation, and test sets using the Pearson chi-square test, and Fisher exact test. Statistical significance was set at p < 0.05. All P values were calculated as two-tailed. All analyses were performed using SPSS version 27.0 (IBM Corp, Armonk, NY, USA).

Results

Baseline characteristics

In total, 212 patients with advanced GISTs, based on the inclusion and exclusion criteria, were included, of whom 890 IM C_min data were collected. Missing data were filled by imputing the data via the RF algorithm [24]. The mean value of IM C_min, the label variable, was 1469.59 ng/mL, with a standard deviation (SD) value of 755.71 ng/ml. In this dataset, 31.24% of IM C_min values were < 1100 ng/ml. More than half were males (59.10%). The mean age at diagnosis was 56 years, and 18.54% of this dataset underwent gastrectomy. The daily IM dose in this dataset (76.29%) was 400 mg/day. The comparison of baseline characteristics between the test set (20%) and training and validation sets (80%) is shown in Table 1, without any statistically significant differences in the variables between the two groups (p > 0.05). The comparison of baseline characteristics between “IM C_min ≤ 1100 ng/ml” and “IM C_min > 1100 ng/ml” was shown in Table 2, significant differences were observed between the groups according to age at diagnosis, age at blood sampling, gender, daily IM dose, metastatic site, NEU%, RBC, HB, LYM%, ALT, TBIL, IBIL, GGT, Cr, BUN, and eGFR (p < 0.05).

Table 1 Baseline characteristics of 890 IM C_min data from 212 patients with advanced GISTs

Full size table

Table 2 Comparison of baseline characteristics between “IM C_min ≤ 1100 ng/ml” and “IM C_min > 1100 ng/ml”

Full size table

Key variables

In the training and validation sets, the 26 candidates underwent a tenfold cross-validation LASSO regression analysis (Fig. 2A and B). The results showed that the optimal parameter λ (λ = 0.018) in the LASSO regression analysis with the smallest mean square error, which reduced the 26 candidates to 9 feature variables, including daily IM dose, Metastatic site, Gender, PLT, NEU%, RBC, HB, LYM%, and age at diagnosis. To address potential confounding factors, the binary LR was used to analyze the above 9 feature variables via the forward-stepwise method. Finally, only daily IM dose, Metastatic site, Gender, PLT, NEU%, and RBC were determined as key variables (p < 0.05), as shown in Table 3.

Table 3 Forward stepwise binary logistic regression analysis

Full size table

The best model building

Following the identification of these six key variables, XGBoost, LightGBM, RF, GNB, CNB, MLP, SVM, KNN, and AdaBoost were trained and applied the resampling method by resampling 10 times. As shown in Fig. 3A and B, RF and KNN (ranked according to AUROC) had the best performance in the training set, but XGBoost (ranked according to AUROC) had the largest AUROC and shortest SD in the validation set, indicating the best stability of this model. When the Brier scores for the nine aforementioned ML models were compared, that of XGBoost was the lowest, indicating that its prediction calibration was the best (Brier scores = 0.193, Fig. 3C). XGBoost model reveals the largest area under the decision curve, indicating a better clinical utility than other models (Fig. 3D). The PR curve is sensitive to data imbalance, and it changes dramatically as the ratio of positive to negative samples changes [25]. As we know, the larger the AUPR, the higher the average precision of the model. Although in the training set, RF and KNN had the largest value of AUPR (Fig. 3E), in the validation set, the PR curve area of the XGBoost model was the largest (AUPR = 0.842) (Fig. 3F). Based on the above results, the XGBoost model may be the optimal model choice for this dataset, rather than the RF and KNN models, which may overfit data.

The best model evaluation

The XGBoost ML algorithm analysis and tenfold cross-validation were performed on the dataset. According to the findings, the training set's average AUROC was 0.881 (0.873–0.890, Fig. 4A), the validation set's average AUROC was 0.699 (0.614–0.782, Fig. 4B), and the testing set's AUROC was 0.725 (Fig. 4C). If the validation set's AUROC is lower than the test set's, the model fitting could be considered successful, indicating that the model has good generalization [26]. Meanwhile, as shown in Fig. 4D, the learning curve revealed that the training and validation sets were well-fitting and stable [26,27,28]. As a result, the above results revealed that the XGBoost algorithm might be employed for this dataset's classification modeling purpose.

The SHAP analyzes the entire test set, visually explaining the impact of six key variables on the XGBoost model. Furthermore, in the SHAP analysis of the XGBoost model, the color represents the value of the variable, red pixels symbolize positive SHAP values enhancing class likelihood, while blue pixels denote negative SHAP values reducing class probability (Fig. 5A). The bar chart shows the relationship between the magnitude of the feature value and the predicted impact (Fig. 5B).

Discussion

Demetri et al. [8] previously reported that advanced GIST patients with IM C_min < 1100 ng/mL had a shorter progression-free time in 2009. Although there is still controversy about the optimal cut-off value for IM C_min, in clinical practice, "1100 ng /ml" has become a common reference value for monitoring IM C_min in outpatients. In this study, we thus used 1100 ng/ml as the cutoff value and converted IM C_min, a continuous variable, into a binary variable. We compared nine common ML algorithms. The optimal ML model was selected using AUROC, DCA, Brier Scores, and AUPR. Finally, the XGBoost model was selected as the best model for analysis, internally validated, and proved to have good classification.

The relationships between the label variable (IM C_min) and feature variables were assessed using LASSO regression and LR. Six key variables (daily IM dose, metastatic site, gender, PLT, NEU%, and RBC) were screened out, which were easy to obtain, and also were the key variables in constructing the XGBoost model in this study. Interestingly, except for PLT, these key variables reached also statistical significance in Table 2. Some studies believe that IM is mainly metabolized in the liver [29, 30]. Therefore, before our data analysis, Laboratory indicators related to liver function examination were expected to be key and important features in constructing the classification model. However, to our surprise, the features that were finally screened by parameters did not include laboratory indicators related to liver function examination. We consider that the reason for this phenomenon may be that our outcome variables are binary, whereas the outcome variable in the previous study was continuous. This difference may lead to the exclusion of laboratory indicators related to liver function examination in the final selection of variables.

IM C_min was shown to be higher in females than in males in several studies, and researchers believed the difference could be attributable to differences in body weight or medication adherence between genders [31, 32]. The existence of liver metastases may result in more changes and increased exposure to IM, which may cause higher in IM C_min [33]. A previous study (Eechoute, 2012) found that IM clearance was expected to decrease by 3.8% for every 100 cm³ increase in liver metastatic volume [29]. Previous studies had reported the relationship between daily IM dose and IM C_min [17, 18, 34], the TDM for IM provided a reference for the adjustment of IM dosage, which added to the utility of TDM in the management of patients with GISTs [12]. It is worth noting that no foreign studies have previously reported the effect of RBC on imatinib clearance, but a recent domestic study confirmed that RBC had a significant effect on the clearance of IM [35], which may be due to ethnic differences between domestic and foreign study populations. Thrombocytopenia and neutropenia are common side effects of IM-targeted therapy [36], which may be why PLT and NEU% are key variables in constructing the model.

IM is an anti-cancer drug administered primarily to outpatients because blood samples are not always available at the end of the administration interval. Thus, IM C_min is the most widely used pharmacokinetic proxy for predicting clinical outcomes [7, 37], and C_min is naturally used as a focus for TDM [38]. TDM for IM may reassure patients and physicians about full exposure to the drug and improve long-term adherence to this chronic treatment, which may be a promising approach for fine-tuning the IM dosage for better tolerability and optimal clinical outcomes in patients with GISTs [7, 37]. It is widely known that high IM C_min increases the risk of adverse effects and toxicity, which can reduce medication adherence rates and quality of life. Therefore, it is crucial for patients with GISTs to frequently undergo TDM of IM [34]. However, most hospitals are unable to monitor IM C_min because they do not have the equipment to do so, which makes the IM C_min classification model valuable for clinical application.

Precision therapy stands as a primary use of ML, offering patients customized medical services including individualized dosage modification, plasma concentration prediction, and prediction of negative drug reactions [13, 39, 40]. In clinical practice, 1100 ng/mL is often used as the reference value, combined with the patient's drug tolerance and the change in CT tumor lesions, to evaluate the drug efficacy and adjust the drug dosage [41]. For example, patients with IM C_min less than 1100 ng/ml (which is predicted by the XGBoost model), where tumor progression is defined by imaging and/or symptomatic progression, could be encouraged to appropriately increase the doses. By the same token, patients with IM C_min greater than 1100 ng/ml, as predicted by the XGBoost model, would experience serious adverse drug reactions and could be encouraged to appropriately reduce their doses. Using the above two examples, we know that using machine-learning methods to detect blood drug concentrations could help some hospitals without the TDM platform reduce their healthcare burden. For some hospitals with the TDM platform, sometimes, the ML model is more often used to streamline IM C_min monitoring rather than completely replace TDM.

A model developed by Gotta in 2012 showed that the Bayesian MAP-ρ method, which considered the correlation between pharmacokinetic parameters, could predict IM C_min with an unbiased accuracy of ± 30.7% [42].

The difference between this study and the above study mainly lies in the study population, study design, and study variables. First, IM C_min measured in the adjuvant setting is excluded. Second, the classification model includes six feature variables that are easily accessible during usual treatment. This advantage enables the model to be generalized and applied well. Finally, to our knowledge, this is the first study to develop and internally validate a classification model for IM C_min that has high predictive performance, which, combines with Demetri's study [8], may aid in prognostic prediction in patients with advanced GISTs. Therefore, in the future, we plan to further establish a web application that is easy to use based on the presented XGBoost classification model, which could then be used as a real-time clinical decision support tool through self-learning and optimization and aid in personalized IM dose adjustment.

Although the new model has good predictive performance, there are still some considerable limitations to this study. First, the limited number of samples available may reduce the performance of the XGBoost model. Second, given its nature as a retrospective, single-center research with an extended duration, it faces all the constraints typical of retrospective studies. For instance, the lack of pharmacokinetic parameters and body surface area data, incomplete laboratory indicators, and fluctuations in blood collection time points may all affect IM C_min. Therefore, in this study, the classification prediction of IM C_min is the next best thing, rather than the specific value prediction, which is continuous. For this reason, our current model is more of a reference than a complete replacement for TDM. Third, while our classification model has been internally validated, additional prospective validation should be performed in future studies, or a wholly external dataset should be employed for external validation to improve the generalization ability of this model. Finally, as several works of literature suggest polymorphism effects on exposure and drug-drug interaction via CYP3A [30, 43,44,45], changes in C_min estimation could be suspected, but those indicators are not included in this research. In future work and research, we will make efforts to make up for the above deficiencies and establish a new model, and the result variable of this model is a continuous value, to help some hospitals without the TDM platform reduce their healthcare burden, or even replace TDM.

Conclusion

We developed and validated ML models for individualized classification of IM C_min tailored to patients with advanced GISTs from China by utilizing readily available baseline information and assay indices, which were easy to obtain. This XGBoost model showed good classification performance and had good clinical application value.

Availability of data and materials

The datasets generated and analyzed during the current study are not publicly available due to a total of 890 IM Cmin samples included in the study from 212 patients with advanced GISTs have been deposited in the “Weinichangzai” database, but are available from the corresponding author on reasonable request.

Abbreviations

GIST:: Gastrointestinal stromal tumor
IM:: Imatinib
C_min :: Plasma trough concentration
AUROC:: Area under the receiver-operating characteristic curve
ROC:: Receiver operating characteristic
AUPR:: Area under the precision-recall curve
PR:: Precision-recall curve
AUC:: Area under curve
SHAP:: Shapley additive explanations
KIT:: Kinase growth factor receptor proto-oncogene
TKI:: Tyrosine kinase inhibitor
TDM:: Therapeutic drug monitoring
ML:: Machine learning
CMU:: Chongqing Medical University
WBC:: Blood cell count
PLT:: Platelet count
NEU%:: Percentage of neutrophils
RBC:: Red blood cell count
HB:: Hemoglobin
LYM%:: Percentage of lymphocytes
ALT:: Alanine aminotransferase
AST:: Aspartate aminotransferase
TBIL:: Total bilirubin
DBIL:: Direct bilirubin
IBIL:: Indirect bilirubin
GGT:: Gamma-glutamyl transpeptidase
AKP:: Alkaline phosphatase
Cr:: Creatinine
BUN:: Urea nitrogen
eGFR:: Estimated glomerular filtration rate
LASSO:: Least absolute shrinkage and selection operator
LR:: Logistic regression
XGBoost:: Extreme Gradient Boosting
LightGBM:: Light Gradient Boosting Machine
RF:: Random forest
GNB:: Gaussian Naive Bayes
CNB:: Complement Naive Bayes
MLP:: Multilayer Perceptron
SVM:: Support vector machine
KNN:: K-nearest neighbour
AdaBoost:: Adaptive boost
IQR:: Interquartile range
SD:: Standard deviation
DOG-1:: Gastrointestinal stromal tumor protein 1
CD117:: Cluster of differentiation 117
CD34:: Cell differentiation factor 34
R:: Regression coefficient
SE:: Standard error
OR:: Odds ratio

References

Wang MX, Devine C, Segaran N, Ganeshan D. Current update on molecular cytogenetics, diagnosis and management of gastrointestinal stromal tumors. World J Gastroenterol. 2021;27(41):7125–33.
Article CAS PubMed PubMed Central Google Scholar
Mei L, Du W, Idowu M, von Mehren M, Boikos SA. Advances and Challenges on Management of Gastrointestinal Stromal Tumors. Front Oncol. 2018;8:135. https://0-doi-org.brum.beds.ac.uk/10.3389/fonc.2018.00135.
Article PubMed PubMed Central Google Scholar
Mantese G. Gastrointestinal stromal tumor: epidemiology, diagnosis, and treatment. Curr Opin Gastroenterol. 2019;35:555–9. https://0-doi-org.brum.beds.ac.uk/10.1097/MOG.0000000000000584.
Article CAS PubMed Google Scholar
Senchak J, Ahr K, von Mehren M. Gastrointestinal stromal tumors: What is the best sequence of TKIs. Curr Treat Options Oncol. 2022;23:749–61.
Article PubMed Google Scholar
Chien YH, Würthwein G, Zubiaur P, et al. Population pharmacokinetic modelling of imatinib in healthy subjects receiving a single dose of 400 mg. Cancer Chemother Pharmacol. 2022;90:125–36.
Article CAS PubMed PubMed Central Google Scholar
Goggin C, Stansfeld A, Mahalingam P, et al. Ripretinib in advanced gastrointestinal stromal tumors: an overview of current evidence and drug approval. Future Oncol. 2022;18:2967–78.
Article CAS PubMed Google Scholar
Fahmy A, Hopkins AM, Sorich MJ, et al. Evaluating the utility of therapeutic drug monitoring in the clinical use of small molecule kinase inhibitors: a review of the literature. Expert Opin Drug Metab Toxicol. 2021;17:803–21.
Article CAS PubMed Google Scholar
Demetri GD, Wang Y, Wehrle E, Racine A, Nikolova Z, Blanke CD, et al. Imatinib plasma levels are correlated with clinical benefit in patients with unresectable/metastatic gastrointestinal stromal tumors. J Clin Oncol. 2009;27:3141–7. https://0-doi-org.brum.beds.ac.uk/10.1200/JCO.2008.20.4818.
Article CAS PubMed Google Scholar
Teng JF, Mabasa VH, Ensom MH. The role of therapeutic drug monitoring of imatinib in patients with chronic myeloid leukemia and metastatic or unresectable gastrointestinal stromal tumors. Ther Drug Monit. 2012;34:85–97. https://0-doi-org.brum.beds.ac.uk/10.1097/FTD.0b013e31823cdec9.
Article CAS PubMed Google Scholar
Lankheet NA, Knapen LM, Schellens JH, Beijnen JH, Steeghs N, Huitema AD. Plasma concentrations of tyrosine kinase inhibitors imatinib, erlotinib, and sunitinib in routine clinical outpatient cancer care. Ther Drug Monit. 2014;36:326–34. https://0-doi-org.brum.beds.ac.uk/10.1097/FTD.0000000000000004.
Article CAS PubMed Google Scholar
Bouchet S, Poulette S, Titier K, Moore N, Lassalle R, Abouelfath A, et al. Relationship between imatinib trough concentration and outcomes in the treatment of advanced gastrointestinal stromal tumours in a real-life setting. Eur J Cancer. 2016;57:31–8. https://0-doi-org.brum.beds.ac.uk/10.1016/j.ejca.2015.12.029.
Article CAS PubMed Google Scholar
Zhuang W, Xie JD, Zhou S, Zhou ZW, Zhou Y, Sun XW, et al. Can therapeutic drug monitoring increase the safety of Imatinib in GIST patients. Cancer Med. 2018;7:317–24. https://0-doi-org.brum.beds.ac.uk/10.1002/cam4.1286.
Article CAS PubMed PubMed Central Google Scholar
Ma P, Liu R, Gu W, et al. Construction and interpretation of prediction model of teicoplanin trough concentration via machine learning. Front Med (Lausanne). 2022;9: 808969.
Article PubMed Google Scholar
Liu Y, Zhao S, Du W, Tian Z, Chi H, Chao C, et al. Applying interpretable machine learning algorithms to predict risk factors for permanent stoma in patients after TME. Front Surg. 2023;10:1125875. https://0-doi-org.brum.beds.ac.uk/10.3389/fsurg.2023.1125875.
Article PubMed PubMed Central Google Scholar
Obermeyer Z, Emanuel EJ. Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. N Engl J Med. 2016;375(13):1216–9.
Article PubMed PubMed Central Google Scholar
Doupe P, Faghmous J, Basu S. Machine learning for health services researchers. Value Health. 2019;22(7):808–15.
Article PubMed Google Scholar
Wu X, Li J, Zhou Y, Mao Y, Luo S, He X, et al. Relative factors analysis of imatinib trough concentration in chinese patients with gastrointestinal stromal tumor. Chemotherapy. 2018;63:301–7. https://0-doi-org.brum.beds.ac.uk/10.1159/000493195.
Article CAS PubMed Google Scholar
Hompland I, Bruland ØS, Ubhayasekhera K, Bergquist J, Boye K. Clinical implications of repeated drug monitoring of imatinib in patients with metastatic gastrointestinal stromal tumour. Clin Sarcoma Res. 2016;6:21. https://0-doi-org.brum.beds.ac.uk/10.1186/s13569-016-0062-2.
Article CAS PubMed PubMed Central Google Scholar
Tan KL, Ankathil R, Gan SH. Method development and validation for the simultaneous determination of imatinib mesylate and N-desmethyl imatinib using rapid resolution high performance liquid chromatography coupled with UV-detection. J. Chromatogr B Analyt Technol Biomed Life Sci. 2011;879:3583–91. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jchromb.2011.09.048.
Article CAS Google Scholar
Roth O, Spreux-Varoquaux O, Bouchet S, Rousselot P, Castaigne S, Rigaudeau S, et al. Imatinib assay by HPLC with photodiode-array UV detection in plasma from patients with chronic myeloid leukemia: Comparison with LC-MS/MS. Clin Chim Acta. 2010;411:140–6. https://0-doi-org.brum.beds.ac.uk/10.1016/j.cca.2009.10.007.
Article CAS PubMed Google Scholar
Ohri N, Duan F, Snyder BS, Wei B, Machtay M, Alavi A, et al. Pretreatment 18F-FDG PET Textural Features in Locally Advanced Non-Small Cell Lung Cancer: Secondary Analysis of ACRIN 6668/RTOG 0235. J Nucl Med. 2016;57(6):842–8. https://0-doi-org.brum.beds.ac.uk/10.2967/jnumed.115.166934.
Article CAS PubMed Google Scholar
Zhao LJ, Zou SD, Huang MZ, Wang GG. Distributed regularized stochastic configuration networks via the elastic net. Neural Comput Appl. 2021;33(8):3281–97. https://0-doi-org.brum.beds.ac.uk/10.1007/s00521-020-05178-x.
Article Google Scholar
Zhang H, Wang Z, Tang Y, Chen X, You D, Wu Y, et al. Prediction of acute kidney injury after cardiac surgery: model development using a Chinese electronic health record dataset. J Transl Med. 2022;20:166. https://0-doi-org.brum.beds.ac.uk/10.1186/s12967-022-03351-5.
Article CAS PubMed PubMed Central Google Scholar
Gangoso L, Viana DS, Dokter AM, Shamoun-Baranes J, Figuerola J, Barbosa SA, Bouten W. Cascading effects of climate variability on the breeding success of an edge population of an apex predator. J Anim Ecol. 2020;89(11):2631–43.
Article PubMed PubMed Central Google Scholar
Li C, Liu H, Hu Q, Que J, Yao J. A Novel Computational Model for Predicting microRNA-Disease Associations Based on Heterogeneous Graph Convolutional Networks. Cells. 2019;8. https://0-doi-org.brum.beds.ac.uk/10.3390/cells8090977.
Lei T, Guo J, Wang P, Zhang Z, Niu S, Zhang Q, et al. Establishment and Validation of Predictive Model of Tophus in Gout Patients, J Clin Med. 2023;12. https://0-doi-org.brum.beds.ac.uk/10.3390/jcm12051755.
Belkin M, Hsu D, Ma S, Mandal S. Reconciling modern machine-learning practice and the classical bias-variance trade-off. Proc Natl Acad Sci U S A. 2019;116:15849–54. https://0-doi-org.brum.beds.ac.uk/10.1073/pnas.1903070116.
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Yuan K, Zhao S, Ye B, Wang Q, Liu Y, Zhang P, et al. A novel T-cell exhaustion-related feature can accurately predict the prognosis of OC patients. Front Pharmacol. 2023;14:1192777. https://0-doi-org.brum.beds.ac.uk/10.3389/fphar.2023.1192777.
Article CAS PubMed PubMed Central Google Scholar
Eechoute K, Fransson MN, Reyners AK, de Jong FA, Sparreboom A, van der Graaf WT, et al. A long-term prospective population pharmacokinetic study on imatinib plasma concentrations in GIST patients. Clin Cancer Res. 2012;18:5780–7. https://0-doi-org.brum.beds.ac.uk/10.1158/1078-0432.CCR-12-0490.
Article CAS PubMed Google Scholar
Maddin N, Husin A, Gan SH, Aziz BA, Ankathil R. Impact of CYP3A4*18 and CYP3A5*3 Polymorphisms on Imatinib Mesylate Response Among Chronic Myeloid Leukemia Patients in Malaysia. Oncol Ther. 2016;4(2):303–14.
Article PubMed Google Scholar
Larson RA, Druker BJ, Guilhot F, O’Brien SG, Riviere GJ, Krahnke T, et al. Imatinib pharmacokinetics and its correlation with response and safety in chronic-phase chronic myeloid leukemia: a subanalysis of the IRIS study. Blood. 2008;111:4022–8. https://0-doi-org.brum.beds.ac.uk/10.1182/blood-2007-10-116475.
Article CAS PubMed Google Scholar
Wu X, Ge Y, He X, Li J, Zhang J. Changes in imatinib plasma trough level during long-term treatment in patients with intermediate- or high-risk gastrointestinal stromal tumors: Relationship between covariates and imatinib plasma trough level. Front Surg. 2023;10:1115141. https://0-doi-org.brum.beds.ac.uk/10.3389/fsurg.2023.1115141.
Article PubMed PubMed Central Google Scholar
Peng B, Lloyd P, Schran H. Clinical pharmacokinetics of imatinib. Clin Pharmacokinet. 2005;44:879–94. https://0-doi-org.brum.beds.ac.uk/10.2165/00003088-200544090-00001.
Article CAS PubMed Google Scholar
Xia Y, Chen S, Luo M, Wu J, Cai S, He Y, et al. Correlations between imatinib plasma trough concentration and adverse reactions in Chinese patients with gastrointestinal stromal tumors. Cancer. 2020;126(Suppl 9):2054–61. https://0-doi-org.brum.beds.ac.uk/10.1002/cncr.32751.
Article CAS PubMed Google Scholar
Jiang X, Fu Q, Jing Y, et al. Personalized dose of adjuvant imatinib in patients with gastrointestinal stromal tumors: results from a population pharmacokinetic analysis. Drug Des Devel Ther. 2023;17:809–20.
Article CAS PubMed PubMed Central Google Scholar
Druker BJ, Sawyers CL, Kantarjian H, et al. Activity of a specific inhibitor of the BCR-ABL tyrosine kinase in the blast crisis of chronic myeloid leukemia and acute lymphoblastic leukemia with the Philadelphia chromosome. N Engl J Med. 2001;344(14):1038–42.
Article CAS PubMed Google Scholar
Yu H, Badhan R. The Application of Virtual Therapeutic Drug Monitoring to Assess the Pharmacokinetics of Imatinib in a Chinese Cancer Population Group. J Pharm Sci. 2023;112(2):599–609. https://0-doi-org.brum.beds.ac.uk/10.1016/j.xphs.2022.09.028.
Article CAS PubMed Google Scholar
Janssen JM, Dorlo T, Beijnen JH, Huitema A. Evaluation of extrapolation methods to predict trough concentrations to guide therapeutic drug monitoring of oral anticancer drugs. Ther Drug Monit. 2020;42:532–9. https://0-doi-org.brum.beds.ac.uk/10.1097/FTD.0000000000000767.
Article PubMed Google Scholar
Anastopoulos IN, Herczeg CK, Davis KN, Dixit AC. Multi-Drug Featurization and Deep Learning Improve Patient-Specific Predictions of Adverse Events. Int J Environ Res Public Health. 2021;18(5):2600. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph18052600.
Article CAS PubMed PubMed Central Google Scholar
Matsuzaki T, Kato Y, Mizoguchi H, Yamada K. A machine learning model that emulates experts’ decision making in vancomycin initial dose planning. J Pharmacol Sci. 2022;148(4):358–63.
Article CAS PubMed Google Scholar
IJzerman NS, Groenland SL, Koenen AM, Kerst M, van der Graaf W, Rosing H, Beijnen JH, Huitema A, Steeghs N,. Therapeutic drug monitoring of imatinib in patients with gastrointestinal stromal tumours - Results from daily clinical practice. Eur J Cancer. 2020;136:140–8. https://0-doi-org.brum.beds.ac.uk/10.1016/j.ejca.2020.05.025.
Article CAS PubMed Google Scholar
Gotta V, Widmer N, Montemurro M, Leyvraz S, Haouala A, Decosterd LA, et al. Therapeutic drug monitoring of imatinib: Bayesian and alternative methods to predict trough levels. Clin Pharmacokinet. 2012;51:187–201. https://0-doi-org.brum.beds.ac.uk/10.2165/11596990-000000000-00000.
Article CAS PubMed Google Scholar
Teo YL, Ho HK, Chan A. Metabolism-related pharmacokinetic drug-drug interactions with tyrosine kinase inhibitors: current understanding, challenges and recommendations. Br J Clin Pharmacol. 2015;79(2):241–53.
Article CAS PubMed PubMed Central Google Scholar
Nebot N, Crettol S, d’Esposito F, Tattam B, Hibbs DE, Murray M. Participation of CYP2C8 and CYP3A4 in the N-demethylation of imatinib in human hepatic microsomes. Br J Pharmacol. 2010;161(5):1059–69.
Article CAS PubMed PubMed Central Google Scholar
Goey AK, Mooiman KD, Beijnen JH, Schellens JH, Meijerman I. Relevance of in vitro and clinical data for predicting CYP3A4-mediated herb-drug interactions in cancer patients. Cancer Treat Rev. 2013;39(7):773–83.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work is supported by Extreme Smart Analysis platform (https://www.xsmartanalysis.com/), and all Statistical analyses were performed using R version 4.2.3 and python version 3.11.4.

Funding

No funding was used in this study.

Author information

Authors and Affiliations

Department of Gastrointestinal Surgery, the First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
Pan Ran, Tao Tan, Jinjin Li & Jun Zhang
Department of Internal Medicine, Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing, 400030, China
Hao Yang
Department of Pharmacy, the First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
Juan Li

Authors

Pan Ran
View author publications
You can also search for this author in PubMed Google Scholar
Tao Tan
View author publications
You can also search for this author in PubMed Google Scholar
Jinjin Li
View author publications
You can also search for this author in PubMed Google Scholar
Hao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Juan Li
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.R. wrote the main manuscript text, prepared Figures 1, 2, 3, 4 and 5, and Tables 1, 2 and 3, and performed the validation; T.T., J.L. (Jinjin Li), H.Y. performed the formal analysis; P.R., T.T., J.L. (Jinjin Li), H.Y. and J.L. (Juan Li) enriched and improved the discussion section; J.Z. and J.L. (Juan Li) designed the research and controlled the structure and quality of the manuscript.

Corresponding authors

Correspondence to Juan Li or Jun Zhang.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Institutional Review Board of The First Affiliated Hospital of Chongqing Medical University (Approval number: 2022-K544) with a waiver for written informed consent, owing to its observational, retrospective, and non-interventional design. All clinical data collected into the “Weinichangzai” database were obtained with the patient's verbal informed consent, in which the patient data used were kept strictly confidential.

Consent for publication

Not Applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Ran, P., Tan, T., Li, J. et al. Advanced gastrointestinal stromal tumor: reliable classification of imatinib plasma trough concentration via machine learning. BMC Cancer 24, 264 (2024). https://0-doi-org.brum.beds.ac.uk/10.1186/s12885-024-11930-6

Download citation

Received: 26 September 2023
Accepted: 29 January 2024
Published: 24 February 2024
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12885-024-11930-6

Advanced gastrointestinal stromal tumor: reliable classification of imatinib plasma trough concentration via machine learning

Abstract

Aim

Methods

Results

Conclusion

Introduction

Materials and methods

Patients and data

Determination of IM Cmin

Laboratory indicators

Method of feature selection

Selection method of ML algorithm

Statistical analysis

Results

Baseline characteristics

Key variables

The best model building

The best model evaluation

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Cancer

Contact us

Determination of IM C_min