Leveraging explainable artificial intelligence for early prediction of bloodstream infections using historical electronic health records

Rajeev Bopche; Lise Tuset Gustad; Jan Egil Afset; Birgitta Ehrnström; Jan Kristian Damås; Øystein Nytrø

doi:10.1371/journal.pdig.0000506

Peer Review History

Original SubmissionApril 11, 2024
14 Jun 2024 Decision Letter - Dhiya Al-Jumeily OBE, Editor, Qihuang Zhang, Editor PDIG-D-24-00143 Advancing Bloodstream Infection Prediction Using Historical Electronic Health Records PLOS Digital Health Dear Dr. Bopche, Thank you for submitting your manuscript to PLOS Digital Health. After careful consideration, we feel that it has merit but does not fully meet PLOS Digital Health's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript within 60 days Aug 13 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at digitalhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pdig/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: * A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. * A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. * An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Qihuang Zhang Academic Editor PLOS Digital Health Journal Requirements: 1. Please provide separate figure files in .tif or .eps format only and remove any figures embedded in your manuscript file. Please also ensure that all files are under our size limit of 10MB. For more information about figure files please see our guidelines: https://journals.plos.org/digitalhealth/s/figures https://journals.plos.org/digitalhealth/s/figures#loc-file-requirements 2. We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list. 3. In the online submission form, you indicated that "The jupyter notebooks implementing the data preprocessing, sequence creation, feature engineering and model development pipeline are available through the corresponding author. The processed and transformed final datasets, and the list of derived features are available through the corresponding author on a reasonable request". All PLOS journals now require all data underlying the findings described in their manuscript to be freely available to other researchers, either 1. In a public repository, 2. Within the manuscript itself, or 3. Uploaded as supplementary information. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If your data cannot be made publicly available for ethical or legal reasons (e.g., public availability would compromise patient privacy), please explain your reasons by return email and your exemption request will be escalated to the editor for approval. Your exemption request will be handled independently and will not hold up the peer review process, but will need to be resolved should your manuscript be accepted for publication. One of the Editorial team will then be in touch if there are any issues. Additional Editor Comments (if provided): I share a similar question as the reviewer regarding the model's performance. Also, the speaker should examine the interpretation of the SHAP values. For example, a lower SHAP value represents a negative influence on the prediction rather than indicating that the predictor is not important (e.g., 'anomaly_score'). [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Does this manuscript meet PLOS Digital Health’s publication criteria? Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe methodologically and ethically rigorous research with conclusions that are appropriately drawn based on the data presented. Reviewer #1: Yes Reviewer #2: Yes -------------------- 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes -------------------- 3. Have the authors made all data underlying the findings in their manuscript fully available (please refer to the Data Availability Statement at the start of the manuscript PDF file)? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes -------------------- 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS Digital Health does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes -------------------- 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The research work reported is interesting. There are some major issues that the authors consider addressing before acceptance: 1. [Case Study] To further understand the proposed method, the authors may provide a case study. 2. [Motivation and Contribution] As known, there are some existing relevant methods in the community. The authors should include these studies in the manuscript. What are the improvements of the proposed method compared with these existing methods? What problems did the previous works exist? How to solve these problems? The authors may consider analyzing the problems of the previous works and how to address these problems in the manuscript. Please explain that. 3. [SOTA] The authors should compare the proposed method with some recent algorithms. 4. [Mathematics] The authors consider some mathematical reasonings in the manuscript. 5. [Research Gap] The research gaps should be highlighted and explained in the manuscript. 6. [State Similar Studies] There are some existing algorithms that have some merits similar to the proposed method. The authors should explain the differences between the proposed method and existing algorithms, including Densely Knowledge-aware Network for Multivariate Time Series Classification, CapMatch: Semi-supervised Contrastive Transformer Capsule with Feature-based Knowledge Distillation for Human Activity Recognition, DTCM: Deep Transformer Capsule Mutual Distillation for Multivariate Time Series Classification, and Deep Contrastive Representation Learning With Self-Distillation. 7. [Computational Complexity] Computational complexity should be analyzed and compared in the manuscript. Reviewer #2: 1. In line 246, does it indicate that the “Temporal ML models” utilizing additional time variable compared to “static models”? If so, is there explanation about why static model works better than the sequential model incorporating additional information. 2. In the model performance section, a bunch of the model evaluations were used, is there a preferred metric here for bloodstream infection prediction? Especially given the fact of the imbalance feature of the dataset? Have you tried to apply your models on the balanced dataset and see if the model performance/SHAP changed? 3. In Table 2, I am surprised that the precision of RF is 0.8510, which surpass other models including other tree based models, but a low recall of 0.2201; Similar situation applies for NN but not as extreme as RF. Just wonder if it is still affected by the unbalance nature of the dataset, might be good to run a sensitivity analysis through adjusting the weights of the minority group (or resampling), which could potentially make precision/recall score more stable. Maybe Ensemble technique could be used to provide an overall performance. 4. Since Supplementary Table 3 shows some features are highly correlated, it would be great to explain whether/to what extend it affect the shap value, so that the independence assumption are still met. 5. The current SHAP value is from XGBoost model, is it because it has the “balanced performance” (indicated by line 370)? Just wonder if it is possible to have overall “feature importance” ignoring which particular model is picked. 6. Overall for the XBSI framework, in Figure 1, due to the performance for difference model type dynamic/static, does it suggest static models are suitable with the prediction task for the future. -------------------- 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. Do you want your identity to be public for this peer review? If you choose “no”, your identity will remain anonymous but your review may still be made public. For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No -------------------- [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. https://doi.org/10.1371/journal.pdig.0000506.r001
Revision 1
12 Aug 2024 Author Response Attachments Attachment Submitted filename: Response_to_Reviewers.docx https://doi.org/10.1371/journal.pdig.0000506.r002
29 Sep 2024 Decision Letter - Dhiya Al-Jumeily OBE, Editor, Qihuang Zhang, Editor Leveraging Explainable Artificial Intelligence for Enhanced Prediction of Bloodstream Infections Using Historical Electronic Health Records PDIG-D-24-00143R1 Dear Mr. Bopche, We are pleased to inform you that your manuscript 'Leveraging Explainable Artificial Intelligence for Enhanced Prediction of Bloodstream Infections Using Historical Electronic Health Records' has been provisionally accepted for publication in PLOS Digital Health. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow-up email from a member of our team. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they'll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact digitalhealth@plos.org. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Digital Health. Best regards, Qihuang Zhang Academic Editor PLOS Digital Health ********************************************************* Reviewer Comments (if any, and for reference): Reviewer's Responses to Questions Comments to the Author The authors added more clarification of how the temporal part contribute to the static models, clarified the the advantage of the static models over the sequential models providing more sensitivity analysis and provided a comprehensive analysis contributing the bloodstream infection prediction. The portion that may need minor corrections: 1. In section 3.4 line 611, it seems like the sentence “Our findings indicate that while static models generally outperformed sequential models.” Is not complete otherwise might be good to delete “while”. 2. The horizontal Prediction 2 part in Figure 4 has label and value overlapped, it would be better to modify the layout. ******** https://doi.org/10.1371/journal.pdig.0000506.r003

Open letter on the publication of peer review reports

PLOS recognizes the benefits of transparency in the peer review process. Therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. Reviewers remain anonymous, unless they choose to reveal their names.

We encourage other journals to join us in this initiative. We hope that our action inspires the community, including researchers, research funders, and research institutions, to recognize the benefits of published peer review reports for all parts of the research system.

Learn more at ASAPbio .