International Telemedicine Program: Physician Versus AI Responses

Authors

  • Paul Hart, MD Formerly Volunteer Physician, University of Massachusetts Medical School, Worcester, Massachusetts, USA https://orcid.org/0000-0001-7217-1966

DOI:

https://doi.org/10.30953/thmt.v11.712

Keywords:

artificial intelligence, asynchronous consultations, clinical decision support, diagnostic reasoning, international health, low-resource settings, OpenEvidence, patient safety, physician–AI comparison, telemedicine

Abstract

Introduction: Asynchronous, provider-to-provider telemedicine is increasingly used to extend specialist expertise to frontline clinicians in low-resource settings,[1–4] yet the role of artificial intelligence (AI) clinical decision support in these workflows remains unclear.[5,6] The Telemedicine Health and Empowerment Program of a US based NGO links U.S.-based volunteer physicians with African providers through a secure, store-and-forward platform.[3] Open Evidence is an AI decision support tool that generates differentials and management suggestions from structured clinical narratives.[5] This study compared how Open Evidence and human volunteers responded to the same real-world telemedicine consultations.

Methods: A retrospective comparative case series of 50 telemedicine consultations from African providers that had de-identified clinical narratives and written responses from volunteer physicians was conducted. The same narratives were entered into OpenEvidence, and AI outputs were captured without additional prompting. For each physician and AI, the response was recorded for the presence of four elements: (1) explicit diagnostic list or working diagnosis, (2) recommended next diagnostic steps (history, examination, laboratory tests, or procedures), (3) immediate treatment recommendations, and (4) identification of red flag features or follow-up concerns. Elements were compared descriptively between AI and physician responses. 

Results: Seven consultations (14%) asked whether an initial assessment was correct, 9 (18%) requested guidance on further workup, and 34 (68%) sought a broader differential diagnosis. AI responses included a diagnostic list in 41 cases (82%), suggested next diagnostic steps in 50 (100%), immediate treatment recommendations in 50 (100%), red flag features in 26 (52%), and follow-up concerns in 37 (74%). Physician responses included an explicit diagnostic list in 12 cases (24%), suggested next diagnostic steps in 42 (84%), immediate treatment recommendations in 29 (58%), red flag features in 4 (8%), and follow-up concerns in 9 (18%).

Conclusions: In this African–U.S. telemedicine program, AI-generated responses were more structurally complete and safety-oriented, whereas physicians offered more selective, context-focused guidance. These findings support a hybrid telehealth workflow in which AI provides systematic completeness and safety checks while human consultants supply contextual judgment and local prioritization. [5–7]

Downloads

Download data is not yet available.

References

1. Wootton R, Bonnardot L. Telemedicine in low-resource settings. Front Public Health. 2015;3:3. https://doi.org/10.3389/fpubh.2015.00003

2. Bonnardot L, Liu J, Wootton E, Amoros I, Olson D, Wong S, et al.

The development of a multilingual tool for facilitating the primary–specialty care interface in low-resource settings: the MSF tele-expertise system. Front Public Health. 2014;2:126.

https://doi.org/10.3389/fpubh.2014.00126

3. Kim EJ, Moretti ME, Kimathi AM, Chan SY, Wootton R. Use of provider-to-provider telemedicine in Kenya during the COVID-19 pandemic. Front Public Health. 2022;10:1028999. https://doi. org/10.3389/fpubh.2022.1028999

4. Totten AM, Womack DM, Griffin JC, McDonagh MS, Davis O’Reilly C, Blazina I, et al. Telehealth-guided provider-toprovider

communication to improve rural health: a systematic review. J Telemed Telecare. 2024;30(8):1209–29. https://doi.org/

10.1177/1357633X221139892

5. Hurt RT, Stephenson CR, Gilman EA, Aakre CA, Croghan IT, Mundi MS, et al. The use of an artificial intelligence platform OpenEvidence to augment clinical decision-making for primary care physicians. J Prim Care Community Health. 2025;16:21501319251332215. https://doi.org/10.1177/21501319251332215

6. Bagla P, Hanna J, Marthambadi B, Watkins S. Patterns of artificial intelligence use in clinical work by hospitalists: survey study. J Med. Internet Res. 2026;28(1):e85973. https://doi.org/10.2196/85973

7. Ye J, He L, Beestrum M. Implications for implementation and adoption of telehealth in developing countries: a systematic review of China’s practices and experiences. NPJ Digit Med. 2023;6(1):174. https://doi.org/10.1038/s41746-023-00908-6

8. Goh E, Gallo R, Hom J, Strong E, Weng Y, Kerman H, et al. Large

language model influence on diagnostic reasoning: a randomized clinical trial. JAMA Network Open. 2024;7(10):e2440969. https://doi. org/10.1001/jamanetworkopen.2024.40969

9. Duggan MJ, Gervase J, Schoenbaum A, Hanson W, Howell JT, 3rd, Sheinberg M, et al. Clinician experiences with ambient scribe technology to assist with documentation burden and efficiency. JAMA Network Open. 2025;8(2):e2460637. https://doi.org/10.1001/ jamanetworkopen.2024.60637

Published

2026-06-26

How to Cite

Hart, P. (2026). International Telemedicine Program: Physician Versus AI Responses. Telehealth and Medicine Today, 11(2). https://doi.org/10.30953/thmt.v11.712

Issue

Section

Original Clinical Research

Most read articles by the same author(s)