Our new ChatGPT benchmarking approach published by JAMIA
2023-12-20
2023/12/20
We published a novel approach for ChatGPT benchmarking study in JAMIA.
We pioneered the use of live symptom checking services, such as Mayo Clinic Symptom Checker, for benchmarking ChatGPT’s symptom checking capabilities. Across a broad range of 194 diseases, the symptom checking accuracy approached 80% for GPT-4 model, which qualifies ChatGPT for clinical studies.
Chen A, Ch...