Science News Daily App

OpenAI Releases HealthBench: An Open-Source Benchmark for Measuring the Performance and Safety of Large Language Models in Healthcare

Written by

in

OpenAI has released HealthBench, an open-source evaluation framework designed to measure the performance and safety of large language models (LLMs) in realistic healthcare scenarios. Developed in collaboration with 262 physicians…

Continue Reading

More posts

3D-printed kidney tumors offer a new tool in the fight against renal cancer

August 12, 2025
Sunil Shah and Dr Catherine Elton join Atelerix’s Board to drive global commercial expansion

August 12, 2025
China unveils space-debris catcher with possible military use

August 12, 2025
Falcon 9 nearing its peak launch rate

August 12, 2025