OpenAI HealthBench Dataset

News

As more people turn to ChatGPT for health concerns, OpenAI introduces a new benchmark to evaluate the safety and accuracy of ...

OpenAI has unveiled a large dataset to help test how well artificial intelligence models answer health care questions.

10h

OpenAI's HealthBench benchmark tests how safely and accurately AI like ChatGPT can handle health queries, suggest treatments, ...

22h

OpenAI recently sparked some online controversy for not running certain safety evaluations on the final version of its o1 AI model.
