News
As more people turn to ChatGPT for health concerns, OpenAI introduces a new benchmark to evaluate the safety and accuracy of ...
OpenAI has unveiled a large dataset to help test how well artificial intelligence models answer health care questions.
OpenAI's HealthBench benchmark tests how safely and accurately AI like ChatGPT can handle health queries, suggest treatments, ...
OpenAI recently sparked some online controversy for not running certain safety evaluations on the final version of its o1 AI model.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results