Claude 2, developed by rising startup star Anthropic, is the most capable large language model generative AI on the current market. It reached a success ratio of 70 percent with the HumanEval benchmark. This is particularly noteworthy as it is a 0-shot evaluation, meaning all AI programs benchmarked against it had not had previous data of this sort nor previous training with the tasks. This means that Claude 2 was the quickest at absorbing and understanding the task given to it.
HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023
Adjust the presentation of the statistic and data points.
Share the statistic on social media channels or embed the statistic in your
website using "Embed Code", where available.
Cite this statistic and select one of the following formats: APA, Chicago, Harvard, MLA & Bluebook.
Print the statistic including description and metadata.
Chart type
HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023
Share this statistic
You have no right to use this feature.
Make sure to contact us if you are interested in scientific citation.
You can upgrade your account to enable this functionality for all statistics.
This feature is not available with your current account.Request access
Learn more about how Statista can support your business.
xAI. (November 4, 2023). HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023 [Graph]. In Statista. Retrieved May 11, 2025, from https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/
xAI. "HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023." Chart. November 4, 2023. Statista. Accessed May 11, 2025. https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/
xAI. (2023). HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023. Statista. Statista Inc.. Accessed: May 11, 2025. https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/
xAI. "Humaneval Benchmark Comparison between Major Generative Artificial Intelligence (Ai) Programs in 2023." Statista, Statista Inc., 4 Nov 2023, https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/
xAI, HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023 Statista, https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/ (last visited May 11, 2025)
HumanEval benchmark comparison between major generative artificial intelligence (AI) programs in 2023 [Graph], xAI, November 4, 2023. [Online]. Available: https://www.statista.com/statistics/1447778/humaneval-benchmark-comparison-of-major-ai-programs/
Profit from additional features with an Employee Account
Please create an employee account to be able to mark statistics as favorites.
Then you can access your favorite statistics via the star in the header.
Profit from the additional features of your individual account
Currently, you are using a shared account. To use individual functions (e.g., mark statistics as favourites, set
statistic alerts) please log in with your personal account.
If you are an admin, please authenticate by logging in again.