Alpaca was the least reliable generative AI program in 2023, with an accuracy score of 20 percent or lower in nearly all categories. ChatGPT, made by OpenAI, and Claude and Claude 2, both made by Anthropic, were the most reliable generative AI programs overall. Davinci002 was the most reliable model in the general category, but it suffered considerably from hallucinations in summarization tasks.
HaluEval hallucination classification accuracy benchmark of generative artificial intelligence (AI) models globally in 2023
The survey was administered to census-targeted samples of over 1,000 people in each of 21 countries, for a total of 23,882 surveys conducted in 12 languages.
The source breaks down the functions as follows: "SO = Service operations", "M&S = Marketing and sales", and "R&D = Research and development".