Editor's Pick

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

May 13, 202494 views0

Enlarge (credit: Getty Images)

On Monday, OpenAI employee William Fedus confirmed on X that a mysterious chat-topping AI chatbot known as “gpt-chatbot” that had been undergoing testing on LMSYS’s Chatbot Arena and frustrating experts was, in fact, OpenAI’s newly announced GPT-4o AI model. He also revealed that GPT-4o had topped the Chatbot Arena leaderboard, achieving the highest documented score ever.

“GPT-4o is our new state-of-the-art frontier model. We’ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot,” Fedus tweeted.

Chatbot Arena is a website where visitors converse with two random AI language models side by side without knowing which model is which, then choose which model gives the best response. It’s a perfect example of vibe-based AI benchmarking, as AI researcher Simon Willison calls it.

Read 8 remaining paragraphs | Comments

What's your reaction?

Excited

0

Happy

0

In Love

0

Not Sure

0

Silly

0

You may also like

Editor's Pick

Yearlong supply-chain attack targeting security pros steals 390K credentials

By

December 13, 2024

Editor's Pick

Twirling body horror in gymnastics video exposes AI’s flaws

By

December 13, 2024

Editor's Pick

Critical WordPress plugin vulnerability under active exploit threatens thousands

By

December 12, 2024

Leave a reply Cancel reply

More in:Editor's Pick

Editor's Pick

OpenAI introduces “Santa Mode” to ChatGPT for ho-ho-ho voice chats

On Thursday, OpenAI announced that ChatGPT users can now talk to a simulated version of ...

Editor's Pick

Russia takes unusual route to hack Starlink-connected devices in Ukraine

Russian nation-state hackers have followed an unusual path to gather intel in the country’s ongoing ...

Editor's Pick

Google goes “agentic” with Gemini 2.0’s ambitious AI agent features

On Wednesday, Google unveiled Gemini 2.0, the next generation of its AI-model family, starting with ...

Editor's Pick

AI company trolls San Francisco with billboards saying “stop hiring humans”

Since the dawn of the generative AI era a few years ago, the march of ...

Now Reading

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

2min read

0 %