ARENA: Get Rewarded for Evaluating AI Chatbot Responses

Let the battle of the LLMs commence

AI Network
Jun 19, 2024 (7 min read)


ChatGPT, Claude, Llama3, BERT: these are just some of the many large language models (LLMs) and AI chatbots now out in the world, answering our questions and assisting us with an astonishingly large array of tasks.

A few years ago, AI assistants with such capabilities were nearly inconceivable for most people. Since ChatGPT exploded onto the scene, though, many have become aware of AI chatbots, their vast capabilities, and the range of different chatbots now available.

Are they all the same, though? Does it matter whether you pose your question to ChatGPT or to Llama3? Won’t they all give you the same answer, or at least the same level of quality?

The answer, as it happens, is no: they are not all the same, and the one you choose does matter.

The reasons include the type of LLM, the manner in which it is trained, the underlying methods it uses to generate responses, and how much it is actually used.

Large language models are trained on a vast amount of data before they’re put out into the world for public use. The amount of data differs from LLM to LLM, and generally speaking, the more data the model is trained on the more capable it should be.

The timing of the model’s training is also a factor. GPT3.5, for example, was trained on data up to Sept 2021, and the model itself couldn’t search the internet, which severely limited its ability to answer queries about anything that happened after its training cut-off date. GPT4, on the other hand, was trained on data up to Apr 2023, has more parameters, and could search the internet, making it superior to its predecessor.

The way in which the LLM operates is also a factor. LLMs are built on the transformer architecture, a type of machine learning model which assigns probabilities to the next word in a sequence. LLMs with a superior architecture and training are more likely to get the next word in a sentence ‘right’ and answer users’ queries more effectively. Usage matters too: real conversations and user feedback can be used to refine later versions of a model, so a popular LLM may improve quicker over time than a lesser-used one.
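To make the idea of "assigning probabilities to the next word" more concrete, here is a minimal sketch in Python. It is not how any particular LLM is implemented: the candidate words and their scores are made up for illustration, and a real model would compute such scores over tens of thousands of possible tokens. The sketch only shows the final step, where raw scores are turned into probabilities with a softmax function.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores.values())  # subtract the max for numerical stability
    exps = {word: math.exp(s - peak) for word, s in scores.items()}
    total = sum(exps.values())
    return {word: e / total for word, e in exps.items()}

# Hypothetical scores a model might assign to candidate next words
# after the prompt "The cat sat on the". These numbers are invented.
scores = {"mat": 4.0, "sofa": 2.5, "moon": 0.5}

probs = softmax(scores)
best = max(probs, key=probs.get)  # the most likely next word

# Print candidates from most to least likely.
for word, p in sorted(probs.items(), key=lambda kv: -kv[1]):
    print(f"{word}: {p:.2f}")
print("Most likely next word:", best)
```

A model that assigns better scores at this step will tend to pick more sensible next words, which is one reason two LLMs can answer the same question differently.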

LLMs can also be closed or open source, meaning that their underlying code is either a closely guarded secret (closed source, like ChatGPT), or it’s open and anyone can freely see it and build on it in their own way (like Llama3 by Meta). This may also be a factor in an LLM’s capabilities, for open source promotes collaboration, whereas closed source is shrouded in secrecy.

All of this makes it obvious that different LLMs may possess differing capabilities. How do we know which one is best?

This is why we built Arena.

ARENA — where LLMs Battle for Supremacy

Arena is an AI chatbot comparison service where LLMs battle it out to see which comes out on top.

Arena is incredibly simple to use: users ask a question and two anonymous chatbots answer. Users then rate which response is better, whether the two are tied, or whether both are bad. As a reward, users receive $AIN tokens (and higher quality questions earn more $AIN tokens).

arena.ainetwork.ai. Choose the better answer and get rewarded with $AIN tokens.

It’s that simple. The next time you have a question for a chatbot, go to Arena and ask, get two answers from two different LLMs, choose the better one and get rewarded for it.
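For readers curious how ratings like these could be turned into a ranking, here is a minimal sketch in Python. It is an assumption for illustration only, not a description of how Arena actually scores models: the model names, the vote records and the scoring rule (a win counts as 1, a tie as half a point for each side, "both bad" as nothing) are all hypothetical.

```python
from collections import defaultdict

# Hypothetical vote records: (model_a, model_b, verdict).
# The verdict mirrors the four choices a user has:
# "a" is better, "b" is better, "tie", or "both_bad".
votes = [
    ("model-x", "model-y", "a"),
    ("model-y", "model-z", "tie"),
    ("model-x", "model-z", "b"),
    ("model-x", "model-y", "a"),
    ("model-y", "model-z", "both_bad"),
]

def win_rates(votes):
    """Return each model's share of points per battle it took part in."""
    points = defaultdict(float)
    battles = defaultdict(int)
    for model_a, model_b, verdict in votes:
        battles[model_a] += 1
        battles[model_b] += 1
        if verdict == "a":
            points[model_a] += 1.0
        elif verdict == "b":
            points[model_b] += 1.0
        elif verdict == "tie":
            points[model_a] += 0.5
            points[model_b] += 0.5
        # "both_bad": neither model earns any points
    return {model: points[model] / battles[model] for model in battles}

rates = win_rates(votes)
# Leaderboard: model names ordered from highest to lowest win rate.
leaderboard = sorted(rates, key=rates.get, reverse=True)
print(leaderboard)
```

A simple win rate like this ignores how strong each opponent was. Comparison services often use rating systems that account for that, but the basic idea is the same: many individual human judgments add up to a ranking.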

How to Use Arena

Arena is very simple to use. Just follow these steps:

1. Download the AIN wallet for Chrome

2. Head to Arena — arena.ainetwork.ai

3. Connect your AIN wallet

4. Ask a question

5. Rate which answer is better

6. Receive your $AIN token reward directly in your wallet.

It’s that simple.

Why it’s Important to Test LLMs

AI is changing the world, and it’s not doing it slowly. AI models seem to be getting more advanced by the day, and they have the power to fundamentally shift the way we as humans work, play and live our lives.

So far, a lot of this world-changing technology has been veiled in mystery. OpenAI, for example, launched ChatGPT (built on GPT3.5) in Nov 2022 and the world suddenly became aware of the mind-blowing potential of AI, yet every update since has been shrouded in secrecy until the time of launch.

At AI Network we believe in collaboration, transparency and openness, and believe AI and its development should be clear for all to see, and decided by the masses. This is why we champion open source AI, and keep all of our services open source.

Testing LLMs is part of this mission: to make AI transparent and useful for everyone. If LLMs are going to be a daily part of our lives (and for many, they already are), it’s important to understand which will serve your purposes best. This is why we built Arena.

$AIN token & AI Network

When you use Arena you’ll receive rewards in $AIN tokens, the native cryptocurrency of AI Network. AI Network is a blockchain-based AI development ecosystem whose mission is to democratize AI and AI development, making it useful and transparent for everyone.

$AIN is the backbone of the AI Network ecosystem and is used in all its transactions.

AI Network offers multiple services and platforms, among them:

  • AINA: the world’s first open source AI agent platform & marketplace
  • GPU sharing service: hourly GPU resource rentals for AI startups
  • Runo NFT: holders get rewarded in $AIN for supporting AI projects
  • LLM as a service: providing LLMs for specific purposes and projects

The ecosystem also hosts numerous additional services, such as Unblock Media, Soulfiction, Miniegg and Uncommon Gallery.

$AIN token is used in all AI Network services, like AINA, and can also be traded on CEXes and DEXes like MEXC, LBank and Uniswap.

AINA — the World’s First open source AI Agent platform and marketplace

One of the most exciting places you can use $AIN is on AINA, the world’s first open source AI Agent platform, marketplace and LLM.

AINA is an AI agent marketplace where users can create their own AI agents. An AI Agent is an AI model made for a specific purpose which autonomously learns from other AI models within the ecosystem. This means the agent improves over time without requiring input from the user.

AINA is built on the blockchain. Users connect their AIN wallets to the platform and use $AIN tokens for transactions. Every AI agent is digitized as an AINFT, and every AI agent created by a user is recorded as that user’s property as an NFT. This means users own all the value their AI agents create. For example, if you create an AI agent to write articles, and it begins generating profit from writing (maybe those articles get picked up by a publication which pays per article view), you as the user own that profit.

Because you own your AI agent as an AINFT, you’re free to sell or rent your AI agents to other users on AINA, and also to buy or rent agents from other users. Every transaction on AINA is mediated with $AIN tokens, and you gain $AIN tokens as rewards from battling LLMs on Arena.

Join the ARENA Launch Event

AI Network is hosting a special 20-day event where participants can earn even more rewards with Arena. Users can receive up to 10 AIN tokens daily during the event, depending on the quality of their questions. Don’t miss out on this opportunity to be among the first to experience the power of ARENA and contribute to the future of AI development.

The event starts on June 18.

AI Network is a decentralized AI development ecosystem based on blockchain technology. Within its ecosystem, resource providers can earn $AIN tokens for their GPUs, developers can gain access to GPUs for open source AI programs, and creators can transform their AI creations into AINFTs. The ultimate goal of AI Network is to bring AI to Web3, where everyone can easily develop and utilize artificial intelligence.

If you want to know more about us,


A decentralized AI development ecosystem built on its own blockchain, AI Network seeks to become the “Internet for AI” in the Web3 era.