Falcon 40B: Redefining the Limits of Open-Source LLMs

Published in

𝐀𝐈 𝐦𝐨𝐧𝐤𝐬.𝐢𝐨

3 min read · Jun 21, 2023

Image Credits: TII (Technology Innovation Institute)

Welcome back, fellow seekers of wisdom! As you may already know, Large Language Models (LLMs) are becoming increasingly powerful thanks to extensive research and development across many organizations. In the early stages, OpenAI released models such as GPT-2 openly. However, from GPT-3 onward, including the GPT-3.5 and GPT-4 models behind ChatGPT, OpenAI chose to keep its models closed.

Despite this, many organizations and institutes are now focusing on developing open-source LLMs. As a result, the industry is placing more emphasis on open models than it did in the earlier stages of generative AI.

So today, let’s talk about a very interesting open-source LLM known as Falcon 40B.

Falcon 40B is now a top-performing model on the Hugging Face Open LLM Leaderboard. It was developed by TII (Technology Innovation Institute), which announced earlier this year that the model is “free of royalties for commercial and research use, in response to global demand for inclusive access to AI.”

The Falcon LLM

Falcon 40B has 40 billion parameters and was trained on 1 trillion tokens. Training started in December 2022, took over two months, and used 384 GPUs on AWS. The pretraining data was gathered from public web crawls and filtered to remove machine-generated text and adult content. In addition, deduplication techniques were used to clean the dataset. Altogether, the cleaned web dataset amounts to about 5 trillion tokens, of which Falcon 40B was trained on 1 trillion. Falcon’s skills were further boosted by adding carefully chosen sources to the pretraining data, including research publications and social media conversations.
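To make the deduplication step more concrete, here is a minimal sketch of exact deduplication in Python. It is not TII's actual pipeline, which the article does not describe in detail. The `normalize` and `deduplicate` helpers and the sample documents are hypothetical and only illustrate the general idea of dropping repeated documents before training.

```python
import hashlib
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial variants hash identically."""
    return re.sub(r"\s+", " ", text).strip().lower()


def deduplicate(documents):
    """Keep the first occurrence of each document, compared after normalization."""
    seen = set()
    unique = []
    for doc in documents:
        digest = hashlib.sha256(normalize(doc).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique


# Hypothetical sample data: the second entry repeats the first
# with different casing and spacing.
docs = [
    "Falcon is an open LLM.",
    "falcon   is an open  LLM.",
    "Web crawls are noisy.",
]
cleaned = deduplicate(docs)
print(len(cleaned))
```

Real web-scale pipelines usually add fuzzy (near-duplicate) matching on top of exact matching like this, since copied pages rarely match character for character.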

Falcon 40B can be used for a variety of tasks, including:
Natural language understanding
Natural language generation
Machine translation
Question answering
Text summarization

TII has also released instruct versions of the models, which are fine-tuned on instructional and conversational data.
Falcon 40B also has a smaller sibling, Falcon 7B, which comes with its own instruct variant.
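If you want to try one of these variants yourself, the sketch below shows one possible way to do it. It assumes the models are published on the Hugging Face Hub under the `tiiuae` organization and that the `transformers`, `torch`, and `accelerate` packages are installed. None of this comes from the article itself, and the `falcon_model_id` and `generate` helpers are hypothetical names used only for illustration.

```python
def falcon_model_id(size: str = "40b", instruct: bool = False) -> str:
    """Build the assumed Hugging Face Hub ID for a Falcon variant."""
    if size not in {"7b", "40b"}:
        raise ValueError("size must be '7b' or '40b'")
    suffix = "-instruct" if instruct else ""
    return f"tiiuae/falcon-{size}{suffix}"


def generate(prompt: str, size: str = "7b", instruct: bool = True,
             max_new_tokens: int = 50) -> str:
    """Generate text with a Falcon model (downloads the weights on first use)."""
    # Imported lazily so falcon_model_id works without these packages installed.
    import torch
    from transformers import pipeline

    generator = pipeline(
        "text-generation",
        model=falcon_model_id(size, instruct),
        torch_dtype=torch.bfloat16,
        device_map="auto",  # assumption: the accelerate package is available
    )
    result = generator(prompt, max_new_tokens=max_new_tokens)
    return result[0]["generated_text"]


print(falcon_model_id("40b", instruct=True))
```

Calling `generate("Explain what an LLM is.")` will download several gigabytes of weights even for the 7B model, so it is best to start with the smaller variant and a machine with a suitable GPU.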

Bard vs Falcon 40B vs ChatGPT3.5

Here are the key differences between Bard, Falcon 40B, and ChatGPT 3.5.

Conclusion

Large Language Models (LLMs) are becoming increasingly powerful thanks to extensive research and development. The open-source Falcon 40B is a top-performing model on the Hugging Face Open LLM Leaderboard. It has 40 billion parameters, was trained on 1 trillion tokens, and can be used for tasks like natural language understanding, natural language generation, machine translation, question answering, and text summarization.


Written by Bhathiya Bandara