Blog Layout

A language model bigger than GPT-3 has arrived with a bold ambition: freeing AI from Big Tech’s clutches.

Named BLOOM, the large language model (LLM) promises a similar performance to Silicon Valley’s leading systems — but with a radically different approach to access.

While tech giants tend to keep their vaunted LLMs hidden from the public, BLOOM is available to anyone for free.

Greetings, humanoids

Subscribe to our newsletter now for a weekly recap of our favorite AI stories in your inbox.

It’s also multilingual — unlike Google’s LaMDA and OpenAI’s GPT-3 — an unusual feature in an English-dominated field.

These features could democratize access to technology that’s set to make a deep impact on society.

Powerful AI models can be trained and released in an open way.


LLMs are proving proficient at a growing range of tasks , including writing essays, generating code, and translating languages.

They’re also adept at producing harmful content — and their future capabilities are difficult to predict.

BLOOM gives researchers a unique chance to explore their risks and benefits.

“BLOOM is a demonstration that the most powerful AI models can be trained and released by the broader research community with accountability and in an actual open way, in contrast to the typical secrecy of industrial AI research labs.” said Teven Le Scao, co-lead of BLOOM’s training, in a statement.

Opening AI

LLMs are prohibitively expensive to create and run. Training GPT-3, for instance, was estimated to cost  up to $27.6 million.

Inevitably, tech companies want to protect such large investments — particularly when they provide competitive advantages.

It’s therefore unsurprising that LLMs are rarely open-sourced — with some notable exceptions.

Meta has produced the most prominent anomaly. In May, the company offered access to the 175-billion parameter OPT system .

The full model, however, is only available upon request and for non-commercial use.

BLOOM ramps up the accessibility.

The 176-billion-parameter model is available for free to any individual or institution who agrees to the system’s Responsible AI License.

Anyone can publicly view  the meeting notes, discussions, and code behind the model.

The seeds of BLOOM

BLOOM was created by BigScience, a research project that launched in early 2021. The initiative is bootstrapped and led by AI startup   Hugging Face .

“Large ML models have changed the world of AI research over the last two years but the huge compute cost necessary to train them resulted in very few teams actually having the ability to train and research them,” said Thomas Wolf, the BigScience co-lead and Hugging Face co-founder.

The training corpus aligned with our values.


The team of 100,000 researchers from more than 60 countries and 250 institutions developed BLOOM to promote inclusion and responsibility in LLMs.

They trained the model on the Jean Zay supercomputer in Paris, France.

“We adopted a data-first approach to make sure the training corpus was aligned with our values,” said Christopher Akiki, a BigScience researcher based at Leipzig University.

“The multidisciplinary and international makeup of BigScience enabled us to critically reflect on every step of the process from multiple vantage points: ethical, legal, environmental, linguistic, and technical.

“That meant we were able to mitigate ethical concerns without compromising on performance or scale.”

The size is certainly imposing. At 176 billion parameters, BLOOM is larger than OpenAI’s GPT-3 and MetaAI’s OPT. 

The model can generate text in 46 natural languages and dialects and 13 programming languages. For many of them, it’s the first-ever language model with over 100B parameters.

It’s also uniquely affordable. BigScience says researchers can use BLOOM for less than $40/hr on a cloud provider.

The model isn’t likely to compete with those built by Big Tech — but it at least provides a way to scrutinize them.

By Laurence November 21, 2022
Usually, the winners of a pitching competition are bathed with accolades, media attention, and applause. After it’s done and dusted, all they have to think about is what to spend
By Laurence November 19, 2022
Above all else, FTX advertisements wanted you to know two things: that cryptocurrency is a force for good, and that you don’t need to be an expert to buy and
By Laurence November 19, 2022
This article was originally published on .cult by Luis Minvielle. .cult is a Berlin-based community platform for developers. We write about all things career-related, make original documentaries, and share heaps
By Laurence November 18, 2022
Okay, that’s a good question. Red Crew, Blue Crew Had it not been for the heroics of three members of NASA’s specialized “Red Crew,” NASA’s absolutely massive — and incredibly
By Laurence November 18, 2022
Pharmaceutical manufacturing is closely linked to mass production. In order for medicines to be sold cheaply, they often have to be made in huge amounts. But what happens if you
By Laurence November 17, 2022
“I’m in checkmark purgatory.” Checkmate They say “don’t meet your heroes,” but what’s even worse? When your hero buys Twitter, forces you and others to start paying eight dollars per
More Posts
Share by: