StableLM: What to know about Stability AI's language model

Do not expect ChatGPT.
By Cecily Mauran and Mike Pearl  on 
A phone with the Stability AI logo displayed
Credit: SOPA Images / Getty

Move over GPT-4, there's a new language model in town! But don't move too far, because the chatbot powered by this model is...scarily bad.

On Wednesday, Stability AI launched its own language called StableLM. The company, known for its AI image generator called Stable Diffusion, now has an open-source language model that generates text and code. According to the Stability AI blog post, StableLM was trained on an open-source dataset called The Pile, which includes data from Wikipedia, YouTube, and PubMed. However, Stability AI says its dataset is three times larger than that of The Pile with "1.5 trillion tokens of content."

So how does it stack up against ChatGPT? So badly that we hope it's not meant to be comparable. The truth value of its outputs is practically nonexistent. Below, for instance, you'll notice it claims that on January 6, 2021, Trump supporters took control of the legislature. That's some dangerously confusing misinformation about a recent event.

StableLM's answer about January 6
Credit: Hugging Face / Stability AI

A common test for language models used by Mashable is one in which we check how capable and willing it is to satisfy an ethically questionable prompt asking for a news story about Tupac Shakur. The results for StableLM when given this test are enlightening. The model fails to write a convincing news story, which isn't necessarily a bad thing, but it also fails to recognize the basic contours of what it's being prompted to do, and doesn't "know" who Tupac Shakur is.

Mashable Light Speed
Want more out-of-this world tech, space and science stories?
Sign up for Mashable's weekly Light Speed newsletter.
By signing up you agree to our Terms of Use and Privacy Policy.
Thanks for signing up!
StableLM's news story
Credit: Hugging Face / Stability AI

To be generous, this kind of text-generation doesn't appear to be the intended use for StableLM, but when asked "What does StableLM do?" its response was an underwhelming two short sentences containing some technical jargon: "It is primarily used as a decision support system in systems engineering and architecture, and can also be used in statistical learning, reinforcement learning, and other areas."

StableLM lacks guardrails for sensitive content

Also of concern is the model's apparent lack of guardrails for certain sensitive content. Most notably, it falls on its face when given the famous "don't praise Hitler" test. The kindest thing one could say about StableLM's response to this test is that it's nonsensical.

StableLM's response to a prompt
Credit: Hugging Face / Stability AI

But here are some things to keep in mind before anyone calls this "the worst language model ever": It's open source, so this particular "black box" AI allows anyone to peek inside the box and see what the potential causes of its problems are. Also, the version of StableLM released today is in Alpha mode, the earliest stage of testing. It contains between 3 and 7 billion parameters, which are variables that determine how the model predicts content, and Stability AI plans to release more models with larger parameters of up to 65 billion. If that sounds like a lot, it's a relatively small amount. For context, OpenAI's GPT-3 has 175 billion parameters, so StableLM has a lot of catching up to do — if that is indeed the plan.

How to try StableLM right now

The code for StableLM is currently available on GitHub, and Hugging Face hosts a version that has a user-friendly front end with the extremely catchy name "StableLM-Tuned-Alpha-7b Chat." The Hugging-Face-hosted version works like a chatbot, though a somewhat slow one.

So now that you know its limitations, feel free to try it for yourself.

Mashable Image
Cecily Mauran

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on Twitter at @cecily_mauran.


Recommended For You

Microsoft launches Phi-3 Mini, a tiny AI model that packs a punch
a futuristic rendering of a brain on top of a computing chip

IOS 18 tipped to get 3 new AI features, including Siri summarizing your texts
iOS 18 illustrative collage


AI might be influencing your vote this election. How to spot and respond to it.
An illustration of an American flag. The stars are replaced by white computer monitors and the bottom right corner is on fire.

Trending on Mashable
NYT Connections today: See hints and answers for May 5
A phone displaying the New York Times game 'Connections.'

'Wordle' today: Here's the answer hints for May 5
a phone displaying Wordle

NYT Connections today: See hints and answers for May 4
A phone displaying the New York Times game 'Connections.'

NYT's The Mini crossword answers for May 5
Closeup view of crossword puzzle clues

The best movies on Prime Video right now
By Kimber Myers
An Asian woman holds a sword; a boy rides a dragon; a Black man in a suit faces forward
The biggest stories of the day delivered to your inbox.
This newsletter may contain advertising, deals, or affiliate links. Subscribing to a newsletter indicates your consent to our Terms of Use and Privacy Policy. You may unsubscribe from the newsletters at any time.
Thanks for signing up. See you at your inbox!