AI startup Stability AI has launched Stable Audio Open Small, a “stereo” audio-generating AI mannequin that the corporate claims is the quickest in the marketplace — and environment friendly sufficient to run on smartphones.
Stable Audio Open Small is the fruit of a collaboration between Stability AI and Arm, the chipmaker that produces lots of the processors inside tablets, telephones, and different cellular gadgets. While numerous AI-powered apps can generate audio, like Suno and Udio, most depend on cloud processing, which means that they will’t be used offline.
Stability additionally claims that Stable Audio Open Small’s coaching set is made up fully of songs from the royalty-free audio libraries Free Music Archive and Freesound. That’s versus the coaching units of the aforementioned Suno and Udio, which reportedly comprise copyrighted content material, posing an IP threat.
Stable Audio Open Small is 341 million parameters in measurement and optimized to run on Arm CPUs. (Parameters, generally known as weights, are the interior elements of a mannequin that information its conduct.) Designed for rapidly producing brief audio samples and sound results (e.g., drum and instrument riffs), Stable Audio Open Small can produce as much as 11 seconds of audio on a smartphone in lower than 8 seconds, claims Stability AI.
Here’s a pattern generated by Stable Audio Open Small:
And right here’s one other one:
The mannequin isn’t with out its limitations. Stable Audio Open Small solely helps prompts written in English, and Stability notes in its documentation that the mannequin can’t generate reasonable vocals or high-quality songs. The mannequin additionally doesn’t carry out equally properly throughout musical types, Stability warns — a consequence of its Western-biased coaching information.
In one other potential wrinkle for devs, Stable Audio Open Small has considerably restrictive utilization phrases. It’s free to make use of for researchers, hobbyists, and companies with lower than $1 million in annual income, however builders and organizations making over $1 million in income must pay for Stability’s enterprise license.
Stability, the beleaguered agency behind the favored picture technology mannequin Stable Diffusion, raised new money final yr as buyers, together with Eric Schmidt and Napster founder Sean Parker, sought to show the enterprise round. Emad Mostaque, Stability’s co-founder and ex-CEO, reportedly mismanaged Stability into monetary wreck, main employees to resign, a partnership with Canva to fall by, and buyers to develop involved in regards to the firm’s prospects.
In the previous few months, Stability has employed a brand new CEO, appointed Titanic director James Cameron to its board of administrators, and launched a number of new picture technology fashions.