What is SAM Audio?
SAM Audio is Meta’s open-source unified multimodal model for audio separation, allowing isolation of any sound from complex mixtures using text, visual, or time-span prompts.
When was SAM Audio released?
SAM Audio was officially introduced and released by Meta on December 16, 2025.
Is SAM Audio free to use?
Yes, it is completely free and open-source under the SAM License, with model checkpoints, code, and a Playground demo available for research and commercial use.
What prompts does SAM Audio support?
It supports text prompts (describe the sound), visual prompts (click on video source), and time-span prompts (select segment in timeline).
Where can I try SAM Audio?
Test it instantly in the Segment Anything Playground at aidemos.meta.com/segment-anything/editor/segment-audio, or download from GitHub/Hugging Face for local use.
What types of audio can SAM Audio separate?
It handles general sounds (e.g., traffic, barking), music (instruments/vocals), and speech (speakers from noise) from audio or video files.
What license does SAM Audio use?
Released under the SAM License, allowing both research and commercial applications with no restrictions on usage.
How does SAM Audio compare to other tools?
It sets new standards with multimodal prompting and unified handling of sounds/music/speech, outperforming previous separation models on benchmarks.




