Bark
Bark is a transformer-based text-to-audio model by Suno that generates highly realistic, multilingual speech as well as music, background noise, and simple sound effects. It also produces nonverbal cues like laughing or sighing. The model is provided for research purposes with pretrained checkpoints available for inference.
Key Information
- Category: Audio Models
- Source: Huggingface
- Tags: text-to-speech
- Last updated: January 09, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/suno/bark