Bark

Category: Audio Models
Source: GitHub
Last updated: January 09, 2026

Bark is an open-source, transformer-based generative audio model from Suno that converts text prompts into realistic, multilingual speech and other audio, including music, background noise, and nonverbal sounds such as laughter and sighs. It is available for both research and commercial use and runs on GPU or CPU, although inference is typically much faster on a GPU.
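
As a rough illustration of the text-to-audio workflow, the sketch below wraps Bark's Python entry points in a small helper. It assumes the `bark` package and SciPy are installed. The helper name `synthesize`, the `PROMPT` constant, and the output filename are made up for this example, while `preload_models`, `generate_audio`, and `SAMPLE_RATE` follow the usage shown in the project's README at the time of writing and may change.

```python
def synthesize(prompt: str, out_path: str = "bark_generation.wav") -> str:
    """Generate audio for `prompt` with Bark and save it as a WAV file.

    Hypothetical helper for illustration; not part of Bark itself.
    """
    # Imported lazily so this module still loads where Bark is not installed.
    from bark import SAMPLE_RATE, generate_audio, preload_models
    from scipy.io.wavfile import write as write_wav

    preload_models()                      # fetches and caches model weights on first use
    audio_array = generate_audio(prompt)  # NumPy array of audio samples
    write_wav(out_path, SAMPLE_RATE, audio_array)
    return out_path


# Nonverbal cues are written inline in the prompt text, for example [laughs].
PROMPT = "Hello, my name is Suno. [laughs] And I like pizza."
```

Calling `synthesize(PROMPT)` would write the generated clip to `bark_generation.wav` in the working directory. The first call can take a while, because the model weights have to be downloaded.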