Results for "text-to-audio"

86 matches found.

facebook

facebook/musicgen-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,369,571
facebook

facebook/musicgen-small

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 153,154
ACE-Step

ACE-Step/Ace-Step1.5

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 50,024
stabilityai

stabilityai/stable-audio-open-1.0

`Stable Audio Open 1.0` generates variable-length (up to 47s) stereo audio at 44.1kHz from text prompts. It comprises three components: an a...

✨ text-to-audio 25,927
facebook

facebook/musicgen-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 20,106
razhan

razhan/mms-tts-ckb

No description available.

✨ text-to-audio 18,222
ACE-Step

ACE-Step/acestep-5Hz-lm-0.6B

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 9,995
ACE-Step

ACE-Step/acestep-v15-base

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 8,274
ACE-Step

ACE-Step/acestep-5Hz-lm-4B

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 7,752
ACE-Step

ACE-Step/ACE-Step-v1-chinese-rap-LoRA

ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holist...

✨ text-to-audio 5,895
OpenMOSS-Team

OpenMOSS-Team/MOSS-SoundEffect

    ...

✨ text-to-audio 5,695
ACE-Step

ACE-Step/acestep-v15-sft

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 4,667
ylacombe

ylacombe/musicgen-melody

No description available.

✨ text-to-audio 4,232
ACE-Step

ACE-Step/acestep-captioner

No description available.

✨ text-to-audio 4,097
facebook

facebook/musicgen-melody

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 3,762
HeartMuLa

HeartMuLa/HeartMuLa-oss-3B

No description available.

✨ text-to-audio 3,683
HeartMuLa

HeartMuLa/HeartMuLa-oss-3B-happy-new-year

The best open-sourced music generation model in terms of lyrics controllability and music quality....

✨ text-to-audio 3,475
Xenova

Xenova/musicgen-small

No description available.

✨ text-to-audio 3,130
stabilityai

stabilityai/stable-audio-open-small

`Stable Audio Open Small` generates variable-length (up to 11s) stereo audio at 44.1kHz from text prompts. It comprises three components: an...

✨ text-to-audio 3,093
mradermacher

mradermacher/zen-musician-i1-GGUF

No description available.

✨ text-to-audio 3,089
slseanwu

slseanwu/MIDI-LLM_Llama-3.2-1B

Base Model: `meta-llama/Llama-3.2-1B` - Model Size: 1.4B parameters - Extended Vocabulary: 183,286 tokens (128,256 for text + 55,030 for MID...

✨ text-to-audio 3,026
ACE-Step

ACE-Step/acestep-v15-turbo-shift3

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 2,713
facebook

facebook/musicgen-stereo-small

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 2,552
declare-lab

declare-lab/mustango

No description available.

✨ text-to-audio 2,235
eustlb

eustlb/higgs-audio-v2-generation-3B-base

No description available.

✨ text-to-audio 2,144
ACE-Step

ACE-Step/acestep-v15-turbo-continuous

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 2,044
FabioSarracino

FabioSarracino/VibeVoice-Large-Q8

No description available.

✨ text-to-audio 1,768
facebook

facebook/musicgen-stereo-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,767
ACE-Step

ACE-Step/acestep-v15-turbo-shift1

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 1,753
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 1,715
calcuis

calcuis/ace-gguf

Base model from ace-step; full set GGUF (model + encoder + VAE) works right away...

✨ text-to-audio 1,629
ylacombe

ylacombe/musicgen-stereo-melody

No description available.

✨ text-to-audio 1,607
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 1,574
facebook

facebook/musicgen-stereo-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,477
espnet

espnet/fastspeech2_conformer

The FastSpeech2Conformer model was proposed with the paper Recent Developments On Espnet Toolkit Boosted By Conformer by Pengcheng Guo, Flor...

✨ text-to-audio 1,277
riffusion

riffusion/riffusion-model-v1

Developed by: Seth Forsgren, Hayk Martiros - Model type: Diffusion-based text-to-image generation model - Language(s): English - License: Th...

✨ text-to-audio 1,248
2Noise

2Noise/ChatTTS

No description available.

✨ text-to-audio 1,093
echarlaix

echarlaix/tiny-random-vits

No description available.

✨ text-to-audio 1,017
mingyi456

mingyi456/Ace-Step1.5-DF11-ComfyUI

For more information (including how to compress models yourself), check out https://huggingface.co/DFloat11 and https://github.com/LeanModel...

✨ text-to-audio 758
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1-transformers

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 629
LiquidAI

LiquidAI/LFM2.5-Audio-1.5B-ONNX

No description available.

✨ text-to-audio 590
facebook

facebook/magnet-small-10secs

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 554
sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-NT-va-acoustic

No description available.

✨ text-to-audio 497
mradermacher

mradermacher/zen-musician-GGUF

No description available.

✨ text-to-audio 475
CypressYang

CypressYang/SongBloom

No description available.

✨ text-to-audio 459
declare-lab

declare-lab/TangoFlux

TangoFlux consists of FluxTransformer blocks which are Diffusion Transformer (DiT) and Multimodal Diffusion Transformer (MMDiT), conditioned...

✨ text-to-audio 447
nateraw

nateraw/musicgen-songstarter-v0.2

No description available.

✨ text-to-audio 432
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-MLX-6bit

No description available.

✨ text-to-audio 397
2121-8

2121-8/japanese-parler-tts-mini-bate

No description available.

✨ text-to-audio 377
Marvis-AI

Marvis-AI/marvis-tts-100m-v0.2

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 367
benjiaiplayground

benjiaiplayground/HeartMuLa-oss-3B-bf16

No description available.

✨ text-to-audio 364
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1-MLX-8bit

No description available.

✨ text-to-audio 350
benjiaiplayground

benjiaiplayground/HeartCodec-oss-bf16

No description available.

✨ text-to-audio 343
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-transformers

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 335
Matthijs

Matthijs/mms-tts-eng

No description available.

✨ text-to-audio 330
facebook

facebook/magnet-medium-30secs

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 330
tencent

tencent/SongGeneration

No description available.

✨ text-to-audio 328
HKUSTAudio

HKUSTAudio/AudioX-MAF

No description available.

✨ text-to-audio 319
HKUSTAudio

HKUSTAudio/AudioX-MAF-MMDiT

No description available.

✨ text-to-audio 294
espnet

espnet/fastspeech2_conformer_with_hifigan

No description available.

✨ text-to-audio 287
Beehzod

Beehzod/speechT5_tts_uzbek

More information needed...

✨ text-to-audio 265
facebook

facebook/musicgen-melody-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 261
Lingalingeswaran

Lingalingeswaran/facebook_mms_tamil

No description available.

✨ text-to-audio 251
Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-MLX-8bit

No description available.

✨ text-to-audio 240
eustlb

eustlb/higgs-v2-archive

No description available.

✨ text-to-audio 228
facebook

facebook/audio-magnet-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 221
atul10

atul10/nepali_male_v1

Nepali language. VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthes...

✨ text-to-audio 216
tencent

tencent/HunyuanVideo-Foley

No description available.

✨ text-to-audio 213
bcruz

bcruz/MIDI-LLM_Llama-3.2-1B-Q4_K_M-GGUF

No description available.

✨ text-to-audio 211
ford442

ford442/stable-audio-open-1.0

`Stable Audio Open 1.0` generates variable-length (up to 47s) stereo audio at 44.1kHz from text prompts. It comprises three components: an a...

✨ text-to-audio 190
ychenqz

ychenqz/emotion_classifier

No description available.

✨ text-to-audio 184
sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1-pure

More information needed...

✨ text-to-audio 182
CypressYang

CypressYang/SongBloom_long

No description available.

✨ text-to-audio 170
mradermacher

mradermacher/CiSiMi-GGUF

No description available.

✨ text-to-audio 168
suhaibrashid17

suhaibrashid17/MMS_TTS_Urdu_3

No description available.

✨ text-to-audio 168
froabera

froabera/speecht5_finetuned

More information needed...

✨ text-to-audio 168
Urabewe

Urabewe/Ace-Step-Captioner-fp8

ACE-Step Captioner is the annotation model used by ACE-Step v1.5 for training data labeling. It is a professional-grade music ca...

✨ text-to-audio 165
Marvis-AI

Marvis-AI/marvis-tts-100m-v0.2-MLX-6bit

No description available.

✨ text-to-audio 161
sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1blend-0.7

More information needed...

✨ text-to-audio 157
alakxender

alakxender/mms-tts-div-ft-spk01-f01

Model ID: `alakxender/mms-tts-div-ft-spk01-f01` - Base Architecture: MMS-TTS (VITS) - Language: Divehi (dv)...

✨ text-to-audio 156
KandirResearch

KandirResearch/CiSiMi-v0.1

No description available.

✨ text-to-audio 152
Omarrran

Omarrran/turkish_finetuned_speecht5_tts

No description available.

✨ text-to-audio 151
ManuD

ManuD/speecht5_finetuned_voxpopuli_de_Merkel

More information needed...

✨ text-to-audio 149
MuzaffarSharofitdinov

MuzaffarSharofitdinov/mms-tts-uzbek-qiz-ovozi_v2

No description available.

✨ text-to-audio 148
Nekochu

Nekochu/stable-audio-open-1.0-Music

No description available.

✨ text-to-audio 147
sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1blend

More information needed...

✨ text-to-audio 143