Nirman.online | Premium AI Directory

facebook

facebook/musicgen-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,369,571

facebook

facebook/musicgen-small

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 153,154

ACE-Step

ACE-Step/Ace-Step1.5

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 50,024

stabilityai

stabilityai/stable-audio-open-1.0

`Stable Audio Open 1.0` generates variable-length (up to 47s) stereo audio at 44.1kHz from text prompts. It comprises three components: an a...

✨ text-to-audio 25,927

facebook

facebook/musicgen-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 20,106

razhan

razhan/mms-tts-ckb

No description available.

✨ text-to-audio 18,222

ACE-Step

ACE-Step/acestep-5Hz-lm-0.6B

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 9,995

ACE-Step

ACE-Step/acestep-v15-base

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 8,274

ACE-Step

ACE-Step/acestep-5Hz-lm-4B

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 7,752

ACE-Step

ACE-Step/ACE-Step-v1-chinese-rap-LoRA

ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holist...

✨ text-to-audio 5,895

OpenMOSS-Team

OpenMOSS-Team/MOSS-SoundEffect

No description available.

✨ text-to-audio 5,695

ACE-Step

ACE-Step/acestep-v15-sft

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 4,667

ylacombe

ylacombe/musicgen-melody

No description available.

✨ text-to-audio 4,232

ACE-Step

ACE-Step/acestep-captioner

No description available.

✨ text-to-audio 4,097

facebook

facebook/musicgen-melody

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 3,762

HeartMuLa

HeartMuLa/HeartMuLa-oss-3B

No description available.

✨ text-to-audio 3,683

HeartMuLa

HeartMuLa/HeartMuLa-oss-3B-happy-new-year

The best open-sourced music generation model in terms of lyrics controllability and music quality....

✨ text-to-audio 3,475

Xenova

Xenova/musicgen-small

No description available.

✨ text-to-audio 3,130

stabilityai

stabilityai/stable-audio-open-small

`Stable Audio Open Small` generates variable-length (up to 11s) stereo audio at 44.1kHz from text prompts. It comprises three components: an...

✨ text-to-audio 3,093

mradermacher

mradermacher/zen-musician-i1-GGUF

No description available.

✨ text-to-audio 3,089

slseanwu

slseanwu/MIDI-LLM_Llama-3.2-1B

Base Model: `meta-llama/Llama-3.2-1B` - Model Size: 1.4B parameters - Extended Vocabulary: 183,286 tokens (128,256 for text + 55,030 for MID...

✨ text-to-audio 3,026

ACE-Step

ACE-Step/acestep-v15-turbo-shift3

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 2,713

facebook

facebook/musicgen-stereo-small

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 2,552

declare-lab

declare-lab/mustango

No description available.

✨ text-to-audio 2,235

eustlb

eustlb/higgs-audio-v2-generation-3B-base

No description available.

✨ text-to-audio 2,144

ACE-Step

ACE-Step/acestep-v15-turbo-continuous

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 2,044

FabioSarracino

FabioSarracino/VibeVoice-Large-Q8

No description available.

✨ text-to-audio 1,768

facebook

facebook/musicgen-stereo-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,767

ACE-Step

ACE-Step/acestep-v15-turbo-shift1

🚀 ACE-Step v1.5 is a highly efficient open-source music foundation model designed to bring commercial-grade music generation to consumer har...

✨ text-to-audio 1,753

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 1,715

calcuis

calcuis/ace-gguf

- base model from ace-step - full set gguf (model+encoder+vae) works right away...

✨ text-to-audio 1,629

ylacombe

ylacombe/musicgen-stereo-melody

No description available.

✨ text-to-audio 1,607

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 1,574

facebook

facebook/musicgen-stereo-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 1,477

espnet

espnet/fastspeech2_conformer

The FastSpeech2Conformer model was proposed with the paper Recent Developments On Espnet Toolkit Boosted By Conformer by Pengcheng Guo, Flor...

✨ text-to-audio 1,277

riffusion

riffusion/riffusion-model-v1

Developed by: Seth Forsgren, Hayk Martiros - Model type: Diffusion-based text-to-image generation model - Language(s): English - License: Th...

✨ text-to-audio 1,248

2Noise

2Noise/ChatTTS

No description available.

✨ text-to-audio 1,093

echarlaix

echarlaix/tiny-random-vits

No description available.

✨ text-to-audio 1,017

mingyi456

mingyi456/Ace-Step1.5-DF11-ComfyUI

For more information (including how to compress models yourself), check out https://huggingface.co/DFloat11 and https://github.com/LeanModel...

✨ text-to-audio 758

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1-transformers

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 629

LiquidAI

LiquidAI/LFM2.5-Audio-1.5B-ONNX

No description available.

✨ text-to-audio 590

facebook

facebook/magnet-small-10secs

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 554

sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-NT-va-acoustic

No description available.

✨ text-to-audio 497

mradermacher

mradermacher/zen-musician-GGUF

No description available.

✨ text-to-audio 475

CypressYang

CypressYang/SongBloom

No description available.

✨ text-to-audio 459

declare-lab

declare-lab/TangoFlux

TangoFlux consists of FluxTransformer blocks which are Diffusion Transformer (DiT) and Multimodal Diffusion Transformer (MMDiT), conditioned...

✨ text-to-audio 447

nateraw

nateraw/musicgen-songstarter-v0.2

No description available.

✨ text-to-audio 432

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-MLX-6bit

No description available.

✨ text-to-audio 397

2121-8

2121-8/japanese-parler-tts-mini-bate

No description available.

✨ text-to-audio 377

Marvis-AI

Marvis-AI/marvis-tts-100m-v0.2

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 367

benjiaiplayground

benjiaiplayground/HeartMuLa-oss-3B-bf16

No description available.

✨ text-to-audio 364

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.1-MLX-8bit

No description available.

✨ text-to-audio 350

benjiaiplayground

benjiaiplayground/HeartCodec-oss-bf16

No description available.

✨ text-to-audio 343

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-transformers

Marvis is built on the Sesame CSM-1B (Conversational Speech Model) architecture, a multimodal transformer that operates directly on Residual...

✨ text-to-audio 335

Matthijs

Matthijs/mms-tts-eng

No description available.

✨ text-to-audio 330

facebook

facebook/magnet-medium-30secs

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 330

tencent

tencent/SongGeneration

No description available.

✨ text-to-audio 328

HKUSTAudio

HKUSTAudio/AudioX-MAF

No description available.

✨ text-to-audio 319

HKUSTAudio

HKUSTAudio/AudioX-MAF-MMDiT

No description available.

✨ text-to-audio 294

espnet

espnet/fastspeech2_conformer_with_hifigan

No description available.

✨ text-to-audio 287

Beehzod

Beehzod/speechT5_tts_uzbek

More information needed...

✨ text-to-audio 265

facebook

facebook/musicgen-melody-large

Organization developing the model: The FAIR team of Meta AI. Model date: MusicGen was trained between April 2023 and May 2023. Model version...

✨ text-to-audio 261

Lingalingeswaran

Lingalingeswaran/facebook_mms_tamil

No description available.

✨ text-to-audio 251

Marvis-AI

Marvis-AI/marvis-tts-250m-v0.2-MLX-8bit

No description available.

✨ text-to-audio 240

eustlb

eustlb/higgs-v2-archive

No description available.

✨ text-to-audio 228

facebook

facebook/audio-magnet-medium

Organization developing the model: The FAIR team of Meta AI. Model date: MAGNeT was trained between November 2023 and January 2024. Model ve...

✨ text-to-audio 221

atul10

atul10/nepali_male_v1

Language: Nepali. VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthes...

✨ text-to-audio 216

tencent

tencent/HunyuanVideo-Foley

No description available.

✨ text-to-audio 213

bcruz

bcruz/MIDI-LLM_Llama-3.2-1B-Q4_K_M-GGUF

No description available.

✨ text-to-audio 211

ford442

ford442/stable-audio-open-1.0

`Stable Audio Open 1.0` generates variable-length (up to 47s) stereo audio at 44.1kHz from text prompts. It comprises three components: an a...

✨ text-to-audio 190

ychenqz

ychenqz/emotion_classifier

No description available.

✨ text-to-audio 184

sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1-pure

More information needed...

✨ text-to-audio 182

CypressYang

CypressYang/SongBloom_long

No description available.

✨ text-to-audio 170

mradermacher

mradermacher/CiSiMi-GGUF

No description available.

✨ text-to-audio 168

suhaibrashid17

suhaibrashid17/MMS_TTS_Urdu_3

No description available.

✨ text-to-audio 168

froabera

froabera/speecht5_finetuned

More information needed...

✨ text-to-audio 168

Urabewe

Urabewe/Ace-Step-Captioner-fp8

ACE-Step Captioner is the annotation model used by ACE-Step v1.5 for training data labeling. It is a professional-grade music ca...

✨ text-to-audio 165

Marvis-AI

Marvis-AI/marvis-tts-100m-v0.2-MLX-6bit

No description available.

✨ text-to-audio 161

sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1blend-0.7

More information needed...

✨ text-to-audio 157

alakxender

alakxender/mms-tts-div-ft-spk01-f01

Model ID: `alakxender/mms-tts-div-ft-spk01-f01` - Base Architecture: MMS-TTS (VITS) - Language: Divehi (dv)...

✨ text-to-audio 156

KandirResearch

KandirResearch/CiSiMi-v0.1

No description available.

✨ text-to-audio 152

Omarrran

Omarrran/turkish_finetuned_speecht5_tts

No description available.

✨ text-to-audio 151

ManuD

ManuD/speecht5_finetuned_voxpopuli_de_Merkel

More information needed...

✨ text-to-audio 149

MuzaffarSharofitdinov

MuzaffarSharofitdinov/mms-tts-uzbek-qiz-ovozi_v2

No description available.

✨ text-to-audio 148

Nekochu

Nekochu/stable-audio-open-1.0-Music

No description available.

✨ text-to-audio 147

sil-ai

sil-ai/senga-nt-asr-inferred-force-aligned-speecht5-MAT-l1blend

More information needed...

✨ text-to-audio 143
