Results for "text-to-speech"
100 matches found.
hexgrad/Kokoro-82M
Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to large...
coqui/XTTS-v2
No description available.
ResembleAI/chatterbox
No description available.
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
No description available.
SWivid/F5-TTS
No description available.
microsoft/VibeVoice-Realtime-0.5B
No description available.
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign
No description available.
parler-tts/parler-tts-mini-multilingual-v1.1
No description available.
ai4bharat/indic-parler-tts
No description available.
bosonai/higgs-audio-v2-generation-3B-base
No description available.
Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice
No description available.
Qwen/Qwen3-TTS-12Hz-0.6B-Base
No description available.
kenpath/svara-tts-v1
No description available.
myshell-ai/MeloTTS-English
No description available.
microsoft/VibeVoice-1.5B
No description available.
myshell-ai/MeloTTS-Japanese
No description available.
hypaai/Hypa_Orpheus-3b-0.1-ft-unsloth-merged_16bit
No description available.
facebook/mms-tts-hat
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
facebook/mms-tts-kor
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
sesame/csm-1b
No description available.
kyutai/tts-0.75b-en-public
The model architecture is a hierarchical Transformer that consumes tokenized text and generates audio tokenized by Mimi, see the Moshi pape...
facebook/mms-tts-eng
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
myshell-ai/MeloTTS-Chinese
No description available.
canopylabs/3b-de-ft-research_release
No description available.
SWivid/E2-TTS
No description available.
microsoft/speecht5_tts
Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-m...
maya-research/maya1
No description available.
Aratako/MioTTS-2.6B
No description available.
kugelaudio/kugelaudio-0-open
No description available.
OpenMOSS-Team/MOSS-TTS
No description available.
facebook/hf-seamless-m4t-medium
No description available.
nari-labs/Dia-1.6B
No description available.
onnx-community/Kokoro-82M-v1.0-ONNX
No description available.
mahwizzzz/orpheus-urdu-tts
The Orpheus Urdu TTS model is a fine-tuned version of the Orpheus 3B text-to-speech model specifically adapted for the Urdu language. This exper...
neuphonic/neutts-air-q4-gguf
NeuTTS Air is built off Qwen 0.5B - a lightweight yet capable language model optimised for text understanding and generation - as well as a ...
myshell-ai/MeloTTS-Spanish
No description available.
suno/bark
The following is additional information about the models released here. Bark is a series of three transformer models that turn text into aud...
kyutai/tts-1.6b-en_fr
The model architecture is a hierarchical Transformer that consumes tokenized text and generates audio tokenized by Mimi, see the Moshi pape...
Nextcloud-AI/Kokoro-82M
Kokoro models duplicated as-is from https://huggingface.co/hexgrad/Kokoro-82M for use in a local text-to-speech app....
OpenMOSS-Team/MOSS-TTS-Local-Transformer
No description available.
OpenMOSS-Team/MOSS-TTS-Realtime
No description available.
canopylabs/orpheus-3b-0.1-pretrained
No description available.
onnx-community/Kokoro-82M-ONNX
No description available.
myshell-ai/MeloTTS-Korean
No description available.
speechbrain/tts-hifigan-libritts-22050Hz
No description available.
suno/bark-small
The following is additional information about the models released here. Bark is a series of three transformer models that turn text into aud...
facebook/mms-tts-por
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
unsloth/orpheus-3b-0.1-ft
No description available.
myshell-ai/MeloTTS-French
No description available.
canopylabs/orpheus-3b-0.1-ft
No description available.
nari-labs/Dia-1.6B-0626
No description available.
OpenMOSS-Team/MOSS-TTSD-v1.0
No description available.
OuteAI/Llama-OuteTTS-1.0-1B
OuteAI outeai.com...
pnnbao-ump/VieNeu-TTS-0.3B-q4-gguf
No description available.
Misha24-10/F5-TTS_RUSSIAN
No description available.
mlx-community/Qwen3-TTS-12Hz-0.6B-Base-8bit
No description available.
mlx-community/Kokoro-82M-bf16
No description available.
IndexTeam/IndexTTS-2
No description available.
hexgrad/Kokoro-82M-v1.1-zh
🐈 GitHub: https://github.com/hexgrad/kokoro...
inclusionAI/Ming-omni-tts-0.5B
No description available.
mlx-community/Qwen3-TTS-12Hz-0.6B-CustomVoice-8bit
No description available.
ekwek/Soprano-1.1-80M
No description available.
pnnbao-ump/VieNeu-TTS-q4-gguf
No description available.
maya-research/Veena
Veena is a 3B parameter autoregressive transformer model based on the Llama architecture. It is designed to synthesize high-quality speech f...
pnnbao-ump/VieNeu-TTS-0.3B
No description available.
marksverdhai/vibevoice-7b-bnb-4bit
Base Model: vibevoice/VibeVoice-7B; Quantization: bitsandbytes NF4 (4-bit); VRAM Usage: ~6.2 GB; M...
Zyphra/Zonos-v0.1-transformer
No description available.
HKUSTAudio/Llasa-1B
No description available.
nari-labs/Dia2-2B
No description available.
Xenova/speecht5_tts
No description available.
parler-tts/parler-tts-large-v1
No description available.
neuphonic/neutts-air
NeuTTS Air is built off Qwen 0.5B - a lightweight yet capable language model optimised for text understanding and generation - as well as a ...
mlx-community/Qwen3-TTS-12Hz-1.7B-Base-bf16
No description available.
OpenMOSS-Team/MOSS-VoiceGenerator
No description available.
parler-tts/parler-tts-mini-v1
No description available.
pnnbao-ump/VieNeu-TTS
No description available.
vibevoice/VibeVoice-1.5B
No description available.
akh99/veena-hinglish
Veena Hinglish is a LoRA fine-tuned text-to-speech model optimized for generating natural-sounding speech in Hinglish. The model uses SNAC (...
unsloth/csm-1b
See our collection for all our TTS model uploads. Learn to fine-tune TTS models - Read our Guide. Unsloth Dynamic 2.0 achieves superior accu...
vibevoice/VibeVoice-7B
No description available.
canopylabs/3b-ko-ft-research_release
No description available.
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit
No description available.
facebook/mms-tts-tgl
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
ai4bharat/IndicF5
No description available.
Supertone/supertonic
No description available.
aoi-ot/VibeVoice-Large
No description available.
neuphonic/neutts-nano-q4-gguf
NeuTTS Nano models are designed for maximum speed per parameter while retaining strong speaker similarity and naturalness: - Backbone: compa...
facebook/mms-tts-hin
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
saheedniyi/YarnGPT2
Developed by: Saheedniyi; Model type: Text-to-Speech; Language(s) (NLP): English -> Nigerian Accented English; Finetuned from: HuggingFac...
Supertone/supertonic-2
No description available.
openbmb/VoxCPM1.5
No description available.
neuphonic/neutts-nano
NeuTTS Nano models are designed for maximum speed per parameter while retaining strong speaker similarity and naturalness: - Backbone: compa...
OuteAI/Llama-OuteTTS-1.0-1B-GGUF
OuteAI outeai.com...
FunAudioLLM/Fun-CosyVoice3-0.5B-2512
No description available.
fishaudio/s1-mini
No description available.
cocktailpeanut/oa
No description available.
cartesia/sesame-csm-1b-gguf
No description available.
facebook/mms-tts-orm
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
facebook/mms-tts-kir
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
fishaudio/fish-speech-1.5
No description available.