Results for "audio-to-audio"

71 matches found.

nvidia

nvidia/bigvgan_v2_22khz_80band_256x

No description available.

🎛️ audio-to-audio 1,445,270
nvidia

nvidia/bigvgan_v2_44khz_128band_512x

No description available.

🎛️ audio-to-audio 702,547
nvidia

nvidia/personaplex-7b-v1

No description available.

🎛️ audio-to-audio 510,115
Qwen

Qwen/Qwen3-TTS-Tokenizer-12Hz

No description available.

🎛️ audio-to-audio 135,104
neuphonic

neuphonic/neucodec

NeuCodec is a Finite Scalar Quantisation (FSQ) based 0.8kbps audio codec for speech tokenization. It takes advantage of the following featur...

🎛️ audio-to-audio 107,499
Aratako

Aratako/MioCodec-25Hz-24kHz

No description available.

🎛️ audio-to-audio 95,354
JacobLinCool

JacobLinCool/MP-SENet-DNS

No description available.

🎛️ audio-to-audio 47,338
neuphonic

neuphonic/distill-neucodec

Distill-NeuCodec is a version of NeuCodec with a compatible, distilled encoder. The distilled encoder is 10x smaller in parameter count and ...

🎛️ audio-to-audio 45,492
HKUSTAudio

HKUSTAudio/xcodec2

No description available.

🎛️ audio-to-audio 27,677
speechbrain

speechbrain/metricgan-plus-voicebank

No description available.

🎛️ audio-to-audio 25,278
neuphonic

neuphonic/neucodec-onnx-decoder

No description available.

🎛️ audio-to-audio 20,412
nvidia

nvidia/bigvgan_v2_24khz_100band_256x

No description available.

🎛️ audio-to-audio 7,565
NandemoGHS

NandemoGHS/Anime-XCodec2

No description available.

🎛️ audio-to-audio 7,259
JorisCos

JorisCos/DCCRNet_Libri1Mix_enhsingle_16k

No description available.

🎛️ audio-to-audio 5,522
Aratako

Aratako/MioCodec-25Hz-44.1kHz-v2

No description available.

🎛️ audio-to-audio 4,926
neuphonic

neuphonic/neucodec-onnx-decoder-int8

No description available.

🎛️ audio-to-audio 4,092
JorisCos

JorisCos/ConvTasNet_Libri2Mix_sepnoisy_16k

No description available.

🎛️ audio-to-audio 3,763
speechbrain

speechbrain/sepformer-wsj02mix

- Source Separation - Speech Separation - Audio Source Separation - WSJ02Mix - SepFormer - Transformer - audio-to-audio - audio-source-separ...

🎛️ audio-to-audio 2,945
mpariente

mpariente/DPRNNTasNet-ks2_WHAM_sepclean

No description available.

🎛️ audio-to-audio 2,921
NandemoGHS

NandemoGHS/Anime-XCodec2-44.1kHz-v2

No description available.

🎛️ audio-to-audio 2,191
hf-audio

hf-audio/xcodec2

No description available.

🎛️ audio-to-audio 1,993
nvidia

nvidia/bigvgan_22khz_80band

No description available.

🎛️ audio-to-audio 1,738
nvidia

nvidia/bigvgan_base_22khz_80band

No description available.

🎛️ audio-to-audio 1,596
nvidia

nvidia/bigvgan_v2_22khz_80band_fmax8k_256x

No description available.

🎛️ audio-to-audio 1,577
microsoft

microsoft/speecht5_vc

Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-m...

🎛️ audio-to-audio 1,535
JorisCos

JorisCos/ConvTasNet_Libri2Mix_sepclean_16k

No description available.

🎛️ audio-to-audio 1,513
JorisCos

JorisCos/ConvTasNet_Libri2Mix_sepclean_8k

No description available.

🎛️ audio-to-audio 1,504
Mungert

Mungert/LFM2.5-Audio-1.5B-GGUF

No description available.

🎛️ audio-to-audio 1,282
LiquidAI

LiquidAI/LFM2.5-Audio-1.5B

No description available.

🎛️ audio-to-audio 1,269
aufklarer

aufklarer/PersonaPlex-7B-MLX-4bit

| Component | Architecture | Size | |---| | Temporal Transformer | 32-layer, 4096d, 32 heads (7B params) | ~3.5 GB (4-bit) | | Depformer | 6...

🎛️ audio-to-audio 1,228
kyutai

kyutai/hibiki-zero-3b-pytorch-bf16

This is the model simply referred to as Hibiki-Zero in our [paper][paper], a 3B-parameter hierarchical Transformer producing speech and text...

🎛️ audio-to-audio 1,219
lucadellalib

lucadellalib/focalcodec_50hz

No description available.

🎛️ audio-to-audio 1,210
llm-jp

llm-jp/Llama-Mimi-1.3B

No description available.

🎛️ audio-to-audio 882
speechbrain

speechbrain/sepformer-whamr16k

- audio-to-audio - audio-source-separation - Source Separation - Speech Separation - WHAM! - SepFormer - Transformer - pytorch - speechbrain...

🎛️ audio-to-audio 851
YatharthS

YatharthS/LavaSR

No description available.

🎛️ audio-to-audio 833
JusperLee

JusperLee/TIGER-speech

No description available.

🎛️ audio-to-audio 726
julien-c

julien-c/DPRNNTasNet-ks16_WHAM_sepclean

No description available.

🎛️ audio-to-audio 621
speechbrain

speechbrain/mtl-mimic-voicebank

No description available.

🎛️ audio-to-audio 443
JusperLee

JusperLee/Dolphin

Dolphin is an efficient audio-visual speech separation model that extracts target speech from noisy environments by combining acoustic and v...

🎛️ audio-to-audio 439
nvidia

nvidia/bigvgan_v2_44khz_128band_256x

No description available.

🎛️ audio-to-audio 424
ktvoice

ktvoice/Codec

No description available.

🎛️ audio-to-audio 416
JusperLee

JusperLee/TIGER-DnR

No description available.

🎛️ audio-to-audio 403
lucadellalib

lucadellalib/focalcodec_50hz_2k_causal

No description available.

🎛️ audio-to-audio 375
lucadellalib

lucadellalib/focalcodec_50hz_4k_causal

No description available.

🎛️ audio-to-audio 371
YatharthS

YatharthS/NovaSR

No description available.

🎛️ audio-to-audio 363
chenmozhijin

chenmozhijin/BSRoformer-GGUF

Official GGUF model repository for the BSRoformer.cpp project. This repository contains BS Roformer/Mel-Band-Roformer models converted to th...

🎛️ audio-to-audio 356
JorisCos

JorisCos/ConvTasNet_Libri3Mix_sepclean_8k

No description available.

🎛️ audio-to-audio 328
mlx-community

mlx-community/LFM2.5-Audio-1.5B-bf16

No description available.

🎛️ audio-to-audio 314
speechbrain

speechbrain/sepformer-libri2mix

- Source Separation - Speech Separation - Audio Source Separation - Libri2Mix - SepFormer - Transformer - audio-to-audio - audio-source-sepa...

🎛️ audio-to-audio 309
JorisCos

JorisCos/DCUNet_Libri1Mix_enhsingle_16k

No description available.

🎛️ audio-to-audio 307
lucadellalib

lucadellalib/dycast

No description available.

🎛️ audio-to-audio 306
JorisCos

JorisCos/ConvTasNet_Libri3Mix_sepnoisy_8k

No description available.

🎛️ audio-to-audio 305
Ceva-IP

Ceva-IP/DPDFNet

No description available.

🎛️ audio-to-audio 290
nvidia

nvidia/bigvgan_24khz_100band

No description available.

🎛️ audio-to-audio 276
Aratako

Aratako/MioCodec-25Hz-44.1kHz

No description available.

🎛️ audio-to-audio 252
speechbrain

speechbrain/sepformer-wham16k-enhancement

- audio-to-audio - Speech Enhancement - WHAM! - SepFormer - Transformer - pytorch - speechbrain - WHAM - SI-SNR - PESQ...

🎛️ audio-to-audio 251
maitrix-org

maitrix-org/Voila-Tokenizer

No description available.

🎛️ audio-to-audio 233
maitrix-org

maitrix-org/Voila-chat

No description available.

🎛️ audio-to-audio 233
mispeech

mispeech/dashengtokenizer

No description available.

🎛️ audio-to-audio 229
speechbrain

speechbrain/sepformer-whamr

- speechbrain - Source Separation - Speech Separation - Audio Source Separation - WHAM! - SepFormer - Transformer - audio-to-audio - audio-s...

🎛️ audio-to-audio 227
speechbrain

speechbrain/sepformer-dns4-16k-enhancement

No description available.

🎛️ audio-to-audio 215
speechbrain

speechbrain/sepformer-wham

- audio-to-audio - audio-source-separation - Source Separation - Speech Separation - Audio Source Separation - WHAM! - SepFormer - Transform...

🎛️ audio-to-audio 209
mpariente

mpariente/ConvTasNet_WHAM_sepclean

No description available.

🎛️ audio-to-audio 196
NandemoGHS

NandemoGHS/Anime-XCodec2-44.1kHz

No description available.

🎛️ audio-to-audio 191
HiDolen

HiDolen/Mini-BS-RoFormer-V2-46.8M

Total model parameters: 46.8M; weight precision: BF16. Performance on the val split of MUSDB18HQ (in SDR, higher is better): | tracks | Mini-BS-RoFormer-V2-46.8M | Mini-BS-RoFormer-18M | Mini-BS-RoForm...

🎛️ audio-to-audio 184
LiquidAI

LiquidAI/LFM2-Audio-1.5B

No description available.

🎛️ audio-to-audio 182
patriotyk

patriotyk/vocos-mel-hifigan-compat-44100khz

Vocos is a fast neural vocoder designed to synthesize audio waveforms from acoustic features. Unlike other typical GAN-based vocoders, Vocos...

🎛️ audio-to-audio 182
speechbrain

speechbrain/sepformer-whamr-enhancement

- audio-to-audio - Speech Enhancement - WHAMR! - SepFormer - Transformer - pytorch - speechbrain - SI-SNR - PESQ...

🎛️ audio-to-audio 180
lucadellalib

lucadellalib/focalcodec_12_5hz

No description available.

🎛️ audio-to-audio 174
speechbrain

speechbrain/sepformer-wham-enhancement

- audio-to-audio - Speech Enhancement - WHAM! - SepFormer - Transformer - pytorch - speechbrain - SI-SNR - PESQ...

🎛️ audio-to-audio 168
wyz

wyz/tfgridnet_for_urgent24

No description available.

🎛️ audio-to-audio 137