Results for "audio-to-audio"
71 matches found.
nvidia/bigvgan_v2_22khz_80band_256x
No description available.
nvidia/bigvgan_v2_44khz_128band_512x
No description available.
nvidia/personaplex-7b-v1
No description available.
Qwen/Qwen3-TTS-Tokenizer-12Hz
No description available.
neuphonic/neucodec
NeuCodec is a Finite Scalar Quantisation (FSQ) based 0.8kbps audio codec for speech tokenization. It takes advantage of the following featur...
Aratako/MioCodec-25Hz-24kHz
No description available.
JacobLinCool/MP-SENet-DNS
No description available.
neuphonic/distill-neucodec
Distill-NeuCodec is a version of NeuCodec with a compatible, distilled encoder. The distilled encoder is 10x smaller in parameter count and ...
HKUSTAudio/xcodec2
No description available.
speechbrain/metricgan-plus-voicebank
No description available.
neuphonic/neucodec-onnx-decoder
No description available.
nvidia/bigvgan_v2_24khz_100band_256x
No description available.
NandemoGHS/Anime-XCodec2
No description available.
JorisCos/DCCRNet_Libri1Mix_enhsingle_16k
No description available.
Aratako/MioCodec-25Hz-44.1kHz-v2
No description available.
neuphonic/neucodec-onnx-decoder-int8
No description available.
JorisCos/ConvTasNet_Libri2Mix_sepnoisy_16k
No description available.
speechbrain/sepformer-wsj02mix
- Source Separation - Speech Separation - Audio Source Separation - WSJ02Mix - SepFormer - Transformer - audio-to-audio - audio-source-separ...
mpariente/DPRNNTasNet-ks2_WHAM_sepclean
No description available.
NandemoGHS/Anime-XCodec2-44.1kHz-v2
No description available.
hf-audio/xcodec2
No description available.
nvidia/bigvgan_22khz_80band
No description available.
nvidia/bigvgan_base_22khz_80band
No description available.
nvidia/bigvgan_v2_22khz_80band_fmax8k_256x
No description available.
microsoft/speecht5_vc
Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-m...
JorisCos/ConvTasNet_Libri2Mix_sepclean_16k
No description available.
JorisCos/ConvTasNet_Libri2Mix_sepclean_8k
No description available.
Mungert/LFM2.5-Audio-1.5B-GGUF
No description available.
LiquidAI/LFM2.5-Audio-1.5B
No description available.
aufklarer/PersonaPlex-7B-MLX-4bit
| Component | Architecture | Size | |---| | Temporal Transformer | 32-layer, 4096d, 32 heads (7B params) | ~3.5 GB (4-bit) | | Depformer | 6...
kyutai/hibiki-zero-3b-pytorch-bf16
This is the model simply referred to as Hibiki-Zero in our [paper][paper], a 3B-parameter hierarchical Transformer producing speech and text...
lucadellalib/focalcodec_50hz
No description available.
llm-jp/Llama-Mimi-1.3B
No description available.
speechbrain/sepformer-whamr16k
- audio-to-audio - audio-source-separation - Source Separation - Speech Separation - WHAM! - SepFormer - Transformer - pytorch - speechbrain...
YatharthS/LavaSR
No description available.
JusperLee/TIGER-speech
No description available.
julien-c/DPRNNTasNet-ks16_WHAM_sepclean
No description available.
speechbrain/mtl-mimic-voicebank
No description available.
JusperLee/Dolphin
Dolphin is an efficient audio-visual speech separation model that extracts target speech from noisy environments by combining acoustic and v...
nvidia/bigvgan_v2_44khz_128band_256x
No description available.
ktvoice/Codec
No description available.
JusperLee/TIGER-DnR
No description available.
lucadellalib/focalcodec_50hz_2k_causal
No description available.
lucadellalib/focalcodec_50hz_4k_causal
No description available.
YatharthS/NovaSR
No description available.
chenmozhijin/BSRoformer-GGUF
Official GGUF model repository for the BSRoformer.cpp project. This repository contains BS Roformer/Mel-Band-Roformer models converted to th...
JorisCos/ConvTasNet_Libri3Mix_sepclean_8k
No description available.
mlx-community/LFM2.5-Audio-1.5B-bf16
No description available.
speechbrain/sepformer-libri2mix
- Source Separation - Speech Separation - Audio Source Separation - Libri2Mix - SepFormer - Transformer - audio-to-audio - audio-source-sepa...
JorisCos/DCUNet_Libri1Mix_enhsingle_16k
No description available.
lucadellalib/dycast
No description available.
JorisCos/ConvTasNet_Libri3Mix_sepnoisy_8k
No description available.
Ceva-IP/DPDFNet
No description available.
nvidia/bigvgan_24khz_100band
No description available.
Aratako/MioCodec-25Hz-44.1kHz
No description available.
speechbrain/sepformer-wham16k-enhancement
- audio-to-audio - Speech Enhancement - WHAM! - SepFormer - Transformer - pytorch - speechbrain - WHAM - SI-SNR - PESQ...
maitrix-org/Voila-Tokenizer
No description available.
maitrix-org/Voila-chat
No description available.
mispeech/dashengtokenizer
No description available.
speechbrain/sepformer-whamr
- speechbrain - Source Separation - Speech Separation - Audio Source Separation - WHAM! - SepFormer - Transformer - audio-to-audio - audio-s...
speechbrain/sepformer-dns4-16k-enhancement
No description available.
speechbrain/sepformer-wham
- audio-to-audio - audio-source-separation - Source Separation - Speech Separation - Audio Source Separation - WHAM! - SepFormer - Transform...
mpariente/ConvTasNet_WHAM_sepclean
No description available.
NandemoGHS/Anime-XCodec2-44.1kHz
No description available.
HiDolen/Mini-BS-RoFormer-V2-46.8M
ๆจกๅๆปๅๆฐ้ 46.8M๏ผๆ้็ฒพๅบฆ BF16ใ ๅจ MUSDB18HQ ๆฐๆฎ็ val ้ไธ็ๆง่ฝ๏ผๅไฝ SDR๏ผ่ถ้ซ่ถๅฅฝ๏ผ๏ผ | tracks | Mini-BS-RoFormer-V2-46.8M | Mini-BS-RoFormer-18M | Mini-BS-RoForm...
LiquidAI/LFM2-Audio-1.5B
No description available.
patriotyk/vocos-mel-hifigan-compat-44100khz
Vocos is a fast neural vocoder designed to synthesize audio waveforms from acoustic features. Unlike other typical GAN-based vocoders, Vocos...
speechbrain/sepformer-whamr-enhancement
- audio-to-audio - Speech Enhancement - WHAMR! - SepFormer - Transformer - pytorch - speechbrain - SI-SNR - PESQ...
lucadellalib/focalcodec_12_5hz
No description available.
speechbrain/sepformer-wham-enhancement
- audio-to-audio - Speech Enhancement - WHAM! - SepFormer - Transformer - pytorch - speechbrain - SI-SNR - PESQ...
wyz/tfgridnet_for_urgent24
No description available.