Discover the Best AI Models
Search, analyze, and download from our global directory of 3,000+ open-source models.
Model Index: 55,939 Total
SWivid/E2-TTS
No description available.
vinai/bartpho-syllable
No description available.
NX-AI/TiRex
No description available.
google/reformer-crime-and-punishment
Crime and Punishment is a novel written by Fyodor Dostoevsky and was translated into English....
google/gemma-7b
No description available.
kuleshov-group/mdlm-owt
The model, which has a context length of `1024` and is similar in size to GPT2-medium with approximately `130 million` non-embedding paramet...
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8
No description available.
optimum-intel-internal-testing/stable-diffusion-3-tiny-random
No description available.
google/vit-base-patch16-384
The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a supervised fashion, ...
mlx-community/Llama-3.2-1B-Instruct-4bit
Languages: en, de, fr, it, pt, hi, es, th. Library: transformers. Pipeline tag: text-generation. Tags: facebook, meta, pytorch, llama, llama-3...
weiweishi/roc-bert-base-zh
No description available.
flaubert/flaubert_base_cased
No description available.
google/rembert
RemBERT's main difference from mBERT is that the input and output embeddings are not tied. Instead, RemBERT uses small input embeddings and ...
jeffcookio/granite-3.3-8b-instruct-gptqmodel-4b-64g
No description available.
huggyllama/llama-7b
No description available.
stas/tiny-wmt19-en-de
No description available.
uclanlp/plbart-base
No description available.
microsoft/speecht5_tts
Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-m...
OpenMed/OpenMed-NER-PharmaDetect-SuperMedical-125M
No description available.
typeform/distilbert-base-uncased-mnli
Model Description: This is the uncased DistilBERT model fine-tuned on Multi-Genre Natural Language Inference (MNLI) dataset for the zero-sho...
HuggingFaceH4/zephyr-7b-beta
Model type: A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets. - Language(s) (NLP): Primarily Engl...
studio-ousia/luke-base
No description available.
OpenMed/OpenMed-NER-ProteinDetect-SnowMed-568M
No description available.
microsoft/prophetnet-large-uncased
No description available.
OpenMed/OpenMed-NER-GenomeDetect-PubMed-109M
No description available.
google/bert_for_seq_generation_L-24_bbc_encoder
No description available.
EleutherAI/pythia-6.9b-deduped
Developed by: EleutherAI - Model type: Transformer-based Language Model - Language: English - Learn more: Pythia's GitHub repository for tra...
nphSi/Z-Image-Lora
Always use the full LoRA name with the "vrtlxxxx" trigger in the prompt, e.g. "Alba Baptista (vrtlalbabaptista) in a swimming pool". "Woman" or "1girl"...
Qwen/Qwen2-1.5B
Qwen2 is a language model series including decoder language models of different model sizes. For each size, we release the base language mod...
nm-testing/Llama3_2_1B_speculator.eagle3
No description available.
Sehyo/Qwen3.5-122B-A10B-NVFP4
No description available.
warshanks/Jan-nano-AWQ
Note: Jan-Nano is a non-thinking model....
unsloth/MiniMax-M2.5-GGUF
To Run MiniMax-M2.5 locally - Read our Guide! Unsloth Dynamic 2.0 achieves superior accuracy & outperforms other leading quants....
Rostlab/prot_t5_xl_uniref50
ProtT5-XL-UniRef50 is based on the `t5-3b` model and was pretrained on a large corpus of protein sequences in a self-supervised fashion. Thi...
bosonai/hubert_base
No description available.
MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF
- Model creator: deepseek-ai - Original model: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B...
facebook/dinov3-vith16plus-pretrain-lvd1689m
These are Vision Transformer and ConvNeXt models trained following the method described in the DINOv3 paper. 12 models are provided: - 10 mo...
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
- Model creator: mistralai - Original model: mistralai/Mistral-7B-Instruct-v0.3...
lmstudio-community/MiniMax-M2.5-MLX-4bit
No description available.
Qwen/Qwen3-30B-A3B-FP8
This repo contains the FP8 version of Qwen3-30B-A3B, which has the following features: - Type: Causal Language Models - Training Stage: Pret...
facebook/blenderbot_small-90M
- Paper: Recipes for building an open-domain chatbot - Original PARLAI Code...
alibaba-damo/mgp-str-base
MGP-STR is a pure vision STR model, consisting of ViT and specially designed A^3 modules. The ViT module was initialized from the weights of D...
unsloth/embeddinggemma-300m
No description available.
junnyu/roformer_chinese_small
https://github.com/ZhuiyiTechnology/roformer...
ales/wav2vec2-cv-be
No description available.
susnato/clvp_dev
DISCLAIMER: I do not own any weights present in this repository. All weights belong to the author of the paper - "Better speech synthesis t...
nvidia/mit-b0
SegFormer consists of a hierarchical Transformer encoder and a lightweight all-MLP decode head to achieve great results on semantic segmenta...
microsoft/speecht5_asr
Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-m...
deepseek-community/Janus-Pro-1B
No description available.
facebook/mbart-large-en-ro
- translation...