Discover the Best AI Models
Search, analyze, and download from our global directory of 3,000+ open-source models.
Model Index 55939 Total
OpenMed/OpenMed-NER-GenomicDetect-PubMed-109M
No description available.
TimKond/S-PubMedBert-MedQuAD
No description available.
google-bert/bert-large-cased
BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it was pretrained on the ...
timm/ViT-B-16-SigLIP2-256
A SigLIP 2 Vision-Lanuage model trained on WebLI. This model has been converted for use in OpenCLIP from the original JAX checkpoints in Big...
facebook/mms-tts-kor
VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an end-to-end speech synthesis model that predicts a...
OpenMed/OpenMed-NER-GenomeDetect-ModernMed-149M
No description available.
M-FAC/bert-mini-finetuned-mnli
This model is finetuned on MNLI dataset with state-of-the-art second-order optimizer M-FAC. Check NeurIPS 2021 paper for more details on M-F...
sentence-transformers/msmarco-distilbert-base-v4
No description available.
kakaocorp/kanana-1.5-v-3b-instruct
Developed by: Unified Foundation Model (UFO) TF at Kakao - Language(s) : ['en', 'ko'] - Model Architecture: kanana-1.5-v-3b-instruct has 3.6...
timm/convnextv2_base.fcmae_ft_in22k_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 88.7 - GMACs: 15.4 - Activations (M): 28.8 - Image size: tr...
medieval-data/qwen2-vl-2b-catmus
```python import torch from transformers import Qwen2VLForConditionalGeneration, AutoTokenizer, AutoProcessor from qwenvlutils import proces...
MaziyarPanahi/Qwen3-14B-GGUF
- Model creator: Qwen - Original model: Qwen/Qwen3-14B...
MaziyarPanahi/Qwen3-4B-GGUF
- Model creator: Qwen - Original model: Qwen/Qwen3-4B...
philschmid/bart-large-cnn-samsum
- sagemaker - bart - summarization - samsum - text: "Jeff: Can I train a \U0001F917 Transformers model on Amazon SageMaker? \n\ Philipp: Sur...
cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit
No description available.
nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base
No description available.
unsloth/gemma-3-27b-it-GGUF
See our collection for all versions of Gemma 3 including GGUF, 4-bit & 16-bit formats. Read our Guide to see how to Run Gemma 3 correctly....
nvidia/segformer-b1-finetuned-ade-512-512
SegFormer consists of a hierarchical Transformer encoder and a lightweight all-MLP decode head to achieve great results on semantic segmenta...
Helsinki-NLP/opus-mt-en-es
No description available.
MaziyarPanahi/Qwen3-0.6B-GGUF
- Model creator: Qwen - Original model: Qwen/Qwen3-0.6B...
microsoft/trocr-large-handwritten
The TrOCR model is an encoder-decoder model, consisting of an image Transformer as encoder, and a text Transformer as decoder. The image enc...
valhalla/distilbart-mnli-12-3
No description available.
HuggingFaceM4/idefics2-8b
No description available.
microsoft/deberta-v2-xlarge
No description available.
trl-internal-testing/tiny-DeepseekV3ForCausalLM
No description available.
trl-internal-testing/tiny-Qwen3MoeForCausalLM
No description available.
siebert/sentiment-roberta-large-english
No description available.
nlpai-lab/KURE-v1
This is the model card of a 🤗 transformers model that has been pushed on the Hub. - Developed by: NLP&AI Lab - Language(s) (NLP): Korean, En...
meta-llama/Llama-Guard-4-12B
Llama Guard 4 is a natively multimodal safety classifier with 12 billion parameters trained jointly on text and multiple images. Llama Guard...
dunzhang/stella-mrl-large-zh-v3.5-1792d
pipelinetag: sentence-similarity - sentence-transformers - feature-extraction - sentence-similarity - mteb - name: stella-mrl-large-zh-v3.5-...
trl-internal-testing/tiny-MistralForCausalLM-0.2
No description available.
state-spaces/mamba-130m-hf
No description available.
facebook/audiobox-aesthetics
No description available.
cagliostrolab/animagine-xl-3.0
Developed by: Cagliostro Research Lab - Model type: Diffusion-based text-to-image generative model - Model Description: Animagine XL 3.0 is ...
HuggingFaceM4/Idefics3-8B-Llama3
No description available.
MaziyarPanahi/Qwen3-1.7B-GGUF
- Model creator: Qwen - Original model: Qwen/Qwen3-1.7B...
ai4bharat/IndicBART
No description available.
5CD-AI/Vietnamese-Sentiment-visobert
No description available.
ibm-research/materials.selfies-ted
No description available.
MaziyarPanahi/Qwen3-8B-GGUF
- Model creator: Qwen - Original model: Qwen/Qwen3-8B...
OpenMed/OpenMed-NER-GenomicDetect-BigMed-560M
No description available.
unslothai/vram-40
No description available.
Arunavaonly/Bangla-twoclass-Sentiment-Analyzer
More information needed...
yainage90/fashion-object-detection
This model is fine-tuned version of microsoft/conditional-detr-resnet-50....
google/mt5-small
- multilingual - af - am - ar - az - be - bg - bn - ca - ceb - co - cs - cy - da - de - el - en - eo - es - et - eu - fa - fi - fil - fr - f...
microsoft/phi-1_5
No description available.
unsloth/mistral-7b-bnb-4bit
No description available.
OpenMed/OpenMed-NER-ChemicalDetect-MultiMed-568M
No description available.
jason9693/Qwen2.5-1.5B-apeach
No description available.
Salesforce/codegen-350M-mono
CodeGen is a family of autoregressive language models for program synthesis from the paper: A Conversational Paradigm for Program Synthesis ...