Discover the Best AI Models

Search, analyze, and download from our global directory of 3,000+ open-source models.

Model Index 55939 Total

microsoft

microsoft/table-transformer-detection

The Table Transformer is equivalent to DETR, a Transformer-based object detection model. Note that the authors decided to use the "normalize...

πŸ‘οΈ object-detection 3,423,345
facebook

facebook/w2v-bert-2.0

No description available.

πŸ” feature-extraction 3,423,051
ETH-CVG

ETH-CVG/lightglue_superpoint

No description available.

πŸ“ keypoint-detection 3,372,964
jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

πŸŽ™οΈ automatic-speech-recognition 3,370,633
magic-leap-community

magic-leap-community/superpoint

No description available.

πŸ“ keypoint-detection 3,321,899
speechbrain

speechbrain/spkrec-resnet-voxceleb

No description available.

✨ General AI Model 3,305,084
llava-hf

llava-hf/llava-1.5-7b-hf

Model type: LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It i...

πŸ“– image-text-to-text 3,202,279
google-bert

google-bert/bert-base-cased

BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it was pretrained on the ...

🎭 fill-mask 3,169,807
nomic-ai

nomic-ai/nomic-embed-text-v1

libraryname: sentence-transformers pipelinetag: sentence-similarity - feature-extraction - sentence-similarity - mteb - transformers - trans...

πŸ”— sentence-similarity 3,159,547
google

google/gemma-3-1b-it

No description available.

πŸ“ text-generation 3,149,748
timm

timm/resnet50.a1_in1k

Model Type: Image classification / feature backbone - Model Stats: - Params (M): 25.6 - GMACs: 4.1 - Activations (M): 11.1 - Image size: tra...

πŸ“Έ image-classification 3,121,259
Salesforce

Salesforce/blip-image-captioning-base

No description available.

πŸ–ΌοΈ image-to-text 3,111,099
intfloat

intfloat/multilingual-e5-small

- multilingual - af - am - ar - as - az - be - bg - bn - br - bs - ca - cs - cy - da - de - el - en - eo - es - et - eu - fa - fi - fr - fy ...

πŸ”— sentence-similarity 3,078,444
MahmoudAshraf

MahmoudAshraf/mms-300m-1130-forced-aligner

No description available.

πŸŽ™οΈ automatic-speech-recognition 3,050,006
jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-polish

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

πŸŽ™οΈ automatic-speech-recognition 3,017,504
w11wo

w11wo/indonesian-roberta-base-posp-tagger

More information needed...

🏷️ token-classification 2,996,024
apple

apple/mobilevit-small

MobileViT is a light-weight, low latency convolutional neural network that combines MobileNetV2-style layers with a new block that replaces ...

πŸ“Έ image-classification 2,987,479
Bingsu

Bingsu/yolo-world-mirror

No description available.

✨ General AI Model 2,967,196
laion

laion/CLIP-ViT-B-32-laion2B-s34B-b79K

No description available.

🎯 zero-shot-image-classification 2,953,106
mistralai

mistralai/Mistral-7B-Instruct-v0.2

No description available.

πŸ“ text-generation 2,933,361
microsoft

microsoft/TRELLIS-image-large

No description available.

🧊 image-to-3d 2,858,877
intfloat

intfloat/multilingual-e5-base

- mteb - Sentence Transformers - sentence-similarity - sentence-transformers - name: multilingual-e5-base results: - task: type: Classificat...

πŸ”— sentence-similarity 2,850,419
patrickjohncyh

patrickjohncyh/fashion-clip

UPDATE (10/03/23): We have updated the model! We found that laion/CLIP-ViT-B-32-laion2B-s34B-b79K checkpoint (thanks Bin!) worked better tha...

🎯 zero-shot-image-classification 2,818,619
facebook

facebook/dinov2-small

The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a self-supervised fash...

πŸ”Ž image-feature-extraction 2,794,873
jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-greek

No description available.

πŸŽ™οΈ automatic-speech-recognition 2,749,675
indonesian-nlp

indonesian-nlp/wav2vec2-indonesian-javanese-sundanese

- id - jv - sun - mozilla-foundation/commonvoice70 - openslr - magicdata - titml - wer - audio - automatic-speech-recognition - hf-asr-leade...

πŸŽ™οΈ automatic-speech-recognition 2,735,282
Qwen

Qwen/Qwen2.5-VL-3B-Instruct

No description available.

πŸ“– image-text-to-text 2,722,310
openai

openai/clip-vit-base-patch16

The CLIP model was developed by researchers at OpenAI to learn about what contributes to robustness in computer vision tasks. The model was ...

🎯 zero-shot-image-classification 2,646,498
google

google/siglip-so400m-patch14-384

SigLIP is CLIP, a multimodal model, with a better loss function. The sigmoid loss operates solely on image-text pairs and does not require a...

🎯 zero-shot-image-classification 2,622,346
Comfy-Org

Comfy-Org/z_image_turbo

No description available.

✨ General AI Model 2,605,495
WhereIsAI

WhereIsAI/UAE-Large-V1

- mteb - sentenceembedding - featureextraction - sentence-transformers - transformers - transformers.js - name: UAE-Large-V1 results: - task...

πŸ” feature-extraction 2,571,594
meta-llama

meta-llama/Meta-Llama-3-8B

Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned genera...

πŸ“ text-generation 2,571,135
sentence-transformers

sentence-transformers/LaBSE

No description available.

πŸ”— sentence-similarity 2,564,622
jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-dutch

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

πŸŽ™οΈ automatic-speech-recognition 2,553,499
answerdotai

answerdotai/JaColBERTv2.5

No description available.

πŸ”— sentence-similarity 2,542,736
emilyalsentzer

emilyalsentzer/Bio_ClinicalBERT

- fill-mask...

🎭 fill-mask 2,532,882
EleutherAI

EleutherAI/pythia-160m

Developed by: EleutherAI - Model type: Transformer-based Language Model - Language: English - Learn more: Pythia's GitHub repository for tra...

πŸ“ text-generation 2,524,662
zai-org

zai-org/GLM-OCR

No description available.

πŸ–ΌοΈ image-to-text 2,515,619
ibm-granite

ibm-granite/granite-timeseries-ttm-r1

TTM falls under the category of β€œfocused pre-trained models”, wherein each pre-trained TTM is tailored for a particular forecasting setting ...

πŸ•’ time-series-forecasting 2,472,052
hustvl

hustvl/vitmatte-small-composition-1k

ViTMatte is a simple approach to image matting, the task of accurately estimating the foreground object in an image. The model consists of a...

✨ General AI Model 2,450,750
EssentialAI

EssentialAI/eai-distill-0.5b

πŸ† Website | πŸ–₯️ Code | πŸ“– Paper...

✨ General AI Model 2,408,963
stabilityai

stabilityai/stable-diffusion-xl-base-1.0

Developed by: Stability AI - Model type: Diffusion-based text-to-image generative model - License: CreativeML Open RAIL++-M License - Model ...

🎨 text-to-image 2,343,368
rhasspy

rhasspy/faster-whisper-tiny-int8

No description available.

✨ General AI Model 2,329,215
google-t5

google-t5/t5-base

No description available.

🌐 translation 2,317,951
zai-org

zai-org/GLM-5-FP8

No description available.

πŸ“ text-generation 2,284,370
facebook

facebook/bart-large-cnn

BART is a transformer encoder-encoder (seq2seq) model with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder. BAR...

βœ‚οΈ summarization 2,283,383
pyannote

pyannote/segmentation

No description available.

πŸ”Š voice-activity-detection 2,267,846
jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-arabic

No description available.

πŸŽ™οΈ automatic-speech-recognition 2,264,585
Qwen

Qwen/Qwen3-30B-A3B-Instruct-2507

Qwen3-30B-A3B-Instruct-2507 has the following features: - Type: Causal Language Models - Training Stage: Pretraining & Post-training - Numbe...

πŸ“ text-generation 2,196,902
Comfy-Org

Comfy-Org/Qwen-Image_ComfyUI

No description available.

✨ General AI Model 2,159,100
1 2 3 4 5