Nirman.online | Premium AI Directory

microsoft

microsoft/table-transformer-detection

The Table Transformer is equivalent to DETR, a Transformer-based object detection model. Note that the authors decided to use the "normalize...

👁️ object-detection 3,423,345

facebook

facebook/w2v-bert-2.0

No description available.

🔍 feature-extraction 3,423,051

ETH-CVG

ETH-CVG/lightglue_superpoint

No description available.

📍 keypoint-detection 3,372,964

jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

🎙️ automatic-speech-recognition 3,370,633

magic-leap-community

magic-leap-community/superpoint

No description available.

📍 keypoint-detection 3,321,899

speechbrain

speechbrain/spkrec-resnet-voxceleb

No description available.

✨ General AI Model 3,305,084

llava-hf

llava-hf/llava-1.5-7b-hf

Model type: LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It i...

📖 image-text-to-text 3,202,279

google-bert

google-bert/bert-base-cased

BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it was pretrained on the ...

🎭 fill-mask 3,169,807

nomic-ai

nomic-ai/nomic-embed-text-v1

libraryname: sentence-transformers pipelinetag: sentence-similarity - feature-extraction - sentence-similarity - mteb - transformers - trans...

🔗 sentence-similarity 3,159,547

google

google/gemma-3-1b-it

No description available.

📝 text-generation 3,149,748

timm

timm/resnet50.a1_in1k

Model Type: Image classification / feature backbone - Model Stats: - Params (M): 25.6 - GMACs: 4.1 - Activations (M): 11.1 - Image size: tra...

📸 image-classification 3,121,259

Salesforce

Salesforce/blip-image-captioning-base

No description available.

🖼️ image-to-text 3,111,099

intfloat

intfloat/multilingual-e5-small

- multilingual - af - am - ar - as - az - be - bg - bn - br - bs - ca - cs - cy - da - de - el - en - eo - es - et - eu - fa - fi - fr - fy ...

🔗 sentence-similarity 3,078,444

MahmoudAshraf

MahmoudAshraf/mms-300m-1130-forced-aligner

No description available.

🎙️ automatic-speech-recognition 3,050,006

jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-polish

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

🎙️ automatic-speech-recognition 3,017,504

w11wo

w11wo/indonesian-roberta-base-posp-tagger

More information needed...

🏷️ token-classification 2,996,024

apple

apple/mobilevit-small

MobileViT is a light-weight, low latency convolutional neural network that combines MobileNetV2-style layers with a new block that replaces ...

📸 image-classification 2,987,479

Bingsu

Bingsu/yolo-world-mirror

No description available.

✨ General AI Model 2,967,196

laion

laion/CLIP-ViT-B-32-laion2B-s34B-b79K

No description available.

🎯 zero-shot-image-classification 2,953,106

mistralai

mistralai/Mistral-7B-Instruct-v0.2

No description available.

📝 text-generation 2,933,361

microsoft

microsoft/TRELLIS-image-large

No description available.

🧊 image-to-3d 2,858,877

intfloat

intfloat/multilingual-e5-base

- mteb - Sentence Transformers - sentence-similarity - sentence-transformers - name: multilingual-e5-base results: - task: type: Classificat...

🔗 sentence-similarity 2,850,419

patrickjohncyh

patrickjohncyh/fashion-clip

UPDATE (10/03/23): We have updated the model! We found that laion/CLIP-ViT-B-32-laion2B-s34B-b79K checkpoint (thanks Bin!) worked better tha...

🎯 zero-shot-image-classification 2,818,619

facebook

facebook/dinov2-small

The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a self-supervised fash...

🔎 image-feature-extraction 2,794,873

jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-greek

No description available.

🎙️ automatic-speech-recognition 2,749,675

indonesian-nlp

indonesian-nlp/wav2vec2-indonesian-javanese-sundanese

- id - jv - sun - mozilla-foundation/commonvoice70 - openslr - magicdata - titml - wer - audio - automatic-speech-recognition - hf-asr-leade...

🎙️ automatic-speech-recognition 2,735,282

Qwen

Qwen/Qwen2.5-VL-3B-Instruct

No description available.

📖 image-text-to-text 2,722,310

openai

openai/clip-vit-base-patch16

The CLIP model was developed by researchers at OpenAI to learn about what contributes to robustness in computer vision tasks. The model was ...

🎯 zero-shot-image-classification 2,646,498

google

google/siglip-so400m-patch14-384

SigLIP is CLIP, a multimodal model, with a better loss function. The sigmoid loss operates solely on image-text pairs and does not require a...

🎯 zero-shot-image-classification 2,622,346

Comfy-Org

Comfy-Org/z_image_turbo

No description available.

✨ General AI Model 2,605,495

WhereIsAI

WhereIsAI/UAE-Large-V1

- mteb - sentenceembedding - featureextraction - sentence-transformers - transformers - transformers.js - name: UAE-Large-V1 results: - task...

🔍 feature-extraction 2,571,594

meta-llama

meta-llama/Meta-Llama-3-8B

Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned genera...

📝 text-generation 2,571,135

sentence-transformers

sentence-transformers/LaBSE

No description available.

🔗 sentence-similarity 2,564,622

jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-dutch

- commonvoice - mozilla-foundation/commonvoice60 - wer - cer - audio - automatic-speech-recognition - hf-asr-leaderboard - mozilla-foundatio...

🎙️ automatic-speech-recognition 2,553,499

answerdotai

answerdotai/JaColBERTv2.5

No description available.

🔗 sentence-similarity 2,542,736

emilyalsentzer

emilyalsentzer/Bio_ClinicalBERT

- fill-mask...

🎭 fill-mask 2,532,882

EleutherAI

EleutherAI/pythia-160m

Developed by: EleutherAI - Model type: Transformer-based Language Model - Language: English - Learn more: Pythia's GitHub repository for tra...

📝 text-generation 2,524,662

zai-org

zai-org/GLM-OCR

No description available.

🖼️ image-to-text 2,515,619

ibm-granite

ibm-granite/granite-timeseries-ttm-r1

TTM falls under the category of “focused pre-trained models”, wherein each pre-trained TTM is tailored for a particular forecasting setting ...

🕒 time-series-forecasting 2,472,052

hustvl

hustvl/vitmatte-small-composition-1k

ViTMatte is a simple approach to image matting, the task of accurately estimating the foreground object in an image. The model consists of a...

✨ General AI Model 2,450,750

EssentialAI

EssentialAI/eai-distill-0.5b

🏆 Website | 🖥️ Code | 📖 Paper...

✨ General AI Model 2,408,963

stabilityai

stabilityai/stable-diffusion-xl-base-1.0

Developed by: Stability AI - Model type: Diffusion-based text-to-image generative model - License: CreativeML Open RAIL++-M License - Model ...

🎨 text-to-image 2,343,368

rhasspy

rhasspy/faster-whisper-tiny-int8

No description available.

✨ General AI Model 2,329,215

google-t5

google-t5/t5-base

No description available.

🌐 translation 2,317,951

zai-org

zai-org/GLM-5-FP8

No description available.

📝 text-generation 2,284,370

facebook

facebook/bart-large-cnn

BART is a transformer encoder-encoder (seq2seq) model with a bidirectional (BERT-like) encoder and an autoregressive (GPT-like) decoder. BAR...

✂️ summarization 2,283,383

pyannote

pyannote/segmentation

No description available.

🔊 voice-activity-detection 2,267,846

jonatasgrosman

jonatasgrosman/wav2vec2-large-xlsr-53-arabic

No description available.

🎙️ automatic-speech-recognition 2,264,585

Qwen

Qwen/Qwen3-30B-A3B-Instruct-2507

Qwen3-30B-A3B-Instruct-2507 has the following features: - Type: Causal Language Models - Training Stage: Pretraining & Post-training - Numbe...

📝 text-generation 2,196,902

Comfy-Org

Comfy-Org/Qwen-Image_ComfyUI

No description available.

✨ General AI Model 2,159,100

Discover the Best AI Models

Model Index 55939 Total

microsoft/table-transformer-detection

facebook/w2v-bert-2.0

ETH-CVG/lightglue_superpoint

jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

magic-leap-community/superpoint

speechbrain/spkrec-resnet-voxceleb

llava-hf/llava-1.5-7b-hf

google-bert/bert-base-cased

nomic-ai/nomic-embed-text-v1

google/gemma-3-1b-it

timm/resnet50.a1_in1k

Salesforce/blip-image-captioning-base

intfloat/multilingual-e5-small

MahmoudAshraf/mms-300m-1130-forced-aligner

jonatasgrosman/wav2vec2-large-xlsr-53-polish

w11wo/indonesian-roberta-base-posp-tagger

apple/mobilevit-small

Bingsu/yolo-world-mirror

laion/CLIP-ViT-B-32-laion2B-s34B-b79K

mistralai/Mistral-7B-Instruct-v0.2

microsoft/TRELLIS-image-large

intfloat/multilingual-e5-base

patrickjohncyh/fashion-clip

facebook/dinov2-small

jonatasgrosman/wav2vec2-large-xlsr-53-greek

indonesian-nlp/wav2vec2-indonesian-javanese-sundanese

Qwen/Qwen2.5-VL-3B-Instruct

openai/clip-vit-base-patch16

google/siglip-so400m-patch14-384

Comfy-Org/z_image_turbo

WhereIsAI/UAE-Large-V1

meta-llama/Meta-Llama-3-8B

sentence-transformers/LaBSE

jonatasgrosman/wav2vec2-large-xlsr-53-dutch

answerdotai/JaColBERTv2.5

emilyalsentzer/Bio_ClinicalBERT

EleutherAI/pythia-160m

zai-org/GLM-OCR

ibm-granite/granite-timeseries-ttm-r1

hustvl/vitmatte-small-composition-1k

EssentialAI/eai-distill-0.5b

stabilityai/stable-diffusion-xl-base-1.0

rhasspy/faster-whisper-tiny-int8

google-t5/t5-base

zai-org/GLM-5-FP8

facebook/bart-large-cnn

pyannote/segmentation

jonatasgrosman/wav2vec2-large-xlsr-53-arabic

Qwen/Qwen3-30B-A3B-Instruct-2507

Comfy-Org/Qwen-Image_ComfyUI