Results for "image-classification"
100 matches found.
Falconsai/nsfw_image_detection
The Fine-Tuned Vision Transformer (ViT) is a variant of the transformer encoder architecture, similar to BERT, that has been adapted for ima...
timm/mobilenetv3_small_100.lamb_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 2.5 - GMACs: 0.1 - Activations (M): 1.4 - Image size: 224 x...
dima806/fairface_age_image_detection
Detects age group with about 59% accuracy based on an image....
timm/convnextv2_nano.fcmae_ft_in22k_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 15.6 - GMACs: 2.5 - Activations (M): 8.4 - Image size: trai...
google/vit-base-patch16-224
The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a supervised fashion, ...
iitolstykh/mivolo_v2
No description available.
timm/resnet50.a1_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 25.6 - GMACs: 4.1 - Activations (M): 11.1 - Image size: tra...
apple/mobilevit-small
MobileViT is a light-weight, low latency convolutional neural network that combines MobileNetV2-style layers with a new block that replaces ...
timm/resnet18.a1_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 11.7 - GMACs: 1.8 - Activations (M): 2.5 - Image size: trai...
Freepik/nsfw_image_detector
This model is a vision transformer based on the EVA architecture, fine-tuned for NSFW content classification. It has been trained to detect ...
timm/mobilenetv3_large_100.ra_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.5 - GMACs: 0.2 - Activations (M): 4.4 - Image size: 224 x...
timm/convnext_tiny.fb_in22k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 44.6 - GMACs: 4.5 - Activations (M): 13.5 - Image size: 224...
timm/efficientnet_b0.ra_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.3 - GMACs: 0.4 - Activations (M): 6.7 - Image size: 224 x...
timm/resnet34.a1_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 21.8 - GMACs: 3.7 - Activations (M): 3.7 - Image size: trai...
rizvandwiki/gender-classification
- image-classification - pytorch - huggingpics - accuracy...
microsoft/beit-base-patch16-224-pt22k-ft22k
The BEiT model is a Vision Transformer (ViT), which is a transformer encoder model (BERT-like). In contrast to the original ViT model, BEiT ...
amunchet/rorshark-vit-base
More information needed...
timm/vit_base_patch16_224.augreg_in21k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 102.6 - GMACs: 16.9 - Activations (M): 16.5 - Image size: 2...
buildborderless/CommunityForensics-DeepfakeDet-ViT
Vision Transformer (ViT) model trained on the largest dataset to-date for detecting AI-generated images in forensic applications. - Develope...
timm/vit_base_patch16_224.augreg2_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 86.6 - GMACs: 16.9 - Activations (M): 16.5 - Image size: 22...
timm/efficientnet_b2.ra_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 9.1 - GMACs: 0.9 - Activations (M): 12.8 - Image size: trai...
timm/vit_small_patch16_224.augreg_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 22.1 - GMACs: 4.3 - Activations (M): 8.2 - Image size: 224 ...
timm/edgenext_small.usi_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.6 - GMACs: 1.3 - Activations (M): 9.1 - Image size: train...
timm/regnety_016.tv2_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 11.2 - GMACs: 1.6 - Activations (M): 8.0 - Image size: 224 ...
AdamCodd/vit-base-nsfw-detector
The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a supervised fashion, ...
timm/convnext_femto.d1_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.2 - GMACs: 0.8 - Activations (M): 4.6 - Image size: train...
microsoft/swin-large-patch4-window7-224
The Swin Transformer is a type of Vision Transformer. It builds hierarchical feature maps by merging image patches (shown in gray) in deeper...
timm/convnext_small.in12k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 50.2 - GMACs: 8.7 - Activations (M): 21.6 - Image size: tra...
timm/tf_efficientnetv2_s.in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 21.5 - GMACs: 5.4 - Activations (M): 22.7 - Image size: tra...
timm/wide_resnet50_2.racm_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 68.9 - GMACs: 11.4 - Activations (M): 14.4 - Image size: tr...
microsoft/swinv2-tiny-patch4-window16-256
The Swin Transformer is a type of Vision Transformer. It builds hierarchical feature maps by merging image patches (shown in gray) in deeper...
timm/resnet18.a3_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 11.7 - GMACs: 0.9 - Activations (M): 1.3 - Image size: trai...
nateraw/vit-age-classifier
No description available.
google/vit-large-patch16-384
The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a supervised fashion, ...
timm/rexnet_150.nav_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 9.7 - GMACs: 0.9 - Activations (M): 11.2 - Image size: 224 ...
facebook/deit-tiny-patch16-224
This model is actually a more efficiently trained Vision Transformer (ViT). The Vision Transformer (ViT) is a transformer encoder model (BER...
timm/vit_base_patch8_224.augreg2_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 86.6 - GMACs: 66.9 - Activations (M): 65.7 - Image size: 22...
smp-hub/efficientnet-b0.imagenet
No description available.
timm/mobilenetv4_conv_small_050.e3000_r224_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 2.2 - GMACs: 0.1 - Activations (M): 1.2 - Image size: train...
microsoft/resnet-50
ResNet (Residual Network) is a convolutional neural network that democratized the concepts of residual learning and skip connections. This e...
timm/convnext_tiny.in12k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 28.6 - GMACs: 4.5 - Activations (M): 13.4 - Image size: tra...
rizvandwiki/gender-classification-2
- image-classification - pytorch - huggingpics - accuracy...
timm/tf_efficientnet_b0.ns_jft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.3 - GMACs: 0.4 - Activations (M): 6.7 - Image size: 224 x...
timm/vit_tiny_r_s16_p8_224.augreg_in21k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 10.4 - GMACs: 0.4 - Activations (M): 1.9 - Image size: 224 ...
timm/vit_tiny_patch16_224.augreg_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.7 - GMACs: 1.1 - Activations (M): 4.1 - Image size: 224 x...
facebook/deit-base-patch16-224
This model is actually a more efficiently trained Vision Transformer (ViT). The Vision Transformer (ViT) is a transformer encoder model (BER...
timm/convnextv2_base.fcmae_ft_in22k_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 88.7 - GMACs: 15.4 - Activations (M): 28.8 - Image size: tr...
haywoodsloan/ai-image-detector-dev-deploy
No description available.
timm/tf_efficientnet_b3.ns_jft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 12.2 - GMACs: 1.9 - Activations (M): 23.8 - Image size: 300...
timm/regnety_032.ra_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 19.4 - GMACs: 3.2 - Activations (M): 11.3 - Image size: tra...
timm/swin_base_patch4_window7_224.ms_in22k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 87.8 - GMACs: 15.5 - Activations (M): 36.6 - Image size: 22...
timm/resnet50.fb_swsl_ig1b_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 25.6 - GMACs: 4.1 - Activations (M): 11.1 - Image size: 224...
timm/convnext_base.fb_in22k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 88.6 - GMACs: 15.4 - Activations (M): 28.8 - Image size: tr...
timm/tf_mobilenetv3_large_minimal_100.in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 3.9 - GMACs: 0.2 - Activations (M): 4.4 - Image size: 224 x...
timm/tf_mobilenetv3_small_minimal_100.in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 2.0 - GMACs: 0.1 - Activations (M): 1.4 - Image size: 224 x...
timm/efficientnet_b3.ra2_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 12.2 - GMACs: 1.6 - Activations (M): 21.5 - Image size: tra...
timm/convnext_tiny.in12k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 36.9 - GMACs: 4.5 - Activations (M): 13.4 - Image size: 224...
timm/maxvit_nano_rw_256.sw_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 15.5 - GMACs: 4.5 - Activations (M): 30.3 - Image size: 256...
timm/beitv2_base_patch16_224.in1k_ft_in22k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 102.6 - GMACs: 17.6 - Activations (M): 23.9 - Image size: 2...
timm/repvit_m1.dist_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.5 - GMACs: 0.8 - Activations (M): 7.4 - Image size: 224 x...
timm/tf_efficientnetv2_m.in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 54.1 - GMACs: 15.9 - Activations (M): 57.5 - Image size: tr...
google/vit-base-patch16-384
The Vision Transformer (ViT) is a transformer encoder model (BERT-like) pretrained on a large collection of images in a supervised fashion, ...
nvidia/mit-b0
SegFormer consists of a hierarchical Transformer encoder and a lightweight all-MLP decode head to achieve great results on semantic segmenta...
timm/mobilenetv3_large_100.miil_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.5 - GMACs: 0.2 - Activations (M): 4.4 - Image size: 224 x...
timm/efficientnetv2_rw_m.agc_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 53.2 - GMACs: 12.7 - Activations (M): 47.1 - Image size: tr...
timm/vit_base_patch32_384.augreg_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 88.3 - GMACs: 12.7 - Activations (M): 12.1 - Image size: 38...
prithivMLmods/Age-Classification-SigLIP2
!AAAAAAAA.png...
timm/swin_tiny_patch4_window7_224.ms_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 28.3 - GMACs: 4.5 - Activations (M): 17.1 - Image size: 224...
Marqo/nsfw-image-detection-384
No description available.
timm/inception_resnet_v2.tf_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 55.8 - GMACs: 13.2 - Activations (M): 25.1 - Image size: 29...
timm/test_resnet.r160_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 0.5 - GMACs: 0.1 - Activations (M): 0.6 - Image size: 160 x...
dima806/man_woman_face_image_detection
Returns with about 98.7% accuracy whether the face belongs to man or woman based on face image....
microsoft/resnet-18
ResNet introduced residual connections, they allow to train networks with an unseen number of layers (up to 1000). ResNet won the 2015 ILSVR...
cledoux42/Ethnicity_Test_v003
No description available.
timm/efficientnet_b4.ra2_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 19.3 - GMACs: 3.1 - Activations (M): 34.8 - Image size: tra...
timm/deit_tiny_patch16_224.fb_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 5.7 - GMACs: 1.3 - Activations (M): 6.0 - Image size: 224 x...
timm/tf_efficientnetv2_s.in21k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 48.2 - GMACs: 5.4 - Activations (M): 22.8 - Image size: tra...
facebook/convnextv2-tiny-22k-224
ConvNeXt V2 is a pure convolutional model (ConvNet) that introduces a fully convolutional masked autoencoder framework (FCMAE) and a new Glo...
timm/hrnet_w32.ms_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 41.2 - GMACs: 9.0 - Activations (M): 22.0 - Image size: 224...
optimum-intel-internal-testing/tiny-random-vit
No description available.
timm/mobilenetv4_conv_small.e2400_r224_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 3.8 - GMACs: 0.2 - Activations (M): 2.0 - Image size: train...
smp-hub/mit_b3.imagenet
No description available.
dima806/facial_emotions_image_detection
Returns facial emotion with about 91% accuracy based on facial human image....
dima806/deepfake_vs_real_image_detection
Checks whether an image is real or fake (AI-generated)....
google/mobilenet_v2_1.0_224
From the original README: > MobileNets are small, low-latency, low-power models parameterized to meet the resource constraints of a variety ...
ISxOdin/vit-base-oxford-iiit-pets
This model is a fine-tuned version of a pre-trained Vision Transformer (`google/vit-base-patch16-224`) for image classification on the Oxfor...
timm/twins_svt_large.in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 99.3 - GMACs: 15.1 - Activations (M): 35.1 - Image size: 22...
timm/tf_efficientnet_b4.ns_jft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 19.3 - GMACs: 4.5 - Activations (M): 49.5 - Image size: 380...
timm/convnextv2_tiny.fcmae_ft_in22k_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 28.6 - GMACs: 4.5 - Activations (M): 13.4 - Image size: tra...
ibombonato/swin-age-classifier
- image-classification - pytorch - huggingpics - accuracy...
timm/swin_large_patch4_window12_384.ms_in22k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 196.7 - GMACs: 104.1 - Activations (M): 202.2 - Image size:...
jaranohaal/vit-base-violence-detection
This is a Vision Transformer (ViT) model fine-tuned for violence detection. The model is based on google/vit-base-patch16-224-in21k and has ...
facebook/deit-base-distilled-patch16-384
This model is a distilled Vision Transformer (ViT). It uses a distillation token, besides the class token, to effectively learn from a teach...
timm/swin_base_patch4_window12_384.ms_in22k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 87.9 - GMACs: 47.2 - Activations (M): 134.8 - Image size: 3...
peft-internal-testing/tiny-random-ResNetForImageClassification
No description available.
smp-hub/resnet18.imagenet
No description available.
timm/vit_base_patch32_224.augreg_in21k_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 88.2 - GMACs: 4.4 - Activations (M): 4.2 - Image size: 224 ...
timm/resnet18.fb_swsl_ig1b_ft_in1k
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 11.7 - GMACs: 1.8 - Activations (M): 2.5 - Image size: 224 ...
timm/convnextv2_tiny.fcmae_ft_in22k_in1k_384
Model Type: Image classification / feature backbone - Model Stats: - Params (M): 28.6 - GMACs: 13.1 - Activations (M): 39.5 - Image size: 38...
facebook/deit-small-patch16-224
This model is actually a more efficiently trained Vision Transformer (ViT). The Vision Transformer (ViT) is a transformer encoder model (BER...