Instructions to use Shadow0482/mithu-mobilevit-dr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Shadow0482/mithu-mobilevit-dr with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-classification", model="Shadow0482/mithu-mobilevit-dr") pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png")# Load model directly from transformers import AutoImageProcessor, AutoModelForImageClassification processor = AutoImageProcessor.from_pretrained("Shadow0482/mithu-mobilevit-dr") model = AutoModelForImageClassification.from_pretrained("Shadow0482/mithu-mobilevit-dr") - Notebooks
- Google Colab
- Kaggle
Mithu-ViT: Diabetic Retinopathy Classifier
This is a MobileViT (Small) model fine-tuned on the Processed Diabetic Retinopathy dataset.
It classifies retina scans into 5 severity levels:
- 0: No DR
- 1: Mild
- 2: Moderate
- 3: Severe
- 4: Proliferative DR
Model Details
- Architecture: MobileViT-Small (Apple)
- Format: PyTorch (
pytorch_model.bin) and ONNX (mithu-vit.onnx) - Resolution: 256x256
- License: Apache 2.0
Usage (PyTorch)
from transformers import MobileViTForImageClassification, MobileViTImageProcessor
from PIL import Image
import torch
# 1. Load Model
model = MobileViTForImageClassification.from_pretrained("Shadow0482/mithu-mobilevit-dr")
processor = MobileViTImageProcessor.from_pretrained("Shadow0482/mithu-mobilevit-dr")
# 2. Load Image
image = Image.open("path_to_eye_scan.jpg").convert("RGB")
# 3. Predict
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
outputs = model(**inputs)
print("Predicted Class:", model.config.id2label[outputs.logits.argmax(-1).item()])
Usage (ONNX)
import onnxruntime as ort
import numpy as np
from PIL import Image
# 1. Start Session
session = ort.InferenceSession("mithu-vit.onnx")
# 2. Prepare Input
img = Image.open("test.jpg").resize((256, 256))
img_data = np.array(img).transpose(2, 0, 1).astype(np.float32) / 255.0
img_data = np.expand_dims(img_data, axis=0)
# 3. Run
outputs = session.run(None, {"pixel_values": img_data})
print("Logits:", outputs[0])
- Downloads last month
- 4