File size: 3,943 Bytes

9adf01f
50c97ee
 
 
 
 
9adf01f
50c97ee
9adf01f
50c97ee
 
 
 
 
 
 
9adf01f
50c97ee
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9adf01f
 
50c97ee
9adf01f
50c97ee
9adf01f
50c97ee
9adf01f
50c97ee
9adf01f
 
 
 
 
50c97ee
 
 
 
 
9adf01f
 
50c97ee
 
9adf01f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50c97ee

---
datasets:
- Elsafty
- Chula
- DSE
library_name: timm
license: cc-by-4.0
pipeline_tag: image-feature-extraction
tags:
- red-blood-cells
- hematology
- medical-imaging
- vision-transformer
- dino
- dinov2
- foundation-model
model-index:
- name: RedDino-small
  results:
  - task:
      type: image-classification
      name: RBC Shape Classification
    dataset:
      name: Elsafty
      type: Classification
    metrics:
    - type: Weighted F1
      value: 86.0
    - type: Balanced Accuracy
      value: 87.2
    - type: Accuracy
      value: 86.2
    - type: Weighted F1
      value: 84.3
    - type: Balanced Accuracy
      value: 78.5
    - type: Accuracy
      value: 84.4
    - type: Weighted F1
      value: 84.9
    - type: Balanced Accuracy
      value: 56.5
    - type: Accuracy
      value: 84.9
---

# RedDino: A foundation model for red blood cell analysis

[📄 Paper](https://arxiv.org/abs/2508.08180) | [💻 Code](https://github.com/Snarci/RedDino)

**RedDino** is a self-supervised Vision Transformer foundation model specifically designed for **red blood cell (RBC)** image analysis. This variant, **RedDino-small**, is the compact model in the family, delivering strong performance with lighter computational cost.

It leverages a tailored version of the **DINOv2** framework, trained on a meticulously curated dataset of 1.25 million RBC images from diverse acquisition modalities and sources. The model excels at extracting robust features for downstream hematology tasks such as **shape classification**, **morphological subtype recognition**, and **batch-effect–robust analysis**.

---

## Model Details

-   **Architecture:** ViT-small, patch size 14
-   **SSL framework:** DINOv2 (customized for RBC morphology)
-   **Pretraining dataset:** Curated RBC images from 18 datasets (multiple modalities and sources)
-   **Embedding size:** 384
-   **Intended use:** RBC morphology classification, feature extraction, batch-effect–robust analysis

Notes:
-   Trained with RBC-specific augmentations and DINOv2 customizations (e.g., removal of KoLeo regularizer; Sinkhorn-Knopp centering).
-   Optimized using smear patches rather than only single-cell crops to improve generalization across sources.

## Example Usage

```python
from PIL import Image
from torchvision import transforms
import timm
import torch

# Load model from Hugging Face Hub
model = timm.create_model("hf_hub:Snarcy/RedDino-small", pretrained=True)
model.eval()
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

# Load and preprocess image
image = Image.open("path/to/rbc_image.jpg").convert("RGB")
transform = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
input_tensor = transform(image).unsqueeze(0).to(device)

# Extract features
with torch.no_grad():
    embedding = model(input_tensor)
```
## 📝 Citation

If you use this model, please cite the following paper:

**RedDino: A foundation model for red blood cell analysis**  
Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Carsten Marr — 2025  
Preprint: arXiv:2508.08180. https://arxiv.org/abs/2508.08180

```bibtex
@misc{zedda2025reddinofoundationmodelred,
      title={RedDino: A foundation model for red blood cell analysis}, 
      author={Luca Zedda and Andrea Loddo and Cecilia Di Ruberto and Carsten Marr},
      year={2025},
      eprint={2508.08180},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2508.08180}, 
}
```

---

## Summary

RedDino is the first family of foundation models tailored for comprehensive red blood cell image analysis, using large-scale self-supervised learning to set new performance benchmarks and generalization standards for computational hematology. Models and pretrained weights are available for research and practical deployment.