MedGemma
The MedGemma collection contains Google's most capable open models for medical text and image comprehension, built on Gemma 3. Developers can use MedGemma to accelerate building healthcare-based AI applications. MedGemma comes in two variants: a 4B multimodal version and a 27B text-only version. The 4B version utilizes a SigLIP image encoder specifically pre-trained on de-identified medical data including chest X-rays, dermatology images, ophthalmology images, and histopathology slides.
2025-05-20
4B, 27B
Decoder-only Transformer (based on Gemma 3)
Health AI Developer Foundations terms of use
Specifications
- Parameters
- 4B, 27B
- Architecture
- Decoder-only Transformer (based on Gemma 3)
- License
- Health AI Developer Foundations terms of use
- Context Window
- 128,000 tokens
- Max Output
- 8,192 tokens
- Training Data Cutoff
- 2025-05-20
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
medqa-4op87.7
medmcqa74.2
pubmedqa76.8
mmlu-med87
medxpertqa26.7
afrimed-qa84
mimic-cxr-f188.9
chexpert-f148.1
dermmcqa-accuracy71.8
slakevqa-f162.3
vqa-rad-f149.9
pathmcqa-accuracy69.8
mimic-cxr-radgraph29.5
Advanced Specifications
- Model Family
- Gemma
- Finetuned From
- Gemma 3
- API Access
- Available
- Chat Interface
- Not Available
- Multilingual Support
- No
- Variants
- 4B instruction-tuned (medgemma-4b-it)4B pre-trained (medgemma-4b-pt)27B text instruction-tuned (medgemma-27b-text-it)
- Hardware Support
- CUDATPUCPU
Capabilities & Limitations
- Capabilities
- medical image classificationmedical image interpretationchest X-ray interpretationdermatology image analysishistopathology analysisophthalmology image analysisfundus image analysisradiology report generationmedical visual question answeringmedical text comprehensionclinical reasoningmedical knowledge retrievalmultimodal medical understandingpatient interviewingmedical triagingclinical decision supportmedical summarizationprompt engineering adaptationfine-tuning capabilityagentic orchestration
- Known Limitations
- Not intended for direct clinical diagnosis or treatment recommendationsDeveloper model that requires validation on intended use caseBaseline performance may need improvement through adaptationNot clinical grade without additional fine-tuningPrimarily evaluated on single-image tasksNot evaluated for multi-turn applicationsMay be more sensitive to specific prompts than base Gemma 327B model is text-only without image supportEvaluation primarily in English language
- Notable Use Cases
- medical image classificationmedical image interpretation and reportingradiology image analysisdigital pathology classificationfundus image analysisskin image classificationmedical visual question answeringpatient interviewing systemsmedical triaging applicationsclinical decision support toolsmedical summarizationhealthcare AI application developmentagentic medical systemsFHIR data processingprivate health data parsingmedical education platforms
- Function Calling Support
- Yes
- Tool Use Support
- Yes