ComfyUI-Florence2
A GitHub repository that integrates Microsoft’s Florence-2, an advanced vision foundation model, into ComfyUI. It enables prompt-based vision and vision-language tasks such as captioning, object detection, segmentation, and Document Visual Question Answering (DocVQA) on scanned documents.
Key Information
- Category: Workflow Automation
- Source: Github
- Tags: Python
- Last updated: January 09, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://github.com/kijai/ComfyUI-Florence2