Why did we open-source our inference engine? Read the post

mynkchaudhry/Florence-2-FT-DocVQA

This model card provides details about the Florence-2-FT-DocVQA model, which is fine-tuned for Document Visual Question Answering (VQA) tasks.

Overview

Architecture
Florence-2
Parameters
271M
Tasks
Extract
Outputs
text_regions
License
apache-2.0
Languages
en

Benchmarks

DocVQA

general kie en

Visual question answering on document images

Corpus: 5,188 Queries: 5,188
Quality
anls 0.3521
exact match 0.2600
Performance L4 b1 c16
Reference →

Self-hosted inference for search & document processing

Cut API costs by 50x, boost quality with 85+ SOTA models, and keep your data in your own cloud.

Github 2.0K

Contact us

Tell us about your use case and we'll get back to you shortly.