A compact, efficient vision-language model designed for visual document understanding. It automates content extraction from tables, charts, infographics, plots, diagrams, and more.
vision
tools
2b
26K Pulls
Updated 3 weeks ago