SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
SmolDocling is a compact, 256M-parameter vision-language model that performs end-to-end document conversion, using a new markup format and delivering robust performance across various document types.
