Overview
Explore Docling, an open source Python package that has become the de facto standard for document parsing and export, earning nearly 30,000 GitHub stars in less than a year and joining the LF AI & Data Foundation. Learn how this document AI tool makes document processing fast and easy to use. Discover Docling's broad format support, including PDF, DOCX, PPTX, HTML, images, and Markdown, with conversion to structured Markdown or JSON. Master advanced document understanding capabilities that capture page layout, reading order, and table structure for complex analysis. Understand integration with popular AI frameworks such as LlamaIndex, LangChain, and LlamaStack for retrieval-augmented generation (RAG) and question-answering applications. Examine optical character recognition (OCR) support for scanned documents and visual language models such as SmolDocling, developed with Hugging Face. Navigate the command line interface (CLI) and the MCP connectors designed for developers. Learn strategies for deploying Docling as a service and at scale with docling-serve.
Syllabus
Docling: Get Your Documents Ready for Gen AI - Michele Dolfi & Peter Staar, IBM Research
Taught by
Linux Foundation