Overview
Explore the capabilities of GLM-OCR, a 9-billion-parameter model developed by Z.ai that specializes in document optical character recognition, in this 11-minute tutorial. Test this two-stage pipeline model, which claims state-of-the-art performance among open-source OCR solutions, on document tasks including text extraction and table recognition. Learn how to run and evaluate GLM-OCR locally, compare its performance against other OCR solutions, and discover its strengths and limitations in real-world document processing scenarios.
Syllabus
GLM-OCR (9B) - Local OCR Test | OCR, Document Extraction, Table Recognition
Taught by
Venelin Valkov