Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore an open-source framework that transforms unstructured data into valuable structured information using large language models in this 27-minute conference talk. Learn how Trallie, backed by the NGI Search Consortium, revolutionizes information extraction by requiring minimal manual annotation and few representative examples to convert data into structured formats. Discover the framework's three core objectives: reducing dependency on labeled training data through powerful LLM prompting, providing flexibility across multiple document formats (HTML, PDF, TXT, JSON) and five European languages (English, French, Italian, German, Spanish), and making structured data extraction accessible to non-expert users through automatic schema inference. Understand how Trallie addresses the challenge of data extraction when users may not be domain experts by automatically inferring schemas from document sets while allowing user modifications as needed, presented by experts from Pi School at a Linux Foundation event.
Syllabus
Trallie: Shaping Unstructured Data Into Valuable Information - Vijayasri Iyer & Cristiano De Nobili
Taught by
Linux Foundation