Anonymization of Sensitive Information in Financial Documents Using Python
EuroPython Conference via YouTube
Free courses from frontend to fullstack and AI
AI Engineer - Learn how to integrate AI into software applications
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn to anonymize sensitive information in financial documents using Python, diffusion models, and named entity recognition in this 27-minute conference talk from EuroPython 2025. Discover how to leverage open-source tools and self-hosted models to replace personally identifiable information (PII) with realistic synthetic alternatives while preserving document integrity. Explore various approaches for Named Entity Recognition (NER) to identify sensitive entities and understand how diffusion models can inpaint anonymized content. Address the critical challenge faced by financial, medical, and legal institutions that struggle to utilize their own data for machine learning due to privacy, compliance, and security requirements. Master practical techniques for creating high-quality training datasets while maintaining strict data protection standards, enabling organizations to develop better models without compromising sensitive information.
Syllabus
Anonymization of sensitive information in financial documents using python — Piotr Gryko
Taught by
EuroPython Conference