Texera: An Open-Source System for Cloud-Based Collaborative Data Science and AI/ML Using Workflows
USC Information Sciences Institute via YouTube
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Most AI Pilots Fail to Scale. MIT Sloan Teaches You Why — and How to Fix It
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
This talk presents Texera, an open-source cloud-based platform developed at UC Irvine since 2016 that enables collaborative data science, AI, and ML through workflows. Learn how Texera supports users with various technical backgrounds, including those with limited coding skills, domain scientists, and ML experts, to conduct AI-centric data science with Google Docs-like collaboration features. Discover the system's rich capabilities including shared editing and execution, version control, commenting, debugging, multi-language user-defined functions (Python, R, Java), and integration with state-of-the-art AI/ML techniques. Explore how its parallel backend engine enables scalable computation on large datasets using computing clusters, allowing bioinformaticians to elastically request AWS resources for computationally intensive jobs. Professor Chen Li, an IEEE fellow and ACM Distinguished Member from UC Irvine, discusses the research challenges encountered during development, their solutions, and demonstrates use cases in both educational and scientific communities.
Syllabus
Texera: An Open-Source System for Cloud-Based Collaborative Data Science and AI/ML Using Workflows
Taught by
USC Information Sciences Institute