The Limits of Today's AI Models - Transformers, State Space Models, and the Future of Multimodal Intelligence
Y Combinator via YouTube
Syllabus
- Introducing Cartesia
- From Architecture Research to Startup
- What “Architecture Research” Really Means
- Why Transformers Hit a Ceiling
- State Space Models Explained
- Intelligence as Compression
- Retrieval vs. Abstraction
- Hybrid Architectures and the Future
- Why Cartesia Chose Voice AI
- What Multimodality Actually Means
- Audio as a Recipe for Other Modalities
- Tokens, Representations, and Learning Signals
- Learning Representations End-to-End
- Building for the “Average Human”
- Research vs. Product Reality
- One Vision, Ruthlessly Executed
- Product as a Truth Serum for Research
- Startup Gravity Applies to Research Too
Taught by
Y Combinator: The Vault