Explore how to run AI models on constrained hardware in this conference talk from C++Now 2025, where research engineer Ramon Perez demonstrates building Cortex.cpp, an open-source engine for deploying AI on devices ranging from Raspberry Pis to industrial robots. Learn the fundamentals of how AI models operate on different hardware architectures from a C++ perspective, discover techniques for integrating Python with C++ and vice versa, and study performance optimization strategies including aggressive bit manipulation, custom memory allocators, and SIMD optimizations. Examine the architectural design of Cortex and compare it with other C++ implementations while weighing the tradeoffs involved in bringing AI capabilities to resource-constrained devices. Gain insight into where artificial intelligence meets embedded systems, and how carefully optimized memory management and low-level programming techniques let everyday devices achieve intelligent behavior.