Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Vector Database Foundations and Core Concepts

Coursera via Coursera

Overview

Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Vector databases are transforming how machines understand and retrieve information across AI applications. This comprehensive course demystifies vector database technologies, taking you from foundational concepts to advanced implementation techniques. You'll learn to generate high-quality embeddings, calculate sophisticated similarity metrics, and implement efficient vector search algorithms. Through hands-on modules, you'll gain practical skills in converting raw data into meaningful vector representations, evaluating embedding quality, and optimizing search performance. The course covers critical techniques used in semantic search, recommendation systems, and retrieval-augmented generation. Whether you're an aspiring machine learning engineer or a data professional looking to enhance your AI toolkit, you'll develop the expertise to design performant vector search systems. Who this is for: Machine learning engineers, data scientists, and AI professionals eager to master vector database technologies. Basic programming and machine learning familiarity recommended.

Syllabus

  • Grasp Vector DB Basics
    • In this module, you will discover the fundamental concepts that make modern AI search possible. You will learn what a vector database is, how it uses embeddings to understand unstructured data, and why this enables a "semantic search" that goes far beyond simple keywords.
  • Embed Everything
    • Embed Everything is an intermediate course for ML practitioners and Python developers. You’ll convert unstructured data into numerical embeddings, build a scalable pipeline, apply pre‑trained models to text and images, evaluate with t‑SNE and nearest‑neighbor analysis, and script production‑ready batch processing.
  • Measure Vector Similarity
    • Measure Vector Similarity is an intermediate course for ML engineers and data scientists to master cosine, dot‑product, and Euclidean metrics in retrieval, recommendation, and classification. You’ll implement each with Python/NumPy, explore Amazon and healthcare examples, and complete an assignment notebook benchmarking performance for a portfolio‑ready project.
  • Master ANN Search
    • Master ANN Search is an intermediate course for ML engineers and AI practitioners building high‑speed, large‑scale vector search. You’ll implement FAISS/Annoy, evaluate recall‑vs‑latency trade‑offs, benchmark against brute‑force, and complete a project optimizing a 100 k‑vector index for RAG or recommendation systems.
  • Tune HNSW
    • Tune HNSW is an intermediate course for ML practitioners and AI engineers to master vector‑search optimization. You’ll learn HNSW theory, tune efConstruction, M, and efSearch, build an index from scratch, chart precision‑latency trade‑offs, and complete a portfolio‑ready project optimizing search for chatbots or visual retrieval.
  • GenAI Literacy: AI-Assisted Embedding Workflows
    • This module explores how generative AI tools can augment your embedding and indexing workflows, from generating boilerplate code to debugging configuration issues. You'll learn effective prompt engineering techniques for ML tasks while understanding when human expertise remains essential.

Taught by

Professionals from the Industry

Reviews

Start your review of Vector Database Foundations and Core Concepts

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.