Overview
Learn to build an image retrieval system in this Python tutorial, which combines a Vision Transformer (ViT) with Facebook AI Similarity Search (FAISS). Use the neural network as a feature extractor that converts each image into a numerical vector, index an entire directory of images, and implement a search function that finds visually similar images. Explore FAISS's IndexIVFFlat index, which groups vectors into clusters and searches only the clusters closest to the query, making lookups faster on large datasets; see how training the index and choosing the number of clusters trade search speed against accuracy. Follow the practical implementation steps, including how to keep image metadata alongside the vectors and how to order search results by similarity score. The complete source code is available in the accompanying GitHub repository.
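The cluster-based search described above can be illustrated with a minimal NumPy sketch. It is not the tutorial's code and it does not use the FAISS API: the names `train_centroids` and `TinyIVFIndex`, the file paths, and the random vectors standing in for ViT features are all assumptions made for illustration. In FAISS itself, the number of clusters and the number of clusters searched per query are controlled by `nlist` and `nprobe`.

```python
import numpy as np

def train_centroids(vectors, n_clusters, n_iters=10, seed=0):
    """Learn cluster centroids with plain k-means (the 'training' step)."""
    rng = np.random.default_rng(seed)
    start = rng.choice(len(vectors), n_clusters, replace=False)
    centroids = vectors[start].copy()
    for _ in range(n_iters):
        # Squared L2 distance from every vector to every centroid.
        d = ((vectors[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = d.argmin(axis=1)
        for c in range(n_clusters):
            members = vectors[assign == c]
            if len(members):
                centroids[c] = members.mean(axis=0)
    return centroids

class TinyIVFIndex:
    """Hypothetical toy inverted-file index: one list of vector ids per cluster."""

    def __init__(self, centroids):
        self.centroids = centroids
        self.lists = [[] for _ in range(len(centroids))]
        self.vectors = []   # stored feature vectors
        self.paths = []     # metadata: image path for each stored vector

    def add(self, vectors, paths):
        d = ((vectors[:, None, :] - self.centroids[None, :, :]) ** 2).sum(axis=2)
        for vec, path, c in zip(vectors, paths, d.argmin(axis=1)):
            self.lists[c].append(len(self.vectors))
            self.vectors.append(vec)
            self.paths.append(path)

    def search(self, query, k=5, nprobe=1):
        """Scan only the nprobe clusters nearest to the query."""
        d_centroids = ((self.centroids - query) ** 2).sum(axis=1)
        probe = np.argsort(d_centroids)[:nprobe]
        candidates = [i for c in probe for i in self.lists[c]]
        if not candidates:
            return []
        vecs = np.stack([self.vectors[i] for i in candidates])
        d = ((vecs - query) ** 2).sum(axis=1)
        order = np.argsort(d)[:k]
        # (path, squared distance) pairs, most similar first.
        return [(self.paths[candidates[j]], float(d[j])) for j in order]

# Random vectors stand in for ViT features (assumption, no model is loaded).
rng = np.random.default_rng(0)
features = rng.normal(size=(200, 16)).astype("float32")
paths = [f"images/img_{i:03d}.jpg" for i in range(200)]

index = TinyIVFIndex(train_centroids(features, n_clusters=8))
index.add(features, paths)

fast = index.search(features[42], k=3, nprobe=2)    # scans 2 of 8 clusters
exact = index.search(features[42], k=3, nprobe=8)   # scans every cluster
```

Raising `nprobe` scans more clusters, which costs time but reduces the chance of missing a true neighbor that landed in an adjacent cluster; probing every cluster is equivalent to an exhaustive search.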
Syllabus
350 - Efficient Image Retrieval with Vision Transformer (ViT) and FAISS
Taught by
DigitalSreeni