Tag: Multimodal.

Indexing and searching images, audio, video, and mixed-media content.


Jan 22, 2026

Slides-to-speech: turn presentations into narrated content with CocoIndex

Turn slide decks into a continuously updated multimodal dataset with CocoIndex: extract speaker notes with Gemini Vision, synthesize narration with Piper TTS, and keep LanceDB in sync.

Examples Multimodal Embeddings Structured Extraction Vector Search
Dec 15, 2025

Extracting Structured Data from Patient Intake Forms with DSPy and CocoIndex

Extract Pydantic-typed structured data from patient intake forms using DSPy and CocoIndex: OCR vision models with incremental processing.

Examples Tutorial Structured Extraction Multimodal LLM
Oct 27, 2025

Index PDF elements: text, images with mixed embedding models and metadata

Extract, embed, and store multimodal PDF elements (text with SentenceTransformers, images with CLIP) for unified semantic search with traceable metadata.

Examples Feature Multimodal Embeddings Vector Search
Aug 20, 2025

Index PDFs, images, and slides together with ColPali: no OCR required

Build a unified visual document index from multiple file formats (including PDFs, images, and slides) using CocoIndex and ColPali. No OCR needed.

Examples Multimodal Embeddings Vector Search RAG
Aug 18, 2025

CocoIndex Changelog 2025-08-18

CocoIndex updates: production readiness, scalability, and reliability, plus more customization, native integrations, and multi-modal pipeline features.

Changelog Performance Multimodal Connectors Vector Search
Aug 12, 2025

Index Images with ColPali: Multi-Modal Context Engineering

CocoIndex now natively integrates ColPali for multi-vector, patch-level image indexing: multi-modal context engineering for visually rich documents and PDFs.

Examples Feature Multimodal Embeddings Vector Search
Aug 10, 2025

Multi-Dimensional Vector Support in CocoIndex

CocoIndex natively handles typed multi-dimensional vectors, from simple arrays to multi-vector embeddings, unlocking multimodal AI pipelines at scale.

Feature Embeddings Vector Search Multimodal
Jul 24, 2025

Indexing faces for visual search: build your own Google Photo Search

Build a scalable face detection and recognition pipeline with CocoIndex: embed faces, structure for search, and export to a vector DB.

Examples Tutorial Multimodal Embeddings Vector Search
May 20, 2025

Build image search and query with natural language with vision model CLIP

Index images in real time with CocoIndex and a vision model: compute multimodal embeddings and build a vector index for efficient retrieval.

Examples Multimodal Embeddings Vector Search Tutorial
Mar 26, 2025

Structured Extraction from Patient Intake Form with LLM

Extract structured data from patient intake forms in PDF and Word documents using an LLM and CocoIndex: a practical healthcare document extraction example.

Examples Structured Extraction LLM Multimodal
