Tag: Vector Search.

Vector indexes, similarity search, and the databases that back them.

← All tags · All posts

Feb 9, 2026

Building SEC EDGAR Financial Analytics with CocoIndex and Apache Doris

A multi-source pipeline that ingests SEC filings (TXT, JSON, PDF), scrubs PII, extracts topics, and powers hybrid search with CocoIndex + Apache Doris.

Examples Structured Extraction Vector Search Embeddings Connectors
Jan 22, 2026

Slides-to-speech: turn presentations into narrated content with CocoIndex

Turn slide decks into a continuously updated multimodal dataset with CocoIndex: extract speaker notes with Gemini Vision, synthesize narration with Piper TTS, and keep LanceDB in sync.

Examples Multimodal Embeddings Structured Extraction Vector Search
Oct 27, 2025

Index PDF elements: text, images with mixed embedding models and metadata

Extract, embed, and store multimodal PDF elements (text with SentenceTransformers, images with CLIP) for unified semantic search with traceable metadata.

Examples Feature Multimodal Embeddings Vector Search
Sep 21, 2025

Iterate faster on indexing: trace queries back to source data

Define query handlers in CocoIndex and trace search results back to source data in CocoInsight to close the loop on indexing strategy.

Examples Feature RAG Vector Search
Sep 1, 2025

Incrementally Transform Structured + Unstructured Data from Postgres with AI

Build unified, incrementally updated semantic + structured search over PostgreSQL data with CocoIndex: read a table, transform with AI and non-AI ops, and write pgvector embeddings back to Postgres.

Examples Postgres Incremental Processing Embeddings Vector Search
Aug 20, 2025

Index PDFs, images, and slides together with ColPali: no OCR required

Build a unified visual document index from multiple file formats (including PDFs, images, and slides) using CocoIndex and ColPali. No OCR needed.

Examples Multimodal Embeddings Vector Search RAG
Aug 18, 2025

CocoIndex Changelog 2025-08-18

CocoIndex updates: production readiness, scalability, and reliability, plus more customization, native integrations, and multi-modal pipeline features.

Changelog Performance Multimodal Connectors Vector Search
Aug 12, 2025

Index Images with ColPali: Multi-Modal Context Engineering

CocoIndex now natively integrates ColPali for multi-vector, patch-level image indexing: multi-modal context engineering for visually rich documents and PDFs.

Examples Feature Multimodal Embeddings Vector Search
Aug 10, 2025

Multi-Dimensional Vector Support in CocoIndex

CocoIndex natively handles typed multi-dimensional vectors, from simple arrays to multi-vector embeddings, unlocking multimodal AI pipelines at scale.

Feature Embeddings Vector Search Multimodal
Jul 24, 2025

Indexing faces for visual search: build your own Google Photo Search

Build a scalable face detection and recognition pipeline with CocoIndex: embed faces, structure for search, and export to a vector DB.

Examples Tutorial Multimodal Embeddings Vector Search
Jun 8, 2025

Flow-based schema inference for Qdrant

CocoIndex now sets up Qdrant collections automatically by inferring the target schema from your indexing flow: no manual config, vector sizes derived from the embedding model and kept in sync.

Feature Vector Search Connectors Data Indexing
May 31, 2025

CocoIndex Changelog 2025-05-31

CocoIndex updates: Amazon S3 as a data source, improved query handling, a standalone runtime mode, and more connector and performance improvements.

Changelog Connectors Incremental Processing Embeddings Vector Search
May 20, 2025

Build image search and query with natural language with vision model CLIP

Indexing images with CocoIndex and Vision Model in real-time: multi-modal embedding, and build vector index for efficient retrieval.

Examples Multimodal Embeddings Vector Search Tutorial
May 19, 2025

How to build an index with text embeddings

Build a semantic text index with CocoIndex and text embeddings, then query it with natural language: a beginner's guide to embeddings and vector search.

Examples Embeddings Vector Search RAG Tutorial
Apr 30, 2025

CocoIndex Changelog 2025-04-30

CocoIndex updates: knowledge graph support, Qdrant and Supabase targets, KTable and LTable data types, additional LLM providers, and more.

Changelog Knowledge Graph Connectors Vector Search LLM
Mar 23, 2025

Build text embeddings from Google Drive for RAG

Step-by-step tutorial to build text embeddings from Google Drive docs with CocoIndex, including service-account setup, and store them in Postgres for semantic search and RAG.

Examples Embeddings RAG Vector Search Connectors
Mar 18, 2025

Build Real-Time Codebase Indexing for AI Code Generation

Indexing codebase for RAG with CocoIndex and Tree-sitter in real-time: chunking, embedding, semantic search, and build vector index for efficient retrieval.

Examples RAG Embeddings Vector Search Tutorial
Feb 20, 2025

Customizable Data Indexing Pipelines

What customizable data indexing pipelines are, and why custom transformation logic matters, explained through clear comparisons and practical CocoIndex examples.

Data Indexing Insight RAG Embeddings Vector Search
Jan 4, 2025

CocoIndex - A Data Indexing Platform for AI Applications

CocoIndex is a data indexing platform for AI applications, handling ingestion, chunking, embedding, and pipeline management for RAG, semantic search, and knowledge graphs with built-in lineage and observability.

Data Indexing RAG Embeddings Vector Search Knowledge Graph

Tag: Vector Search.

Building SEC EDGAR Financial Analytics with CocoIndex and Apache Doris

Slides-to-speech: turn presentations into narrated content with CocoIndex

Index PDF elements: text, images with mixed embedding models and metadata

Iterate faster on indexing: trace queries back to source data

Incrementally Transform Structured + Unstructured Data from Postgres with AI

Index PDFs, images, and slides together with ColPali: no OCR required

CocoIndex Changelog 2025-08-18

Index Images with ColPali: Multi-Modal Context Engineering

Multi-Dimensional Vector Support in CocoIndex

Indexing faces for visual search: build your own Google Photo Search

Flow-based schema inference for Qdrant

CocoIndex Changelog 2025-05-31

Build image search and query with natural language with vision model CLIP

How to build an index with text embeddings

CocoIndex Changelog 2025-04-30

Build text embeddings from Google Drive for RAG

Build Real-Time Codebase Indexing for AI Code Generation

Customizable Data Indexing Pipelines

CocoIndex - A Data Indexing Platform for AI Applications