CocoIndex Team

The team behind CocoIndex

Posts from the CocoIndex team — product launches, release notes, and announcements.

Posts by CocoIndex Team

Jul 7, 2025

CocoIndex Changelog 2025-07-07

CocoIndex updates: in-process setup/drop API, EmbedText building block, SplitRecursively improvements, union/NumPy types, and the Kuzu graph target.

Changelog Embeddings LLM Knowledge Graph Incremental Processing
May 31, 2025

CocoIndex Changelog 2025-05-31

CocoIndex updates: Amazon S3 as a data source, improved query handling, a standalone runtime mode, and more connector and performance improvements.

Changelog Connectors Incremental Processing Embeddings Vector Search
Apr 30, 2025

CocoIndex Changelog 2025-04-30

CocoIndex updates: knowledge graph support, Qdrant and Supabase targets, KTable and LTable data types, additional LLM providers, and more.

Changelog Knowledge Graph Connectors Vector Search LLM
Apr 7, 2025

CocoIndex Changelog 2025-04-07

CocoIndex updates: incremental live update mode, evaluation utilities, date/time types, a Google Drive source, and core performance improvements.

Changelog Incremental Processing Connectors Structured Extraction
Apr 7, 2025

Keep derived data in sync with changing sources

CocoIndex continuously watches source changes and applies incremental updates to keep derived data in sync, with low latency and no full reindexing.

Feature Incremental Processing Data Indexing Connectors
Mar 26, 2025

Structured Extraction from Patient Intake Form with LLM

Extract typed Patient records from PDF and DOCX intake forms with an LLM and CocoIndex v1: the nested schema is the whole prompt; results land in Postgres.

Examples Structured Extraction LLM Multimodal
Mar 23, 2025

Search your Google Drive by meaning

Index the documents in a shared Google Drive folder as text embeddings in Postgres with CocoIndex v1, then search them by meaning instead of by filename.

Examples Embeddings Vector Search Connectors Data Indexing
Mar 20, 2025

CocoIndex Changelog 2025-03-20

First release of CocoIndex Changelog: LLM support, codebase indexing, custom functions, and assorted core/performance improvements

Changelog LLM Structured Extraction RAG
Mar 18, 2025

Build Real-Time Codebase Indexing for AI Code Generation

Indexing codebase for RAG with CocoIndex and Tree-sitter in real-time: chunking, embedding, semantic search, and build vector index for efficient retrieval.

Examples RAG Embeddings Vector Search Tutorial
Mar 17, 2025

On-premise structured extraction from PDFs with Ollama

Extract structured data from PDF manuals locally with Ollama and CocoIndex: docling converts PDFs to Markdown, a local LLM fills typed Postgres rows.

Examples Tutorial Structured Extraction LLM Postgres
Mar 3, 2025

We are officially open sourced! 🎉

CocoIndex is now open source: the first engine to combine custom transformation logic with incremental processing built specifically for data indexing.

Announcement Changelog Incremental Processing Data Indexing RAG
Feb 20, 2025

Customizable Data Indexing Pipelines

What customizable data indexing pipelines are and why custom transformation logic matters, with practical CocoIndex examples.

Data Indexing Insight RAG Embeddings Vector Search
Jan 30, 2025

How indexing pipelines differ from other data pipelines

What makes indexing pipelines different from other data systems, and why they need special handling for incremental processing and persistence.

Insight Incremental Processing Architecture Data Indexing
Jan 20, 2025

System updates and automatic schema inference

How CocoIndex handles system updates in indexing flows: automatic schema inference and managing data + logic evolution without downtime.

Data Indexing Best Practices Feature Architecture
Jan 10, 2025

Processing Large Files in Data Indexing Systems

Handle large files in data indexing: processing granularity, fan-in/fan-out, and memory pressure, walked through a patent XML example in CocoIndex.

Data Indexing Best Practices Performance Architecture
Jan 6, 2025

Data Consistency in Indexing Pipelines

Data consistency in indexing pipelines: concurrent updates, exposure risks, and how CocoIndex's data-driven approach keeps indexes converging.

Data Indexing Best Practices Insight Architecture
Jan 5, 2025

Data Indexing and Common Challenges

Fundamentals of data indexing pipelines for RAG: what makes a good one, common production pitfalls, and how CocoIndex addresses them.

Data Indexing Best Practices Insight
Jan 4, 2025

CocoIndex - A Data Indexing Platform for AI Applications

CocoIndex is a data indexing platform for AI: ingestion, chunking, embedding, and pipeline management for RAG, semantic search, and knowledge graphs.

Data Indexing RAG Embeddings Vector Search Knowledge Graph

CocoIndex Team

Posts by CocoIndex Team

CocoIndex Changelog 2025-07-07

CocoIndex Changelog 2025-05-31

CocoIndex Changelog 2025-04-30

CocoIndex Changelog 2025-04-07

Keep derived data in sync with changing sources

Structured Extraction from Patient Intake Form with LLM

Search your Google Drive by meaning

CocoIndex Changelog 2025-03-20

Build Real-Time Codebase Indexing for AI Code Generation

On-premise structured extraction from PDFs with Ollama

We are officially open sourced! 🎉

Customizable Data Indexing Pipelines

How indexing pipelines differ from other data pipelines

System updates and automatic schema inference

Processing Large Files in Data Indexing Systems

Data Consistency in Indexing Pipelines

Data Indexing and Common Challenges

CocoIndex - A Data Indexing Platform for AI Applications