Tag: Tutorial.

Step-by-step guides for building pipelines with CocoIndex.

Feb 5, 2026

Build a Self-Updating Wiki for Your Codebases with LLM

Build a CocoIndex pipeline that generates a wiki page for each project in your codebase using an LLM, and keeps it fresh with incremental processing.

Examples LLM Structured Extraction Incremental Processing Tutorial
Dec 15, 2025

Extracting Structured Data from Patient Intake Forms with DSPy and CocoIndex

Extract Pydantic-typed structured data from patient intake forms using DSPy and CocoIndex: OCR vision models with incremental processing.

Examples Tutorial Structured Extraction Multimodal LLM
Nov 25, 2025

Extract HackerNews into Postgres with a CocoIndex Custom Source

Build a custom incremental HackerNews connector with CocoIndex's Custom Source API and export to Postgres for semantic search and analytics.

Examples Custom Source Feature Postgres Tutorial
Nov 21, 2025

Extracting Intake Forms with BAML and CocoIndex

How to use BAML and CocoIndex to continuously extract structured data from patient intake forms (PDF/Word) with LLMs in production.

Examples Tutorial Structured Extraction LLM
Oct 11, 2025

Automated invoice processing with AI, Snowflake, and CocoIndex

Extract invoice fields from PDFs in Azure Blob Storage and load them into Snowflake with an incremental CocoIndex + GPT-4o pipeline: open-source unstructured ETL.

Examples Tutorial Structured Extraction Connectors Incremental Processing
Aug 3, 2025

Bring your own building blocks: Export anywhere with Custom Targets

CocoIndex now supports custom targets. Export indexed data to any destination: a local file, cloud storage, a REST API, or your own bespoke system.

Examples Feature Connectors Incremental Processing Tutorial
Jul 24, 2025

Indexing faces for visual search: build your own Google Photo Search

Build a scalable face detection and recognition pipeline with CocoIndex: embed faces, structure for search, and export to a vector DB.

Examples Tutorial Multimodal Embeddings Vector Search
Jul 9, 2025

Index academic papers and extract metadata for AI agents

How to index academic research papers by extracting metadata (e.g., title, authors, abstract) for AI agents and AI workflows using LLMs and CocoIndex.

Examples Structured Extraction Embeddings RAG Tutorial
May 20, 2025

Build image search and query with natural language with vision model CLIP

Index images in real time with CocoIndex and a vision model: generate multimodal embeddings and build a vector index for efficient retrieval.

Examples Multimodal Embeddings Vector Search Tutorial
May 19, 2025

How to build an index with text embeddings

Build a semantic text index with CocoIndex and text embeddings, then query it with natural language: a beginner's guide to embeddings and vector search.

Examples Embeddings Vector Search RAG Tutorial
Apr 29, 2025

Build Real-Time Knowledge Graph For Documents with LLM

CocoIndex now supports knowledge graphs with incremental processing. Building live knowledge for agents is super easy with CocoIndex!

Examples Knowledge Graph LLM Structured Extraction Tutorial
Mar 18, 2025

Build Real-Time Codebase Indexing for AI Code Generation

Index a codebase for RAG in real time with CocoIndex and Tree-sitter: chunking, embedding, semantic search, and a vector index for efficient retrieval.

Examples RAG Embeddings Vector Search Tutorial
Mar 17, 2025

On-premise structured extraction with LLM using Ollama

Learn to use CocoIndex to extract structured data from PDF and Markdown files with local LLMs running on Ollama, all on premises without sending data to external APIs.

Examples Tutorial Structured Extraction LLM Postgres
