George He

Cofounder & CTO of CocoIndex

Maintainer of CocoIndex, Ex-Google Infra Lead. Writes about incremental data infrastructure, Rust internals, and the engineering decisions behind the engine.

Posts by George He

Jul 7, 2026

CocoIndex Changelog 1.0.8 - 1.0.16

Changelog for CocoIndex 1.0.8-1.0.16: persistent per-component state, LiveMap, rate limiting, batched target writes, BigQuery and Snowflake connectors.

Connectors Performance
Jun 1, 2026

CocoIndex Changelog 1.0.1 - 1.0.7

CocoIndex's first post-v1 releases: stable memoization keys, scheduled live refresh, scoped stats, safer SQL connectors, and more integrations.

Changelog Announcement Connectors Performance
Apr 28, 2026

Live CSV → Kafka with CocoIndex's New Kafka Target Connector

Walk through a live CocoIndex pipeline that watches a folder of CSV files and publishes each row as JSON to a Kafka topic incrementally, with no glue code.

Feature Examples Connectors Incremental Processing AI Agents
Apr 22, 2026

CocoIndex V1 is Live!

CocoIndex V1 is live: a ground-up redesign of incremental data pipelines, built for AI engineers and agent builders shipping RAG, memory, and knowledge graphs.

Announcement Feature Incremental Processing Architecture AI Agents
Apr 2, 2026

Turn Podcasts into a Knowledge Graph with LLM and CocoIndex

Build a pipeline that turns YouTube podcasts into a knowledge graph: extract speakers, statements, and entities with an LLM, then dedupe them with embeddings.

Examples Knowledge Graph LLM Structured Extraction Incremental Processing
Mar 27, 2026

From pickle to type-guided, safer Python serialization

How CocoIndex moved from pickle to type-guided serialization that uses Python type hints to pick the right serializer, no decorators or registration needed.

Insight Architecture Best Practices
Mar 24, 2026

Invisible Daemon: architecture patterns for local dev tools

Five patterns for a Python CLI background daemon that auto-starts, upgrades transparently, and shuts down fast, from the daemon behind cocoindex-code.

Best Practices Architecture AI Agents
Mar 10, 2026

CocoIndex Changelog 0.3.27 - 0.3.34

Featuring five new target connectors, filesystem-level change detection, Python 3.14 free-threading, and smarter pipeline lifecycle management.

Changelog Connectors Performance Incremental Processing
Jan 18, 2026

CocoIndex Changelog 0.3.11 - 0.3.26

CocoIndex updates: production-ready resilience, a structured error system, expanded integrations, and always-fresh context for agents.

Changelog Connectors Structured Extraction Knowledge Graph
Nov 25, 2025

CocoIndex Changelog 0.2.21 - 0.3.10

Featuring batching support for CocoIndex functions, execution robustness, schema & type system improvements, custom source support, and more.

Changelog Feature Performance Custom Source Connectors
Nov 10, 2025

Adaptive Batching - 5x throughput on your data pipelines

CocoIndex now batches GPU and ML workloads automatically: 5x throughput on text embeddings and AI ops, with zero configuration required.

Feature Performance Embeddings Best Practices
Oct 19, 2025

CocoIndex Changelog 2025-10-19

Production-ready upgrades: durable execution, faster incremental processing over large datasets, GPU isolation, and richer native building blocks.

Changelog Incremental Processing Postgres Structured Extraction Connectors
Oct 10, 2025

Thinking in Rust: Ownership, Access, and Memory Safety

A mental framework for Rust's memory safety concepts. Think systematically about ownership, references, Send, Sync, and Rc, Arc, RefCell, Mutex, etc.

Insight Architecture Best Practices
Aug 13, 2025

Control Processing Concurrency in CocoIndex

How CocoIndex's layered concurrency controls optimize data-processing performance, prevent system overload, and keep pipelines stable and efficient at scale.

Feature Performance Best Practices Architecture

George He

Posts by George He

CocoIndex Changelog 1.0.8 - 1.0.16

CocoIndex Changelog 1.0.1 - 1.0.7

Live CSV → Kafka with CocoIndex's New Kafka Target Connector

CocoIndex V1 is Live!

Turn Podcasts into a Knowledge Graph with LLM and CocoIndex

From pickle to type-guided, safer Python serialization

Invisible Daemon: architecture patterns for local dev tools

CocoIndex Changelog 0.3.27 - 0.3.34

CocoIndex Changelog 0.3.11 - 0.3.26

CocoIndex Changelog 0.2.21 - 0.3.10

Adaptive Batching - 5x throughput on your data pipelines

CocoIndex Changelog 2025-10-19

Thinking in Rust: Ownership, Access, and Memory Safety

Control Processing Concurrency in CocoIndex