Bring your own parser as building block with Google Document AI
Use Google Document AI to parse document, embed the resulting text, and store it in a vectorized database for semantic search.
Use Google Document AI to parse document, embed the resulting text, and store it in a vectorized database for semantic search.
Build a real-time trending topics detector by indexing HackerNews threads and comments, extracting structured topics using LLM-powered extraction, and querying trending discussions with CocoIndex's Custom Sources and Postgres.
Simple example to export Markdown files to local HTML files using Custom Targets.
Build a lightweight, incremental pipeline by treating any API as a data component—custom incremental connector for HackerNews using CocoIndex’s Custom Source API. Export the data to Postgres for semantic search and analytics.
How to use BAML together with CocoIndex to build a data pipeline that extracts structured patient information from PDF intake forms.
How to use DSPy together with CocoIndex to build a data pipeline that extracts structured patient information from PDF intake forms using vision models.