Skip to main content

6 docs tagged with "vector-index"

View all tags

Academic Papers Indexing

Build a real-time academic papers index. Extract metadata, chunk and embed abstracts, and enable semantic and author-based search over academic PDFs.

Index PDFs, Images, Slides without OCR

Build a visual document indexing pipeline using ColPali to index scanned documents, PDFs, academic papers, presentation slides, and standalone images — all mixed together with charts, tables, and figures - into the same vector space.

Photo Search with Face Detection

Covers extracting and embedding faces from images, structuring data for visual search, and exporting to a vector database for face similarity queries.

Real-time Codebase Indexing

Build a real-time codebase index for retrieval-augmented generation (RAG) using CocoIndex and Tree-sitter. Chunk, embed, and search code with semantic understanding.