Build an incremental AI pipeline that extracts invoice fields from PDFs in Azure Blob Storage and loads them into Snowflake, using CocoIndex, OpenAI GPT-4o, and a ~50-line custom Snowflake target. It is an open-source alternative to Snowflake Openflow and Cortex Document AI for unstructured ETL.