officeParser is a strictly-typed, zero-dependency-core library for Node.js and the browser that transforms complex office documents (DOCX, PDF, XLSX, etc.) into a unified, hierarchical Abstract Syntax Tree (AST). It is a critical piece of infrastructure for AI ingestion pipelines, search engines, and document-management tools, providing the only JavaScript/TypeScript solution that maintains format parity across both server and client runtimes.
Fund this project