officeParser is a strictly-typed, zero-dependency-core library for Node.js and the browser that transforms complex office documents (DOCX, PDF, XLSX, etc.) into a unified, hierarchical Abstract Syntax Tree (AST). It is a critical piece of infrastructure for AI ingestion pipelines, search engines, and document-management tools, providing the only JavaScript/TypeScript solution that maintains format parity across both server and client runtimes.

Fund this project


Tue, 14 Apr 2026 14:42:43 UTC

There was a problem with this listing's funding.json manifest. If it is not fixed, the listing will be removed from the portal.

Crawl error

error: https://github.com/harshankur/officeParser/blob/master/.well-known/funding-manifest-urls?raw=true returned 502

Unverified URL

The funding manifest has not provided proof via wellKnown that this link is associated with it. Learn more.

Continue