[4.5.4] - 2026-03-23
Fixed
- PDF image extraction panic on mismatched buffer lengths (#552): Replaced `assert!` with graceful error handling. Malformed PDF images are now skipped instead of panicking. Regression from v4.5.0.
- `pdf` feature compilation without `layout-detection` (#550): `config.layout` reference gated behind `#[cfg(feature = "layout-detection")]`.
- WASM module resolution in Supabase/Deno edge functions (#551): Added explicit `package.json` exports and Deno detection in wasm-loader.
- `zip` dependency pinned below 7.4: Avoids let-chain build failures on some stable Rust toolchains (#549).
- Vendored HWP text extraction: Replaced external `hwpers` crate with vendored subset (~1,650 lines). Eliminates `zip 2.x` transitive dependency that caused WASM/CI build failures.
- Ruby binding missing `table_model` field in `LayoutDetectionConfig` initializer.
- Clippy/unused variable warnings in table recognition and pipeline modules.
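
For readers unfamiliar with the pattern behind the #552 fix, the following is a minimal sketch of swapping a panicking `assert!` for a recoverable error and skipping the offending image. The names `ImageError`, `validate_rgb_buffer`, and `extract_valid_images`, and the 3-bytes-per-pixel layout, are assumptions made for illustration; they are not taken from the library's source.

```rust
/// Hypothetical error type for this sketch only.
#[derive(Debug, PartialEq)]
enum ImageError {
    BufferLengthMismatch { expected: usize, actual: usize },
}

/// Checks that an RGB buffer matches its declared dimensions.
/// Returns an error instead of panicking, which is the general shape of
/// replacing `assert!(data.len() == expected)` with graceful handling.
fn validate_rgb_buffer(width: usize, height: usize, data: &[u8]) -> Result<(), ImageError> {
    // Assumption: 3 bytes per pixel. Overflow is treated as "cannot match".
    let expected = width
        .checked_mul(height)
        .and_then(|n| n.checked_mul(3))
        .unwrap_or(usize::MAX);
    if data.len() != expected {
        return Err(ImageError::BufferLengthMismatch {
            expected,
            actual: data.len(),
        });
    }
    Ok(())
}

/// Keeps only the images whose buffers validate; malformed ones are skipped.
fn extract_valid_images<'a>(images: &[(usize, usize, &'a [u8])]) -> Vec<&'a [u8]> {
    images
        .iter()
        .filter_map(|&(w, h, data)| validate_rgb_buffer(w, h, data).ok().map(|_| data))
        .collect()
}
```

With this shape, one malformed image in a document costs only that image rather than aborting the whole extraction.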
Added
- `prepend_heading_context` chunking option: When `true` and `chunker_type` is `Markdown`, prepends the heading hierarchy path (e.g. `# Title > ## Section`) to each chunk's content string. Useful for RAG pipelines where chunks need self-contained structural context. Available across all 10 language bindings, CLI, and WASM. Includes fixture-driven e2e tests and documentation for all languages.
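
To illustrate what a heading hierarchy path looks like in practice, here is a rough sketch of how one could be computed and prepended to a chunk. It is not the library's implementation: the helpers `heading_path_at` and `prepend_heading_context`, the line-index parameter, and the blank-line separator between the path and the chunk are all assumptions made for this example, and the sketch does not skip `#` lines inside fenced code blocks.

```rust
/// Returns the heading path in effect at `line_index` (0-based),
/// e.g. "# Title > ## Section". Hypothetical helper for illustration.
fn heading_path_at(markdown: &str, line_index: usize) -> String {
    // Stack of (level, heading text) for the currently open headings.
    let mut stack: Vec<(usize, String)> = Vec::new();
    for line in markdown.lines().take(line_index + 1) {
        let trimmed = line.trim_start();
        let level = trimmed.chars().take_while(|c| *c == '#').count();
        // ATX heading: 1 to 6 '#' characters followed by a space.
        let is_heading = (1..=6).contains(&level) && trimmed[level..].starts_with(' ');
        if is_heading {
            // A new heading closes every open heading at the same or a deeper level.
            while stack.last().map_or(false, |(l, _)| *l >= level) {
                stack.pop();
            }
            stack.push((level, trimmed.trim_end().to_string()));
        }
    }
    stack
        .iter()
        .map(|(_, heading)| heading.as_str())
        .collect::<Vec<_>>()
        .join(" > ")
}

/// Prepends the heading path to a chunk's content. The "\n\n" separator is
/// an assumption of this sketch, not documented behavior.
fn prepend_heading_context(markdown: &str, chunk_start_line: usize, chunk: &str) -> String {
    let path = heading_path_at(markdown, chunk_start_line);
    if path.is_empty() {
        chunk.to_string()
    } else {
        format!("{path}\n\n{chunk}")
    }
}
```

A chunk that starts under `## Section` inside `# Title` would then carry `# Title > ## Section` at its top, so a retriever sees where the text came from without needing the rest of the document.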