fix: expose do_reading_order and force_backend_text pipeline options by Frank-Schruefer · Pull Request #117 · docling-project/docling-jobkit

Frank-Schruefer · 2026-04-05T20:53:50Z

Problem

PdfPipelineOptions already supports do_reading_order and force_backend_text, but these options were not reachable via the API:

ConvertDocumentsOptions had no corresponding fields
manager.py did not pass them through to PdfPipelineOptions

Fix

Add do_reading_order field to ConvertDocumentsOptions (default true)
Add force_backend_text field to ConvertDocumentsOptions (default false)
Wire both fields through in manager.py
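
The three steps above can be sketched as follows. This is a minimal illustration, not the actual diff: `ConvertOptionsSketch`, `PipelineOptionsSketch`, and `build_pipeline_options` are hypothetical stand-ins for `ConvertDocumentsOptions`, `PdfPipelineOptions`, and the code in manager.py, and plain dataclasses are assumed here so the example stays self-contained. Only the two field names and their defaults come from this PR.

```python
from dataclasses import dataclass


@dataclass
class ConvertOptionsSketch:
    """Hypothetical stand-in for the API-level ConvertDocumentsOptions."""

    do_reading_order: bool = True     # default stated in this PR
    force_backend_text: bool = False  # default stated in this PR


@dataclass
class PipelineOptionsSketch:
    """Hypothetical stand-in for PdfPipelineOptions."""

    do_reading_order: bool = True
    force_backend_text: bool = False


def build_pipeline_options(request: ConvertOptionsSketch) -> PipelineOptionsSketch:
    """Copy the two request fields onto the pipeline options (the wiring step)."""
    return PipelineOptionsSketch(
        do_reading_order=request.do_reading_order,
        force_backend_text=request.force_backend_text,
    )


# A caller turns off reading-order prediction; force_backend_text keeps its default.
opts = build_pipeline_options(ConvertOptionsSketch(do_reading_order=False))
print(opts)
```

Because both defaults match the behavior before the change, callers that do not set either field should see no difference.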

Why it matters

do_reading_order=false is useful for PDFs where the reading-order predictor produces incorrect results (e.g. scanned PDFs with a native text layer and many small orphan clusters). force_backend_text is useful for PDFs with reliable programmatic text layers where layout-model text detection is unnecessary.

PdfPipelineOptions already supports do_reading_order and force_backend_text, but ConvertDocumentsOptions had no corresponding fields and manager.py did not pass them through to the pipeline. Add the missing fields and wire them up so callers can control reading-order prediction and native text extraction via the API.

Signed-off-by: Frank Schruefer <frank.schruefer@t-online.de>
Signed-off-by: stone <frank.schruefer@t-online.de>

github-actions · 2026-04-05T20:54:01Z

✅ DCO Check Passed

Thanks @Frank-Schruefer, all your commits are properly signed off. 🎉

mergify · 2026-04-05T20:54:26Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

Frank-Schruefer wants to merge 1 commit into docling-project:main from Frank-Schruefer:fix/expose-do-reading-order-force-backend-text
