Skip to content

fix: Add VLM support for Docling and improve deps#9757

Merged
erichare merged 31 commits into
release-1.6.0from
fix-vlm-docling
Sep 11, 2025
Merged

fix: Add VLM support for Docling and improve deps#9757
erichare merged 31 commits into
release-1.6.0from
fix-vlm-docling

Conversation

@erichare
Copy link
Copy Markdown
Collaborator

@erichare erichare commented Sep 8, 2025

This pull request enhances the file processing pipeline by adding support for vision-language models, improving error handling in file loading, and updating dependencies to enable advanced OCR and image processing features.

Pipeline and Processing Enhancements:

  • Added a new "vlm" (vision-language model) pipeline option in the create_converter function, allowing files to be processed using vision-language model strategies in addition to the existing standard pipeline. This supports more advanced document understanding workflows.
  • Improved error handling in load_files_advanced and load_files_markdown to raise clear exceptions if no content is generated or if errors are present, ensuring more robust downstream processing.

Dependency Updates:

  • Added easyocr and opencv-python to the dependencies in pyproject.toml to support advanced OCR and image processing capabilities.

Project Metadata:

  • Updated the code_hash in the Document Q&A starter project metadata to reflect the latest changes.

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented Sep 8, 2025

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches
🧪 Generate unit tests
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch fix-vlm-docling

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@github-actions github-actions Bot added the bug Something isn't working label Sep 8, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 8, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 8, 2025
Comment thread pyproject.toml
@github-actions github-actions Bot added the lgtm This PR has been approved by a maintainer label Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@jordanrfrazier jordanrfrazier enabled auto-merge (squash) September 11, 2025 21:57
@erichare erichare disabled auto-merge September 11, 2025 22:00
@github-actions github-actions Bot removed the bug Something isn't working label Sep 11, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@erichare erichare enabled auto-merge (squash) September 11, 2025 22:23
@github-actions github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 11, 2025
@sonarqubecloud
Copy link
Copy Markdown

@erichare erichare merged commit f1ae031 into release-1.6.0 Sep 11, 2025
74 of 75 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working fix-for-release PR to be merged into a release branch lgtm This PR has been approved by a maintainer

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants