fix: Add VLM support for Docling and improve deps by erichare · Pull Request #9757 · langflow-ai/langflow

erichare · 2025-09-08T22:41:39Z

This pull request enhances the file processing pipeline by adding support for vision-language models, improving error handling in file loading, and updating dependencies to enable advanced OCR and image processing features.

Pipeline and Processing Enhancements:

Added a new "vlm" (vision-language model) pipeline option in the create_converter function, allowing files to be processed using vision-language model strategies in addition to the existing standard pipeline. This supports more advanced document understanding workflows.
Improved error handling in load_files_advanced and load_files_markdown to raise clear exceptions if no content is generated or if errors are present, ensuring more robust downstream processing.

Dependency Updates:

Added easyocr and opencv-python to the dependencies in pyproject.toml to support advanced OCR and image processing capabilities.

Project Metadata:

Updated the code_hash in the Document Q&A starter project metadata to reflect the latest changes.

coderabbitai · 2025-09-08T22:41:46Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch fix-vlm-docling

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

sonarqubecloud · 2025-09-11T22:24:57Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

erichare added 2 commits September 8, 2025 14:59

fix: Support the VLM pipeline in docling

1f0f836

fix: Add VLM support and opencv dep

2dc2399

erichare requested review from Cristhianzl and edwinjosechittilappilly September 8, 2025 22:41

github-actions Bot added the bug Something isn't working label Sep 8, 2025

[autofix.ci] apply automated fixes

df10355

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 8, 2025

[autofix.ci] apply automated fixes (attempt 2/3)

7767daa

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 8, 2025

jordanrfrazier reviewed Sep 9, 2025

View reviewed changes

Comment thread pyproject.toml

jordanrfrazier approved these changes Sep 9, 2025

View reviewed changes

github-actions Bot added the lgtm This PR has been approved by a maintainer label Sep 9, 2025

Update comments and fix ruff errors

95ace2d

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025

[autofix.ci] apply automated fixes

d052356

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025

Merge branch 'release-1.6.0' into fix-vlm-docling

1bab5d3

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025

Hide OCR engine when VLM selected

7caa6bd

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025

Merge branch 'release-1.6.0' into fix-vlm-docling

653ae4b

github-actions Bot added bug Something isn't working and removed bug Something isn't working labels Sep 9, 2025

Merge branch 'release-1.6.0' into fix-vlm-docling

0fe4866