Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
44 commits
Select commit Hold shift + click to select a range
a63886d
design
JonnyTran Aug 15, 2025
0eb87de
Update rq dependency to version 2.4.1 and adjust pdm.lock accordingly
JonnyTran Aug 15, 2025
4553a9e
Merge branch 'develop' into feat/ocr-rq-jobs-workflow
JonnyTran Aug 15, 2025
d166fbd
design
JonnyTran Aug 17, 2025
8f4d831
design v2
JonnyTran Aug 18, 2025
77c9a03
design v3
JonnyTran Aug 18, 2025
664515b
design v3
JonnyTran Aug 18, 2025
51ec83b
design v4
JonnyTran Aug 18, 2025
9c21b3c
design v5
JonnyTran Aug 18, 2025
f8cdc18
design v5
JonnyTran Aug 18, 2025
359a454
1.1 Create combined PDF processing job function
JonnyTran Aug 18, 2025
1b4e5f2
Refactor PDF processing job function for improved efficiency
JonnyTran Aug 18, 2025
e1ee266
1.5 Update process_bulk_upload function
JonnyTran Aug 18, 2025
2d4f2ce
refactor
JonnyTran Aug 18, 2025
0f75326
task 2.4: Add workflow status monitoring
JonnyTran Aug 18, 2025
a07b386
fix AsyncSessionLocal
JonnyTran Aug 18, 2025
42e17c2
Refactor database migration and PDF workflow to enhance structure and…
JonnyTran Aug 18, 2025
9cd507b
Merge branch 'feat/ocr-rq-jobs-workflow' of github.com:Extralit/extra…
JonnyTran Aug 18, 2025
10a91c8
Refactor PDF workflow functions to use workspace name instead of ID a…
JonnyTran Aug 18, 2025
71b08cf
fix
JonnyTran Aug 18, 2025
fe3b130
Update metadata fields to be optional and improve error handling in P…
JonnyTran Aug 18, 2025
dc2b043
fix REDIS_CONNECTION arg
JonnyTran Aug 18, 2025
9f3cb1c
refactor
JonnyTran Aug 18, 2025
72c6dec
latest
JonnyTran Aug 19, 2025
33a52c3
Update metadata fields in Analysis, Preprocessing, Text Extraction, T…
JonnyTran Aug 20, 2025
f7ab092
feat: Implement RQ-based PyMuPDF integration for async PDF processing…
priyankeshh Aug 20, 2025
ed9eab0
feat: Add OCR_QUEUE for improved job handling in PDF workflows
JonnyTran Aug 20, 2025
1a6d9fc
Merge branch 'develop' into feat/ocr-rq-jobs-workflow
JonnyTran Aug 21, 2025
00fd8e5
refactor: Rename ImportHistory table to 'imports' for consistency
JonnyTran Aug 21, 2025
2aeb028
refactoring
JonnyTran Aug 21, 2025
ce926f5
refactor
JonnyTran Aug 21, 2025
8164268
renames
JonnyTran Aug 21, 2025
2f58010
requirement changes
JonnyTran Aug 21, 2025
8fcf707
use group instead of job ids
JonnyTran Aug 21, 2025
7f24d7d
updated DocumentWorkflow class
JonnyTran Aug 21, 2025
1e6a213
rq.Group updated tasks and design
JonnyTran Aug 21, 2025
7a3d09a
Refactor workflows to use RQ Groups for job tracking
JonnyTran Aug 21, 2025
736e76e
Add TextExtractionMetadata schema and update bulk upload processing
JonnyTran Aug 21, 2025
8aa59be
Update job_ids type in DocumentsBulkResponse and refactor create_docu…
JonnyTran Aug 21, 2025
ef123c5
2.1 Implement RQ Groups-based job querying
JonnyTran Aug 21, 2025
6de3533
Add CLI and API support for PDF workflow management
JonnyTran Aug 21, 2025
0c6b7f3
fixes
JonnyTran Aug 21, 2025
598776c
tests
JonnyTran Aug 21, 2025
fbec8d0
fix tests
JonnyTran Aug 21, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .kiro/specs/papers-library-importer/design.md
Original file line number Diff line number Diff line change
Expand Up @@ -548,7 +548,7 @@ This structure enables:
**New Model: ImportHistory**
```python
class ImportHistory(DatabaseModel):
__tablename__ = "import_history"
__tablename__ = "imports"

workspace_id: Mapped[UUID] = mapped_column(ForeignKey("workspaces.id", ondelete="CASCADE"), index=True)
user_id: Mapped[UUID] = mapped_column(ForeignKey("users.id", ondelete="CASCADE"), index=True)
Expand Down
Loading
Loading