Skip to content

feat: Add Custom Dataset support for Streaming Tests#660

Merged
alwayslove2013 merged 1 commit intozilliztech:mainfrom
Akhil-Pathivada:feat/custom-streaming-test-cluster
Dec 5, 2025
Merged

feat: Add Custom Dataset support for Streaming Tests#660
alwayslove2013 merged 1 commit intozilliztech:mainfrom
Akhil-Pathivada:feat/custom-streaming-test-cluster

Conversation

@Akhil-Pathivada
Copy link
Contributor

@Akhil-Pathivada Akhil-Pathivada commented Dec 1, 2025

Summary

VectorDBBench supports custom datasets for Performance Tests (search-only benchmarks). This PR extends that support to Streaming Tests (concurrent insertion + search benchmarks), enabling users to evaluate streaming performance on their domain-specific or proprietary datasets—a critical capability for production workload simulation.

Changes

  • Backend: Extended StreamingPerformanceCase to accept custom dataset configurations
  • Frontend - Run Test Page: Added new "Custom Streaming Test" cluster displaying custom streaming datasets as individual checkboxes
  • Frontend - Custom Page: Added dedicated streaming dataset management section with create/edit/delete functionality
  • Configuration: Added case_type field to differentiate streaming and performance datasets in custom_case.json

New Files

  • vectordb_bench/frontend/components/custom/displayCustomStreamingCase.py

Modified Files

  • vectordb_bench/backend/cases.py
  • vectordb_bench/frontend/components/custom/getCustomConfig.py
  • vectordb_bench/frontend/config/dbCaseConfigs.py
  • vectordb_bench/frontend/components/run_test/caseSelector.py
  • vectordb_bench/frontend/pages/custom.py
  • vectordb_bench/custom/custom_case.json

User Workflow

  1. Navigate to /custom page
  2. Add new streaming dataset in "Streaming Test Datasets" section
  3. Configure dataset parameters (name, path, dimensions, file names, etc.)
  4. Save configuration
  5. Navigate to /run_test page
  6. Select custom streaming dataset from "Custom Streaming Test" cluster
  7. Configure streaming parameters (insert_rate, search_stages, concurrencies)
  8. Run test

Testing

  • Custom streaming dataset can be created and saved
  • Custom streaming dataset appears in "Custom Streaming Test"
  • Streaming test runs successfully with custom dataset
  • Multiple training files are loaded correctly

Screenshots

Screenshot 2025-12-02 at 01 24 27 Screenshot 2025-12-02 at 01 25 06

@Akhil-Pathivada Akhil-Pathivada force-pushed the feat/custom-streaming-test-cluster branch from 0ec239c to f0c3ea9 Compare December 1, 2025 19:52
@Akhil-Pathivada Akhil-Pathivada force-pushed the feat/custom-streaming-test-cluster branch from f0c3ea9 to a54fabf Compare December 1, 2025 19:57
@Akhil-Pathivada Akhil-Pathivada marked this pull request as ready for review December 1, 2025 20:03
@Akhil-Pathivada
Copy link
Contributor Author

/assign @alwayslove2013

@sre-ci-robot
Copy link

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: Akhil-Pathivada, alwayslove2013
To complete the pull request process, please assign xuanyang-cn after the PR has been reviewed.
You can assign the PR to them by writing /assign @xuanyang-cn in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@alwayslove2013 alwayslove2013 merged commit 45d1ff5 into zilliztech:main Dec 5, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants