Main#23

Open
wahajali wants to merge 114 commits into dev from main

@wahajali commented Aug 4, 2025

No description provided.

alwayslove2013 and others added 30 commits April 18, 2025 12:16
Signed-off-by: min.tian <min.tian.cn@gmail.com>
Signed-off-by: min.tian <min.tian.cn@gmail.com>
* feat: add hnsw support

* refactor: minor fixes

* feat: reformat code

* fix: remove sql injections, reformat code
Signed-off-by: min.tian <min.tian.cn@gmail.com>
* Add --task-label option for cli

* Fix lint issues
…#523)

* Update cli.py

* Update clickhouse.py

* Update clickhouse.py

* Update cli.py

* Update config.py

* remove space
…ch#521)

* Add --concurrency-timeout option to avoid long waits; the default is 3600s.

* Fix lint error

* Update README.md, add --concurrency-timeout option
Signed-off-by: min.tian <min.tian.cn@gmail.com>
)

* add --num-shards option for milvus performance test case

* fix lint warning
…lliztech#530)

* Add a batch cli to support the batch execution of multiple cases.

* Add how to use
…ch#529)

* Fixing bugs in aws opensearch client

Signed-off-by: Navneet Verma <navneev@amazon.com>

* Added fp16 quantization type for aws_opensearch

Signed-off-by: Navneet Verma <navneev@amazon.com>

---------

Signed-off-by: Navneet Verma <navneev@amazon.com>
* Add Qdrant local

* Add support for local setup without authentication api-key for weaviate

* Expose HNSW index parameters to cli for weaviate

* Add hnsw-ef parameter to Qdrant local

---------

Co-authored-by: Min Tian <min.tian.cn@gmail.com>
The incorrect import is a result of previous duplicate import of the same class name and recent refactoring.
zilliztech@032515f#diff-1638e273a6d7eee7b4412bf79bda9112da86d53e18d1b650b8b80a2e1df4f7e6R15
Signed-off-by: min.tian <min.tian.cn@gmail.com>
* expose lancedb index parameters to the cli interface

* fix formatting issues
…t multi-client load data (zilliztech#527)

* fix OpenSearch frontend error: validation error for AWSOpenSearchIndexConfig index_thread_qty_during_force_merge

* add parameters of aws opensearch in frontend.

* fixed OpenSearch UI issue; add multi-client indexing and parameters for the hnsw engine and metric-type.

* fixed OpenSearch UI issue; add multi-client indexing and parameters for the hnsw engine and metric-type.

* fixed OpenSearch UI issue; add multi-client indexing and parameters for the hnsw engine and metric-type.

* clean logs

* clean comments

* clean logs

* clean comments

* fixed some issues; add warmup before the search test only.

* fixed some issues with index creation and multi-client loading

* support faiss fp16

* clear comment

* recover some code format

* recover some code format

* recover some code format

* recover some code format

* recover some code format

* resolved conflict with pr: zilliztech#529
* support oceanbase

* fix bug

* fix bug

* support IP

* support multiple zone

* support filter case

* support ivf

* support hnsw_bq and extra info

* optimized formatting

* optimized formatting

* Revert dbCaseConfigs.py to match remote main branch

* resolve review comments

* Update __init__.py

* Addressed review comments

* Fixed code style issues
This commit marks the milestone release of VectorDBBench 1.0, introducing a wide range of new features, major enhancements, and updated benchmarks.

Key changes include:
- UI: Introduce a brand new homepage and navigation bar. The new design integrates powerful front-end pages for intuitive test result analysis and visualization.
- Cases: Add new label-filter test cases. This allows testing search performance with metadata filters using expressions like color == "red". Initial support includes Milvus, Zilliz Cloud, Elasticsearch Cloud, Qdrant Cloud, Pinecone, and OpenSearch (AWS).
- Cases: Implement new streaming test cases. These cases are designed to measure search performance while data is actively being inserted, simulating real-world "read-while-writing" scenarios.
- Dataset: Add the new BioASQ dataset. This dataset is 1024-dimensional and comes in 1M and 10M sizes, enriching the diversity of our test data.
- Custom Dataset: Enhance the custom dataset functionality. Users now have more flexible configuration options to simulate their own data distributions and schemas better.
- New Results: Re-run and update all benchmark results for `Milvus`, `ZillizCloud`, `ElasticCloud`, `QdrantCloud`, `Pinecone`, and `OpenSearch(AWS)` to reflect their latest performance on the new test cases.
Signed-off-by: min.tian <min.tian.cn@gmail.com>
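The label-filter cases described in the release notes above combine a metadata predicate such as color == "red" with a nearest-neighbour search. The following is a minimal brute-force sketch of that idea, not VectorDBBench's implementation: the row layout and the `filtered_search` helper are assumptions made for illustration only.

```python
import math


def filtered_search(rows, query, label, k):
    """Return ids of the k rows nearest to `query` among rows whose label matches.

    `rows` is a list of (id, vector, label) tuples; this layout is hypothetical.
    """
    # Apply the metadata filter first (the equivalent of color == "red").
    candidates = [row for row in rows if row[2] == label]
    # Rank the surviving rows by Euclidean distance to the query vector.
    candidates.sort(key=lambda row: math.dist(row[1], query))
    return [row[0] for row in candidates[:k]]


rows = [
    (1, [0.0, 0.0], "red"),
    (2, [1.0, 1.0], "blue"),
    (3, [3.0, 3.0], "red"),
    (4, [0.9, 0.9], "red"),
]
nearest_red = filtered_search(rows, [1.0, 1.0], "red", k=2)
```

Row 2 is the closest vector overall but is excluded by the filter, which is the behaviour a filtered-search benchmark has to account for when computing ground truth.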
alwayslove2013 and others added 30 commits November 25, 2025 23:36
Signed-off-by: min.tian <min.tian.cn@gmail.com>
- UUID PRIMARY KEY for distributed write performance
- Connection pooling with 100+ base connections
- Comprehensive retry logic for transient errors (40001, 40003)
- C-SPANN vector index with tunable parameters
- CLI integration with full parameter support

Co-authored-by: Min Tian <min.tian.cn@gmail.com>
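The retry logic for transient errors (40001, 40003) mentioned in the commit above could look roughly like the sketch below. It is an illustration under assumptions, not the code from the commit: `TransientDBError` is a hypothetical stand-in for whatever exception the database driver raises, and the attempt count and backoff values are invented.

```python
import random
import time

# SQLSTATE codes named in the commit message as transient.
RETRYABLE_SQLSTATES = {"40001", "40003"}


class TransientDBError(Exception):
    """Hypothetical stand-in for a driver error that carries a SQLSTATE code."""

    def __init__(self, sqlstate):
        super().__init__(f"SQLSTATE {sqlstate}")
        self.sqlstate = sqlstate


def run_with_retry(operation, max_attempts=5, base_delay=0.1, sleep=time.sleep):
    """Call operation(); retry with exponential backoff on retryable SQLSTATEs."""
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except TransientDBError as exc:
            # Give up on non-retryable codes or once the attempts are used up.
            if exc.sqlstate not in RETRYABLE_SQLSTATES or attempt == max_attempts:
                raise
            # Exponential backoff plus a little jitter to avoid lockstep retries.
            delay = base_delay * (2 ** (attempt - 1))
            sleep(delay + random.uniform(0, delay / 10))
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.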
Signed-off-by: min.tian <min.tian.cn@gmail.com>
…illiztech#647)

Two critical fixes for multi-node CockroachDB clusters:

1. Connection Timeout Handling:
   On multi-node v25.4 clusters, CREATE VECTOR INDEX from subprocess
   contexts experiences a 30-second connection timeout. The index
   creation continues successfully in the background. This fix detects
   the timeout and polls for completion (up to 5 minutes).

2. Vector Index Usage:
   Fixed vector_search_beam_size not being set on pooled connections,
   causing queries to use full table scan instead of the vector index.
   Now configures every connection from the pool with proper beam size.

Testing:
- Single-node: Works without timeout (178s index creation)
- Multi-node: Successfully handles timeout and completes (131s total)
- Vector index: Now properly used for all searches (verified with EXPLAIN)
- Both achieve ~83% recall with good QPS

Fixes issues where:
- Benchmarks would fail despite successful index creation
- Searches were slow due to full table scans instead of index usage
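The first fix above (detect the connection timeout, then poll until the index build finishes, for up to 5 minutes) can be sketched as a small polling loop. This is a hedged illustration: `wait_for_index` and its `index_exists` callback are hypothetical names, and how the real code checks for the index is not shown in the commit message.

```python
import time


def wait_for_index(index_exists, timeout_s=300.0, poll_interval_s=5.0,
                   clock=time.monotonic, sleep=time.sleep):
    """Poll index_exists() until it returns True or timeout_s elapses.

    Returns True if the index appeared in time, False otherwise.
    `index_exists` is a caller-supplied check (for example a catalog query).
    """
    deadline = clock() + timeout_s
    while True:
        if index_exists():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_interval_s)
```

A caller would invoke this only after the CREATE VECTOR INDEX statement reports a timeout, and treat a False result as a genuine failure.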
Signed-off-by: min.tian <min.tian.cn@gmail.com>
* [Bugfix] Several fixes to AliSQL

Summary
=======
1. Online users cannot directly `SET GLOBAL`
2. Online instances use strict SQL mode by default, which can affect vector
insertion.

* [Feature] AliSQL supports custom db name

---------

Co-authored-by: wangzihao.wzh <guguang.wzh@alibaba-inc.com>
…illiztech#655)

- Add frontend Streamlit UI configuration
  - Logo, parameter types, and input configurations
  - Loading and performance config for all index parameters
- Add SSL/TLS support for CockroachDB Cloud clusters
  - Support verify-full, verify-ca, require, disable SSL modes
  - CLI parameters: --sslmode, --sslrootcert
  - SSL certificates passed to connection pool and optimize connections
- Optimize connection handling for multi-node clusters
  - Connection pooling for concurrent searches (enables true parallelism)
  - Settings in connection string to reduce per-connection overhead
  - Dedicated connection without statement_timeout for index creation
  - Handles background index creation with timeout detection and polling
- Handle CockroachDB-specific schema changes
  - Cancel running schema jobs before DROP TABLE (prevents blocking)
  - Transaction conflict retry logic for aborted transactions
- All linting checks passing
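As a rough illustration of the SSL options listed above, the sketch below builds a PostgreSQL-style connection URI that carries `sslmode` and `sslrootcert` as query parameters. The `build_conninfo` helper, the host, user, and database values are assumptions for this example and are not taken from the commit; only the four SSL mode names come from the commit message.

```python
from urllib.parse import urlencode

# SSL modes named in the commit message.
VALID_SSLMODES = {"verify-full", "verify-ca", "require", "disable"}


def build_conninfo(host, port, dbname, user, sslmode="disable", sslrootcert=None):
    """Build a postgresql:// URI with the SSL settings in the query string."""
    if sslmode not in VALID_SSLMODES:
        raise ValueError(f"unsupported sslmode: {sslmode!r}")
    params = {"sslmode": sslmode}
    if sslrootcert:
        # Path to the CA certificate used to verify the server.
        params["sslrootcert"] = sslrootcert
    return f"postgresql://{user}@{host}:{port}/{dbname}?{urlencode(params)}"


# Example values only; a real cluster would supply its own host and cert path.
uri = build_conninfo("localhost", 26257, "defaultdb", "root",
                     sslmode="verify-full", sslrootcert="/certs/ca.crt")
```

Putting the settings in the connection string means every connection created by a pool picks them up without per-connection setup.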
…opensearch (zilliztech#663)

* 1. Optimized log output when loading data
2. Modified the create-index DSL for on-disk mode.

* refine format
* made log util use env vars for custom log file placement

* cleaned up log file config a bit

It doesn't seem necessary to split the configuration of log file into
log path and log name, but cleaner to just combine them.
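A minimal sketch of the approach described above, where a single environment variable holds the full log file path instead of separate path and name settings. The variable name `VDBBENCH_LOG_FILE`, the default file name, and the `build_logger` helper are hypothetical and chosen only for this illustration.

```python
import logging
import os

# Hypothetical variable name and default, chosen for this sketch only.
LOG_FILE_ENV = "VDBBENCH_LOG_FILE"
DEFAULT_LOG_FILE = "vectordb_bench.log"


def build_logger(name="vdbbench_sketch"):
    """Create a logger that writes to the file named by LOG_FILE_ENV."""
    log_file = os.environ.get(LOG_FILE_ENV, DEFAULT_LOG_FILE)
    log_dir = os.path.dirname(log_file)
    if log_dir:
        # Create the target directory so a custom location works out of the box.
        os.makedirs(log_dir, exist_ok=True)
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid stacking handlers on repeated calls
        handler = logging.FileHandler(log_file)
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
        logger.addHandler(handler)
    return logger
```

With one combined setting, the directory and file name are derived from the same value, so they cannot drift apart.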
…h#670)

- Use HDR Histogram for memory-efficient latency tracking
- Collect p99, p95, avg latency at each concurrency level per stage
- Add Concurrent Search Performance UI with single-stage and multi-stage views
- Closes zilliztech#638
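The latency tracking in the commit above relies on an HDR Histogram. The sketch below is a deliberately simplified stand-in, not the HDR Histogram library or its API: it uses logarithmic buckets to show why such a structure keeps memory fixed while still answering p95/p99 queries approximately. The class name, bucket resolution, and value range are assumptions.

```python
import math


class LatencyHistogram:
    """Fixed-memory latency histogram with logarithmic buckets.

    Memory depends on the bucket count, not on the number of samples.
    """

    def __init__(self, min_ms=0.01, max_ms=60_000.0, buckets_per_decade=100):
        self.min_ms = min_ms
        self.per_decade = buckets_per_decade
        decades = math.log10(max_ms / min_ms)
        self.n_buckets = int(math.ceil(decades * buckets_per_decade)) + 1
        self.counts = [0] * self.n_buckets
        self.total = 0
        self.sum_ms = 0.0

    def _index(self, value_ms):
        clamped = max(value_ms, self.min_ms)
        idx = int(math.log10(clamped / self.min_ms) * self.per_decade)
        return min(idx, self.n_buckets - 1)

    def record(self, value_ms):
        self.counts[self._index(value_ms)] += 1
        self.total += 1
        self.sum_ms += value_ms

    def percentile(self, p):
        """Return the upper edge of the bucket containing the p-th percentile."""
        if self.total == 0:
            raise ValueError("no samples recorded")
        target = max(1, math.ceil(self.total * p / 100.0))
        seen = 0
        for idx, count in enumerate(self.counts):
            seen += count
            if seen >= target:
                return self.min_ms * 10 ** ((idx + 1) / self.per_decade)
        return self.min_ms * 10 ** (self.n_buckets / self.per_decade)

    def mean(self):
        return self.sum_ms / self.total


# One histogram per concurrency level, as the commit describes per stage.
per_level = {concurrency: LatencyHistogram() for concurrency in (1, 5, 10)}
for latency_ms in range(1, 101):
    per_level[10].record(float(latency_ms))
```

With 100 buckets per decade, each reported percentile is within roughly 2.4% of the true value, which is the accuracy-for-memory trade-off this kind of histogram makes.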
- Add SCANN index type for Milvus
- Build params: nlist (default: 1024), with_raw_data (default: false)
- Search params: reorder_k (default: 100)
- Add UI configuration for SCANN index

Signed-off-by: JackLCL <chenglong.li@zilliz.com>
- Add nprobe search parameter for SCANN index (default: 64)
- Enable nprobe UI configuration for SCANN_MILVUS index type

Signed-off-by: JackLCL <chenglong.li@zilliz.com>
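The SCANN defaults listed in the two commits above can be summarised as a pair of small config objects. The class names here are hypothetical and do not mirror VectorDBBench's actual config classes; only the parameter names and default values come from the commit messages.

```python
from dataclasses import asdict, dataclass


@dataclass
class ScannBuildParams:
    """Index build parameters with the defaults stated in the commit."""
    nlist: int = 1024
    with_raw_data: bool = False


@dataclass
class ScannSearchParams:
    """Search parameters with the defaults stated in the commits."""
    reorder_k: int = 100
    nprobe: int = 64


build_params = asdict(ScannBuildParams())
search_params = asdict(ScannSearchParams(nprobe=128))  # override one default
```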
Signed-off-by: min.tian <min.tian.cn@gmail.com>
Signed-off-by: min.tian <min.tian.cn@gmail.com>
* Add Chroma cli to VectorDBBench

* Fix lint issues

* Add password auth functionality to chroma CLI

---------

Co-authored-by: bpietrzak <>
Signed-off-by: yangxuan <xuan.yang@zilliz.com>
Signed-off-by: yangxuan <xuan.yang@zilliz.com>