Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
b432bec
Annotation Tool: data is not persisted when using local version #853
venuraja79 Feb 21, 2021
f6e9e96
Merge branch 'deepset-ai:master' into master
venuraja79 May 10, 2021
70a3fa9
First version of weaviate
venuraja79 May 15, 2021
ee835db
First version of weaviate
venuraja79 May 15, 2021
bb6f733
Merge remote-tracking branch 'origin/master'
venuraja79 May 15, 2021
30cda6a
First version of weaviate
venuraja79 May 15, 2021
c10a54e
Updated comments
venuraja79 May 16, 2021
1e69d28
Updated comments
venuraja79 May 16, 2021
0c7977d
ran query, get and write tests
venuraja79 May 16, 2021
3a62ac5
update embeddings, dynamic schema and filters implemented
venuraja79 Jun 5, 2021
4a67f16
Initial set of tests and fixes
venuraja79 Jun 5, 2021
71b9bb4
Tests added for update_embeddings and delete documents
venuraja79 Jun 6, 2021
fb87a62
Merge branch 'deepset-ai:master' into master
venuraja79 Jun 7, 2021
0bfe557
introduced duplicate documents fix
venuraja79 Jun 8, 2021
4bf532c
fixed mypy errors
venuraja79 Jun 8, 2021
500ec21
Added Weaviate to requirements
venuraja79 Jun 8, 2021
961110e
Fix the weaviate docker env variables
venuraja79 Jun 8, 2021
0a161f9
Fixing test dependencies for now
venuraja79 Jun 8, 2021
3a0160a
Created weaviate test marker and fixed query
venuraja79 Jun 8, 2021
80bc973
Update docstring
tholor Jun 9, 2021
75725fb
Add documentation
tholor Jun 9, 2021
2382cac
Bump up weaviate version
venuraja79 Jun 10, 2021
67a5c46
Merge remote-tracking branch 'origin/master'
venuraja79 Jun 10, 2021
ec5ba0d
Bump up weaviate version in documentation
venuraja79 Jun 10, 2021
80162fb
Bump up weaviate version in documentation
venuraja79 Jun 10, 2021
7829884
Updgrade weaviate version
tholor Jun 10, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -76,6 +76,9 @@ jobs:
- name: Run Milvus
run: docker run -d -p 19530:19530 -p 19121:19121 milvusdb/milvus:1.1.0-cpu-d050721-5e559c

- name: Run Weaviate
run: docker run -d -p 8080:8080 --name haystack_test_weaviate --env AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED='true' --env PERSISTENCE_DATA_PATH='/var/lib/weaviate' semitechnologies/weaviate:1.4.0

- name: Run GraphDB
run: docker run -d -p 7200:7200 --name haystack_test_graphdb deepset/graphdb-free:9.4.1-adoptopenjdk11

Expand Down
41 changes: 40 additions & 1 deletion docs/_src/usage/usage/document_store.md
Original file line number Diff line number Diff line change
Expand Up @@ -116,6 +116,27 @@ from haystack.document_store import SQLDocumentStore
document_store = SQLDocumentStore()
```

</div>
</div>

<div class="tab">
<input type="radio" id="tab-1-6" name="tab-group-1">
<label class="labelouter" for="tab-1-6">Weaviate</label>
<div class="tabcontent">

The `WeaviateDocumentStore` requires a running Weaviate Server.
You can start a basic instance like this (see Weaviate docs for details):
```
docker run -d -p 8080:8080 --env AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED='true' --env PERSISTENCE_DATA_PATH='/var/lib/weaviate' semitechnologies/weaviate:1.4.0
```

Afterwards, you can use it in Haystack:
```python
from haystack.document_store import WeaviateDocumentStore

document_store = WeaviateDocumentStore()
```

</div>
</div>

Expand Down Expand Up @@ -264,6 +285,24 @@ The Document Stores have different characteristics. You should choose one depend
</div>
</div>


<div class="tab">
<input type="radio" id="tab-2-6" name="tab-group-2">
<label class="labelouter" for="tab-2-6">Weaviate</label>
<div class="tabcontent">

**Pros:**
- Simple vector search
- Stores everything in one place: documents, meta data and vectors - so less network overhead when scaling this up
- Allows combination of vector search and scalar filtering, i.e. you can filter for a certain tag and do dense retrieval on that subset

**Cons:**
- Less options for ANN algorithms than FAISS or Milvus
- No BM25 / Tf-idf retrieval

</div>
</div>

</div>

<div class="recommendation">
Expand All @@ -276,4 +315,4 @@ The Document Stores have different characteristics. You should choose one depend

**Vector Specialist:** Use the `MilvusDocumentStore`, if you want to focus on dense retrieval and possibly deal with larger datasets

</div>
</div>
Loading