This project includes 2 scripts:
- fetch_data.py to query data from PredictHQ - Will output a file called Output.json
- process_file.py to prepare embeddings for the data - Will output a file called for_collection.json
To import the data to a Couchbase collection:
- Install cbimoprt
- Run cbimport:
./cbimport json --cluster couchbases://<yourcluster> --username <your user> --password <your password> --bucket <bucket> --scope-collection-exp "<scope>.<collection>" --dataset for_collection.json --generate-key ‘%id%’ --cacert <path to couchbase certificate> --format lines