[Feedback] Chapter 3 - CRUD Operations

Overview of chapter 3; This chapter should explain about the basic CRUD operations. 
- First is to explain the data model of GridDB, then import existing data (use IoT sample data, same structure as explained in data model, with the import tool). Then a simple query on the existing data. Do we have an explanation of the batch ingestion using mput somewhere?
- In addition to the ingestion of existing data, user will do the simple CRUD, so explain using gs_sh how to create container, put, update, delete, and read. Explain CRUD via NoSQL interface (faster way of doing things), and also the SQL interface (more convenient) in gs_sh.
- Then explain the equivalent CRUD operations via the supported programming languages: Java, C, JDBC, Python, etc. and WebAPI.

---
- [Data model](https://www.youtube.com/playlist?list=PLMg7lY2Lux0_giXsi9Znae65IwEuOV2_5)
- (Comment 8/30) Import, Export and Query data: This should be inserted here. 
  - I think this [data set](https://www.kaggle.com/datasets/garystafford/environmental-sensor-data-132k) is good. It has 3 devices, each storing a few sensors. Please consult with Owen about this, but I think the data model would be 1 container collection storing the information for the devices (name, attributes, location, etc. not included but you can make it up), and 3 time series container for each device. 
  - I think you need to change the format prior to importing.
  - Do it all in CLI.
  - Perform some simple queries, e.g. max/min of the Carbon Dioxide, average temperature in the morning, afternoon, etc.)
  - (Added 9/14) - I think you mentioned that you were going to ingest the [data](https://www.kaggle.com/datasets/garystafford/environmental-sensor-data-132k) using a script? . This section is not about import/export tool, but how to import, export, query existing data to GridDB. So  prior to the import/export installation, let's explain that in this section we will be importing existing data that we get from Kaggle (show the actual steps from the beginning, e.g. going to kaggle, downloading the file, etc.)
    - You need to do data cleansing here, as I mentioned the file contains sensors from 3 devices. In GridDB this will be stored to 3 separate time series container + 1 container for the actual device information.  
    - First step: go to kaggle, download the csv, explain a little bit about the data (7 sensors in 1 device, and we have 3 of those).
    - As you just explained about data model in the previous chapter, current section is about that implementation. So in this case, as per GridDB recommendation (that each device to be stored as 1 time series container) you need to separate the 1 csv into 3 csvs, per device (00:0f:00:70:91:0a, 1c:bf:ce:15:ec:4d, b8:27:eb:bf:9d:51) -> please confirm with Owen but I think this should be the name of the time series containers.
    - In addition, I think you need to change the time format to GridDB format (please ask owen if there is an easy way to do this)
    - Then, prepare the 3 import files , e.g. public.00:0f:00:70:91:0a.csv -> for this it is ok to pretend that you did the hard work but you actually use your script and generate these files using export instead (hide this though). Please explain how to make the files though (e.g. the current data is already in "...","..." format so you just need to add the GridDB headers to the public.00:0f:00:70:91:0a.csv , plus explain about the properties.json)
    - Once the data preparation is done, then start the installation of imp/exp tool and actually import those 3 devices. 
    - Show that they have been imported successfully, and perform simple select query.
    - Then add a collection container containing the metadata of these devices (via gs_sh is ok).
    - Explain that if you want to upgrade from say GridDB V4.6 to V5.0, then you can use the export tool then import to do that.

    -It is ok to do it all at once and separate the video to 3, but please make sure that you put a break /pause in between so that the cut is smooth:
      - Data preparation
      - Import
      - Export
- (Comment 9/2) Add CRUD explanation via gs_sh
  - Explain that you can either use the web shell on docs.griddb.net or follow the GridDB server installation video on chapter 2
  - Content:
    - Explain about the NoSQL Interface such as put, get, TQL query. NoSQL interface is for simple/basic data access, very fast data ingestion and search, but a GridDB specific API. Perform collection and time series container creation, put to these containers, do a get, update a row, delete a row and simple query using TQL.
    - Explain about the SQL interface. GridDB SQL is SQL-92 compliant, standard API, equipped with parallel distributed processing.  Perform create table and timeseries table, insert, delete, update, select.
    - I think it is better to use similar data model as the one you ingested (the environmental sensor data), and continue adding new data. e.g. Creating a time series container for the 4th device. Deleting the container collection for device and replace with a new one. Put new rows to existing device and new one. Update past data, e.g. erroneous data. 
- [JDBC/SQL](https://www.youtube.com/playlist?list=PLMg7lY2Lux08KRdbeLZj3tV5NfUv-CqvU)
  - Comment (9/2): [Video](https://youtu.be/mtRcmVDI2Hs)
    - Amplify your voice.
    - The content is OK. Although I think JDBC is basically mainly used when you want to connect to another application via standard interface like Tableau, PowerBI and the SQL Workbench, so I think it is better to move this to Chapter 5 as a supplementary.
- [API](https://www.youtube.com/playlist?list=PLMg7lY2Lux09CO1ecpg30Q_JL4PJXXR9x)
  - Comment 9/2 - [Python API](https://www.youtube.com/watch?v=iDfhfOpeVyQ) 
    - Amplify your voice
    - Bigger text
    - Should be fixed list when you start via systemctl, you are still using the multicast. 
    - Use the same data model as the environmental sensor data. e.g. create 5th new device.
    - There should be a separate course for GridDB python alone. But for now, only the simple CRUD and connection are sufficient. No need to use these samples as is but based on it create new ones with the new environmental sensor data model. Also explain how to install using pip.
      - Use fresh installation so you know what are missing/need to be installed. 
      - Installation using [PIP](https://pypi.org/project/griddb-python/)
      - [Connection](https://github.com/griddb/python_client/blob/master/sample/Connect.py) : explain the three methods. Also explain that the default GridDB systemctl installation is using fixed list.
      - How to create [Collection Container](https://github.com/griddb/python_client/blob/master/sample/CreateCollection.py), and [Time Series Container](https://github.com/griddb/python_client/blob/master/sample/CreateTimeSeries.py) 
      - How to [Put rows](https://github.com/griddb/python_client/blob/master/sample/PutRow.py) and [Get Rows](https://github.com/griddb/python_client/blob/master/sample/GetRow.py)
      - How to [Delete rows](https://github.com/griddb/python_client/blob/master/sample/RemoveRowByRowkey.py)
      - How to [Update rows using TQL](https://github.com/griddb/python_client/blob/master/sample/UpdateRowByTQL.py)
      - [Query using TQL](https://github.com/griddb/python_client/blob/master/sample/TQLSelect.py)
  -  Comment 9/2 - [Java API](https://youtu.be/F4V-DaOPdAc) 
     - Amplify your voice
     - Bigger text
     - Should be fixed list when you start via systemctl, you are still using the multicast. 
     - Use the same data model as the environmental sensor data. e.g. create 6th new device.
     - Same as Python: explain connectivity, create collections, CRUD, Query  using TQL.
     - It is better to use the same data model throughout, to get the flow right. Currently the sample program does not really showcase the IoT, time series data. Having said that I think the current video is good and can be uploaded to youtube as is as a separate video.
- [WebAPI](https://www.youtube.com/playlist?list=PLMg7lY2Lux0-QvEqRASyZrgwOna1GS63C)
  - Comment 9/2 [Video](https://youtu.be/X-9HgJ7KwmY) 
     - Amplify your voice
     - Bigger text
     - Should be fixed list when you start via systemctl, you are still using the multicast and defaultCluster.
     - How about using postman to showcase the CRUD. Again we need to cover at least create container, put, get, delete, TQL and SQL. Please refer to this [specification](https://github.com/griddb/webapi/blob/master/WebAPISpecification.md) instead.
     - Again let's apply these CRUD operations to the existing environmental sensor data.

Import/Export: 
- You went back and forth with the settings: please prepare the script before hand so you just need to copy and paste.
- I think you mentioned that you were going to ingest the [data](https://www.kaggle.com/datasets/garystafford/environmental-sensor-data-132k) using a script? . This section is not about import/export tool, but how to import, export, query existing data to GridDB. So  prior to the import/export installation, let's explain that in this section we will be importing existing data that we get from Kaggle (show the actual steps from the beginning, e.g. going to kaggle, downloading the file, etc.)
  - You need to do data cleansing here, as I mentioned the file contains sensors from 3 devices. In GridDB this will be stored to 3 separate time series container + 1 container for the actual device information.  
  - First step: go to kaggle, download the csv, explain a little bit about the data (7 sensors in 1 device, and we have 3 of those).
  - As you just explained about data model in the previous chapter, current section is about that implementation. So in this case, as per GridDB recommendation (that each device to be stored as 1 time series container) you need to separate the 1 csv into 3 csvs, per device (00:0f:00:70:91:0a, 1c:bf:ce:15:ec:4d, b8:27:eb:bf:9d:51) -> please confirm with Owen but I think this should be the name of the time series containers.
  - In addition, I think you need to change the time format to GridDB format (please ask owen if there is an easy way to do this)
  - Then, prepare the 3 import files , e.g. public.00:0f:00:70:91:0a.csv -> for this it is ok to pretend that you did the hard work but you actually use your script and generate these files using export instead (hide this though). Please explain how to make the files though (e.g. the current data is already in "...","..." format so you just need to add the GridDB headers to the public.00:0f:00:70:91:0a.csv , plus explain about the properties.json)
  - Once the data preparation is done, then start the installation of imp/exp tool and actually import those 3 devices. 
  - Show that they have been imported successfully, and perform simple select query.
  - Then add a collection container containing the metadata of these devices (via gs_sh is ok).
  - Explain that if you want to upgrade from say GridDB V4.6 to V5.0, then you can use the export tool then import to do that.

It is ok to do it all at once and separate the video to 3, but please make sure that you put a break /pause in between so that the cut is smooth:
- Data preparation
- Import
- Export

---
**Final Video for Chapter 3**
Comment 9/20
- [Data model](https://www.youtube.com/playlist?list=PLMg7lY2Lux0_giXsi9Znae65IwEuOV2_5) DONE
- [Importing existing data via Import tool](https://youtu.be/VvXJ3PSInsQ) DONE
- [Alternative way of importing existing data via Node.js client](https://www.youtube.com/watch?v=0GCYJKfl3BA) DONE
- [Exporting data for migration](https://youtu.be/gjzS2ji1Lfk) DONE
- [GridDB CRUD (explained via gs_sh)](https://www.youtube.com/watch?v=DI3LzMAVqsA) DONE
- GridDB API
  - [Java](https://www.youtube.com/watch?v=67GFSgSx2ZE) DONE
  - [JDBC](https://www.youtube.com/watch?v=MvY9Kqid1_0) DONE
  - [Python](https://youtu.be/GMURnoS9bK4) DONE
  - [Web API](https://youtu.be/BVbXdiaJ5mk) DONE





Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feedback] Chapter 3 - CRUD Operations #3

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feedback] Chapter 3 - CRUD Operations #3

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions