Update dvc add documentation#195
Conversation
shcheklein
left a comment
There was a problem hiding this comment.
Looks great overall. Put a few comments inline to address. In addition to that:
- There are lines with trailing whitespaces. Again, you can enable a mode in VS code (or Vim or any other editor) to highlight them. It's bad because it creates lines like this "end of word new word" (there is at least one I noticed in this PR).
- That paragraph that describes why
dvc addis useful requires some clarification. Especially themanuallyword. It's reasonable to take under control one of the outputs and update it with some script running it via make. Does it fit into manual? I'm not sure. And the second very common scenario is to take input data (that is usually pretty static) to create a pipeline withdvc runon top after that. You actually have to usedvc addfor input datasets. - Examples, examples, examples. The biggest problem. Output is way too much detailed. 4 dvc files - too many for the task, etc. Let's think this through. Let's also mention that
|
@shcheklein said:
I'm not sure I follow, based on the replication I just did. Namely Specifically: I got this stage file: cmd: python train.py
deps:
- md5: 78d98f3865c2fcfe1dbe95b738960d0a
path: train.py
- md5: b8f4d5a78e55e88906d5f4aeaf43802e.dir
path: data
md5: 71acee76d9f4458059ae5b7f2435cb32
outs:
- cache: true
md5: 6a92af2a09ec797dcb0dab2cfa1ac778
metric: false
path: model.h5
- cache: true
md5: da9e20b12aa5b2dc0abb02e1a1b4e4cf
metric: false
path: bottleneck_features_train.npy
- cache: true
md5: e548cc847339c990a7dbe0759d87c7c4
metric: false
path: bottleneck_features_validation.npy
- cache: false
md5: 0b14406a44c15521efc0f4d96c80befd
metric: true
path: metrics.json
wdir: .The data directory is listed as a I reran the same steps, but used Specifically: Since |
|
@robogeek yep, the difference is that |
|
I have updated the document to match the comments above |
No description provided.