Skip to content

Regarding Cluster_id in nodeTable.csv #3

@vinitamehlawat

Description

@vinitamehlawat

Hi @qianjiaqiang

Thank you so much for making this fantastic tool available to us. For my SARS-CoV2 data, I tried your tool. I got a pi_pos_all.fasta , node_all.txt, net_all.txt, freq_all.txt file and three .csv files, when I performed the VENAS. When it comes to importing metadata to my graph, I'm becoming a little puzzled:

  1. What are the different cluster ids for in nodeTable.csv? If I colour by cluster, it will assist me understand the transmission pattern in my data?
  2. Following is the edgeTable I have for my data .
Source Target
5 3
5 246
5 97
5 12
5 26
5 20
5 21
5 237
5 70
5 88
5 159
5 52
5 82
5 92
92 197
5 152
3 7
3 57
3 60
246 247

Is this to say that the key paths are 5, 92, 3, and 246 and that further transmission is taking place from these sources?

  1. I previously inquired about the country/city/state metadata from pi_pos all.fasta. I have about 15000 sequences in input.ma and in pi pos all.fasta I received 4000, but in net.csv has 839 columns which means I have to grep the metadata from `pi_pos_all.fasta first 839 lines But this is only giving the information about 3 countries only But in pi_pos_all.fasta I have more than a dozen countries.

If you could just clarify these points, I would greatly appreciate your time and effort.

Thank you
Vinita

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions