feat(hbase): support gen HFile for hbase v2 (BETA)#358
imbajin merged 40 commits into apache:master
…schema to SchemaCache fix apache#333
…schema to SchemaCache fix apache#333
merge master into schemaCache-optimize
…incubator-hugegraph-toolchain into schemaCache-optimize
@JackyYangPassion Is this an improved part?
Codecov Report
@@             Coverage Diff              @@
##             master     #358      +/-   ##
============================================
- Coverage     64.82%   62.44%   -2.38%
- Complexity     1851     1864      +13
============================================
  Files           255      260       +5
  Lines          9081     9462     +381
  Branches        837      874      +37
============================================
+ Hits           5887     5909      +22
- Misses         2810     3169     +359
  Partials        384      384
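The coverage percentages in the report can be re-derived from the Hits and Lines rows. The following is only a sketch: the class and method names are made up for illustration, and truncating to two decimals (rather than rounding) is an assumption that happens to reproduce the reported figures, not a documented Codecov rule.

```java
// Hypothetical helper for checking the report's arithmetic.
// Not part of hugegraph-loader or of Codecov.
final class CoverageCheck {

    // Coverage in hundredths of a percent, truncated by integer division.
    // Assumption: truncation (not rounding) matches the reported numbers.
    static long coverageBasisPoints(long hits, long lines) {
        return hits * 10_000 / lines;
    }

    // Renders e.g. 6482 as "64.82%" and -238 as "-2.38%".
    static String format(long basisPoints) {
        String sign = basisPoints < 0 ? "-" : "";
        long abs = Math.abs(basisPoints);
        return sign + (abs / 100) + "." + String.format("%02d", abs % 100) + "%";
    }

    public static void main(String[] args) {
        long master = coverageBasisPoints(5887, 9081); // Hits / Lines on master
        long pr = coverageBasisPoints(5909, 9462);     // Hits / Lines on #358
        System.out.println("master: " + format(master));
        System.out.println("#358:   " + format(pr));
        System.out.println("delta:  " + format(pr - master));
    }
}
```

The drop follows from the Lines and Hits rows: 381 lines were added but only 22 of them are hit by tests, which accounts for the 359 additional misses.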
OK, marking it as to be reviewed as well. Could you also handle the third-party dependencies check?
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/builder/EdgeBuilder.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/builder/VertexBuilder.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/DirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
adjust code style fix apache#357
1. The code style has been adjusted.
fix third-party dependencies error fix apache#357
Thanks. The third-party check seems to have failed; do you need some help?
javeme left a comment
Thanks for your contribution.
Please also address the other comments at https://github.com/apache/incubator-hugegraph-toolchain/pull/358/files (search for "ago"), and also address this file: LoadOptions.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/util/SinkToHBase.java
adjust code style fix apache#357
@imbajin Hi, I can help solve the loader CI check failure.
Thanks, I have already adopted the basic code. The current difference is:
expected:
{
"version":"2.0",
"structs":[
{
"id":"1",
"skip":false,
"input":{
"type":"FILE",
"path":"users.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"Gender",
"Age",
"Occupation",
"Zip-code"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
{
"label":"user",
"skip":false,
"id":null,
"unfold":false,
"field_mapping":{
"UserID":"id"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Occupation",
"Zip-code",
"Gender",
"Age"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
],
"edges":[
]
},
{
"id":"2",
"skip":false,
"input":{
"type":"FILE",
"path":"ratings.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"MovieID",
"Rating",
"Timestamp"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
],
"edges":[
{
"label":"rating",
"skip":false,
"source":[
"UserID"
],
"unfold_source":false,
"target":[
"MovieID"
],
"unfold_target":false,
"field_mapping":{
"UserID":"id",
"MovieID":"id",
"Rating":"rate"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Timestamp"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
]
}
]
}
actual:
{
"version":"2.0",
"structs":[
{
"id":"1",
"skip":false,
"input":{
"type":"FILE",
"path":"users.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"Gender",
"Age",
"Occupation",
"Zip-code"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
{
"label":"user",
"skip":false,
"id":null,
"unfold":false,
"field_mapping":{
"UserID":"id"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Occupation",
"Zip-code",
"Gender",
"Age"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
],
"edges":[
]
},
{
"id":"2",
"skip":false,
"input":{
"type":"FILE",
"path":"ratings.dat",
"file_filter":{
"extensions":[
"*"
]
},
"format":"TEXT",
"delimiter":"::",
"date_format":"yyyy-MM-dd HH:mm:ss",
"time_zone":"GMT+8",
"skipped_line":{
"regex":"(^#|^//).*|"
},
"compression":"NONE",
"batch_size":500,
"header":[
"UserID",
"MovieID",
"Rating",
"Timestamp"
],
"charset":"UTF-8",
"list_format":null
},
"vertices":[
],
"edges":[
{
"label":"rating",
"skip":false,
"source":[
"UserID"
],
"unfold_source":false,
"target":[
"MovieID"
],
"unfold_target":false,
"field_mapping":{
"UserID":"id",
"MovieID":"id",
"Rating":"rate"
},
"value_mapping":{
},
"selected":[
],
"ignored":[
"Timestamp"
],
"null_values":[
""
],
"update_strategies":{
},
"batch_size":500
}
]
}
],
"backendStoreInfo":null
}
It seems the only difference is the extra "backendStoreInfo":null field in the actual output.
The configuration of the storage layer that bulkLoad depends on is specified in struct.json, which is why backendStoreInfo was added. In a follow-up iteration, the storage-layer configuration will be obtained from the server.
That's fine, just adopt it in the test 😄 (and do the same for any other test problems).
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/loader/HBaseDirectLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/direct/util/SinkToHBase.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/executor/LoadOptions.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/spark/HugeGraphSparkLoader.java
hugegraph-loader/src/main/java/com/baidu/hugegraph/loader/util/MappingUtil.java
imbajin left a comment
Thanks. We could handle the third-party dependencies together before release (to avoid wasting a lot of time on it now).
Do I need to fix the failed third-party dependencies check?
OK
We can downgrade the version if necessary; the hadoop dependency, it seems, cannot be excluded.
Yes, the loader needs the hadoop dependency. Internally, we read data from HDFS and load it into the graph.
close #357
1. Support writing vertices/edges directly to KV storage
2. Only customString and customNumber IDs are supported for now
3. Submit the loader code that bypasses the server when writing to HBase
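The ID restriction in item 2 could be enforced with a guard along these lines. This is a minimal sketch under stated assumptions: the enum and the `DirectWriteGuard` class are defined locally for illustration and are not the real HugeGraph types; only the two supported strategies (custom string and custom number) come from the description above.

```java
import java.util.EnumSet;
import java.util.Set;

// Illustrative only: not the loader's actual implementation.
final class DirectWriteGuard {

    // Hypothetical local enum; the constant names are assumptions.
    enum IdStrategy {
        AUTOMATIC, PRIMARY_KEY, CUSTOMIZE_STRING, CUSTOMIZE_NUMBER, CUSTOMIZE_UUID
    }

    // Per the PR description, only custom string and custom number IDs
    // are supported for direct writes at this stage.
    private static final Set<IdStrategy> SUPPORTED =
            EnumSet.of(IdStrategy.CUSTOMIZE_STRING, IdStrategy.CUSTOMIZE_NUMBER);

    static boolean supportsDirectWrite(IdStrategy strategy) {
        return SUPPORTED.contains(strategy);
    }

    // Fails fast before any HFile is generated for an unsupported label.
    static void check(String label, IdStrategy strategy) {
        if (!supportsDirectWrite(strategy)) {
            throw new IllegalArgumentException(
                    "Direct write does not support id strategy " + strategy +
                    " (vertex label '" + label + "')");
        }
    }

    public static void main(String[] args) {
        check("user", IdStrategy.CUSTOMIZE_NUMBER); // passes silently
        try {
            check("movie", IdStrategy.PRIMARY_KEY);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Checking the strategy up front keeps an unsupported schema from producing partially written files; whether the loader actually validates at this point is not stated in the PR.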