Skip to content

[Feature Request] Use HDF5 instead of numpy files #617

@njzjz

Description

@njzjz

Summary

For DeePMD-kit training, when there are large numbers of systems, use HDF5 instead of NumPy files.

Detailed Description

When there are large numbers of systems, it consumes a lot of time to transfer large number of small NumPy files to a supercomputer cluster with bad I/O performance. A HDF5 file can store multiple arrays so it is faster to be transfer. The test results produce the behavior.

Further Information, Files, and Links

deepmodeling/deepmd-kit#1163

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions