!!!GPU performance of CMake-built mxnet is worse than the Make-built one #6685

I have run each of the three cases below many times:
1. Running train_mnist.py (GPU) on the CMake-built mxnet on Ubuntu takes 0.7s per epoch on average, while the Make-built mxnet on Ubuntu takes only 0.42s.

2. Running train_mnist.py (GPU) on the CMake-built mxnet on Windows takes 0.822s per epoch on average, which is even slower than running the script in CPU (MKL) mode, at about 0.6s per epoch.

3. The Windows MKL build of mxnet is faster than the Ubuntu MKL build. Each epoch takes about 0.62s on the former and more than 0.8s on the latter.

This is very strange, and I hope the performance differences can be eliminated. I suggest that mxnet ship a standard benchmark tool and a set of reference performance numbers for measuring running time, similar to the 'make runtest' of Caffe.
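
To make comparisons like the ones above reproducible, a small timing harness could look like the sketch below. This is only an illustration of the kind of benchmark tool I am asking for, not an existing mxnet utility: `average_epoch_time`, `slowdown`, and the `run_epoch` callable are hypothetical names, and the caller is assumed to supply a `run_epoch` that trains one full epoch and blocks until the work has actually finished.

```python
import time
from statistics import mean


def average_epoch_time(run_epoch, num_epochs=10, warmup=1, timer=time.perf_counter):
    """Return the mean wall-clock time of `run_epoch` over `num_epochs` runs.

    `run_epoch` is a hypothetical zero-argument callable supplied by the
    caller. It must block until the epoch is really done; on a GPU that
    usually means synchronizing before returning, otherwise only the time
    to enqueue the work is measured.
    """
    if num_epochs < 1:
        raise ValueError("num_epochs must be at least 1")

    # Warm-up runs are executed but not timed, so one-off costs such as
    # memory allocation or kernel selection do not skew the average.
    for _ in range(warmup):
        run_epoch()

    durations = []
    for _ in range(num_epochs):
        start = timer()
        run_epoch()
        durations.append(timer() - start)
    return mean(durations)


def slowdown(baseline_seconds, candidate_seconds):
    """Return how many times slower the candidate build is than the baseline."""
    return candidate_seconds / baseline_seconds


if __name__ == "__main__":
    # Stand-in workload; replace with one real training epoch.
    avg = average_epoch_time(lambda: sum(range(100_000)), num_epochs=5)
    print(f"average epoch time: {avg:.4f}s")
    # Figures from case 1: Make build 0.42s, CMake build 0.7s.
    print(f"CMake vs Make slowdown: {slowdown(0.42, 0.7):.2f}x")
```

Running the same harness against each build (CMake vs Make, Windows vs Ubuntu, GPU vs MKL) with an identical script and epoch count would give numbers that can be compared directly and checked against a published reference.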

