The benchmark results in my test environment are much lower than I expected, and even when I sweep several parameters, every AU expectation is reported as "Fail".
I suspect that the cpu-bind option is not being applied. (If you disagree with this guess, please comment.)
1. Is there a way to check whether the cpu-bind option is properly applied in the MLPerf Storage tool?
2. Which of the options in the issues linked below should I use to run with cpu-bind set up properly? Several different options are mentioned, so I can't tell which one is actually implemented at the moment.
The issues I referred to are:
a. github.com//issues/55: --cpu-bind depth -d num_workers
b. github.com//issues/123: --mpi-params --bind-to-core --cpu-per-proc 8 ? mpiexec --cpu-bind depth -d 4 ?
3. As shown below, for some parameter sweep settings the AU percentage and throughput differ widely across 5 repeated evaluations. Can you advise what the problem might be? Could the PCIe link be dropping to a lower generation, or is the cpu-bind failing to apply?
4. Is the warning log below harmless? The nproc command reports 512 cores on my machine (Hyper-Threading on). DLIO seems to pick up only the threads-per-core value, or an invalid value from an unknown source.
[WARNING] Running DLIO with 16 threads for I/O but core available 2 are insufficient and can lead to lower performance.
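To narrow down questions 1 and 4 on my side, here is a minimal sketch I can run with the same launcher arguments as the benchmark. It prints how many logical CPUs the machine has and how many each process is actually allowed to run on. I am assuming, without having confirmed it in the DLIO source, that the "core available" figure in the warning reflects the process's CPU affinity rather than the machine-wide count. The function name `cpu_visibility` is my own, and the rank lookup only covers the Open MPI and MPICH/Hydra environment variables.

```python
import os


def cpu_visibility():
    """Return (logical CPUs on the machine, sorted CPUs this process may run on)."""
    total = os.cpu_count() or 1
    # os.sched_getaffinity is Linux-only; fall back to "all CPUs" elsewhere.
    if hasattr(os, "sched_getaffinity"):
        allowed = sorted(os.sched_getaffinity(0))
    else:
        allowed = list(range(total))
    return total, allowed


if __name__ == "__main__":
    total, allowed = cpu_visibility()
    # Rank variables set by Open MPI and by MPICH/Hydra; "?" if neither is present.
    rank = os.environ.get("OMPI_COMM_WORLD_RANK") or os.environ.get("PMI_RANK") or "?"
    print(f"rank={rank} total_cpus={total} allowed={len(allowed)} cpus={allowed}")
```

If each rank reports only 2 allowed CPUs while total_cpus is 512, that would suggest the binding itself is restricting every process to a single physical core with its two hardware threads, which would match the number in the warning.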
Thank you.