Enabling runtime_parameter_bind_enable and batched_gpu_shots gives incorrect measurement results when using multiple GPUs
#2244
Labels
bug
Information
Linux 5.15.0-119-generic #129-Ubuntu SMP Fri Aug 2 19:25:20 UTC 2024
nvidia-smi --version output:
What is the current behavior?
Using runtime_parameter_bind_enable=True with batched_gpu_shots=True gives incorrect measurement results when running with multiple GPUs.
For example, the MWE below sets up a scenario where we would expect the measurement results to match the parameter_binds provided to backend.run. This is the case when only one GPU is used, but it stops working with two GPUs. Instead, half of the experiments all return the same measurement outcome.
I've tried scaling this up to 8 GPUs and even fewer results were correct, so it seems that maybe only the experiments run on the 1st GPU get recorded correctly.
Steps to reproduce the problem
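A rough sketch of the kind of setup described above, not the exact script from my run. It encodes the experiment index in the RX angles (0 or pi per qubit), so the measured bitstring of experiment i should decode back to i. The helper names (build_binds, decode_counts, run) are made up for illustration. The backend options are assumptions on my part: I believe the Aer option is spelled batched_shots_gpu (the text above writes it as batched_gpu_shots), and I am using target_gpus to choose how many GPUs are used.

```python
import math


def build_binds(params, num_experiments):
    """Map each parameter to a list of angles, one entry per experiment.

    For experiment i, parameter k gets pi when bit k of i is set and 0
    otherwise, so RX(angle) on qubit k deterministically encodes i.
    """
    return {
        p: [math.pi * ((i >> k) & 1) for i in range(num_experiments)]
        for k, p in enumerate(params)
    }


def decode_counts(counts_list):
    """Return the most frequent outcome of each experiment as an integer."""
    return [int(max(c, key=c.get).replace(" ", ""), 2) for c in counts_list]


def run(num_gpus, num_qubits=4, shots=10):
    # Imported lazily so the helpers above work without Qiskit installed.
    from qiskit import QuantumCircuit
    from qiskit.circuit import ParameterVector
    from qiskit_aer import AerSimulator

    theta = ParameterVector("theta", num_qubits)
    qc = QuantumCircuit(num_qubits)
    for k in range(num_qubits):
        qc.rx(theta[k], k)
    qc.measure_all()

    # Assumed option names; adjust if your Aer version spells them differently.
    backend = AerSimulator(
        method="statevector",
        device="GPU",
        target_gpus=list(range(num_gpus)),
        batched_shots_gpu=True,
        runtime_parameter_bind_enable=True,
    )
    binds = build_binds(list(theta), 2**num_qubits)
    result = backend.run(qc, shots=shots, parameter_binds=[binds]).result()
    return decode_counts(result.get_counts())
```

Calling run(1) and run(2) and comparing each returned list against list(range(16)) is the check I have in mind: a correct run should give back every experiment index exactly once, in order.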
What is the expected behavior?
I would expect the measurement results to be the same regardless of how many GPUs are used. At most, I might expect the experiments to be returned out of order when using multiple GPUs, but as you can see above, the results are not merely reordered.
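To make the two expectations concrete (same results in the same order, or at least the same results in some order), here is a small checker. It is only a sketch: compare_outcomes is a hypothetical helper, and it assumes each experiment has already been reduced to a single outcome value.

```python
from collections import Counter


def compare_outcomes(expected, observed):
    """Compare per-experiment outcomes both in order and as a multiset."""
    # Indices where the outcome differs position by position.
    mismatched = [
        i for i, (e, o) in enumerate(zip(expected, observed)) if e != o
    ]
    return {
        "same_order": len(expected) == len(observed) and not mismatched,
        # True when the results are a pure permutation of the expected ones.
        "same_multiset": Counter(expected) == Counter(observed),
        "mismatched_indices": mismatched,
    }


# Made-up values for illustration only, not output from an actual run.
reordered = compare_outcomes([0, 1, 2, 3], [2, 3, 0, 1])
corrupted = compare_outcomes([0, 1, 2, 3], [0, 1, 0, 0])
```

In this made-up example, reordered has same_multiset set to True (the benign case I would tolerate), while corrupted has it set to False, which is the kind of failure reported here.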
Suggested solutions