-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Weird test failures #249
Comments
Even if you execute twice the 2D or 3D case it will fail. The reason is that the distribution policy XMallocSIMD has a bug. Chnaging Line 146 in bffe2aa
mallocMC::DistributionPolicies::Noop, will fix the test.
|
The problem in the XMallocSIMD policy is how we communicate between threads in a warp.
Since Volta threads in a warp can diverge, we should use in warp communication instead of shared memory. |
Great catch! Thanks! Gonna try and fix it but not sure if I'll make it before the holidays. |
On the system at hand (HAL), the tests are all passing individually but fail if and only if "2D AccGpuCudaRt" and "3D AccGpuCudaRt" are both run in the same test run, so the following works
whenever
list.txt
doesn't contain both of the above while the latter fails with any otherlist.txt
, in particularas a MWE. From what I can tell, neither the ordering of the two nor other tests in between change this observation. The error message from the
./tests
is alwayswhile
(sometimes also
out of bounds
).Gonna investigate further in the near future but any comments and hints are welcome!
The text was updated successfully, but these errors were encountered: