mehmetyusufoglu changed the title from "subDivideGridElems generates invalid workdivs when blockThreadMustDivideGridThreadExtent is true" to "subDivideGridElems tests uses and generates invalid workdivs" on Apr 24, 2024
The test below is already in WorkDivHelpersTest.cpp. It shows that alpaka::subDivideGridElems produces a workdiv in which the number of threads per block exceeds the device limits (hard device properties that are independent of the kernel).
For example, Nvidia devices cannot have more than 1024 threads per block, but the function allows 300x300 threads per block.
These definitions in the test fixture are wrong and should be changed:
props.m_blockThreadExtentMax = Vec{256, 128};
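The thread-count limit described above can be illustrated with a small standalone sketch. It does not use the alpaka API: `blockFitsDevice` and `kMaxThreadsPerBlock` are hypothetical names introduced only for illustration, and the limit of 1024 is the Nvidia figure quoted in this issue. The helper multiplies the per-dimension block thread extents and compares the product against the limit.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Assumed limit, taken from the issue text (Nvidia: 1024 threads per block).
constexpr std::uint64_t kMaxThreadsPerBlock = 1024;

// Hypothetical helper, not part of alpaka: returns true if the total number
// of threads in one block (the product of the per-dimension extents) does
// not exceed the given device limit.
template<std::size_t Dim>
bool blockFitsDevice(
    std::array<std::uint64_t, Dim> const& blockThreadExtent,
    std::uint64_t maxThreadsPerBlock)
{
    std::uint64_t total = 1;
    for(std::size_t i = 0; i < Dim; ++i)
    {
        total *= blockThreadExtent[i];
    }
    return total <= maxThreadsPerBlock;
}
```

With these definitions, `blockFitsDevice<2>({300, 300}, kMaxThreadsPerBlock)` is false, because 300 x 300 = 90000 threads exceeds 1024, while a 32x32 block (1024 threads) passes. A check of this kind on the generated workdiv would be one way for a test to catch the invalid result.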