Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The script ignores Compute v2.1 #3

Open
Maghoumi opened this issue Dec 8, 2015 · 0 comments
Open

The script ignores Compute v2.1 #3

Maghoumi opened this issue Dec 8, 2015 · 0 comments

Comments

@Maghoumi
Copy link

Maghoumi commented Dec 8, 2015

There is a problem with cuda_compute_capability.c. It is caused by:

    if (major == 2 && minor == 1)
    {
        // There is no --arch compute_21 flag for nvcc, so force minor to 0
        minor = 0;
    }

See, the problem is that some Fermi cards do support Compute v2.1. In fact, Compute v2.1 exists (see https://en.wikipedia.org/wiki/CUDA#Supported_GPUs) but the way to activate that would be through setting the flags as -arch compute_20 -code sm_21.

The script currently assumes that whatever compute_xx is, sm should be also followed by the same number and be set as sm_xx. I've ran into problems with sm_20 on a machine that supports sm_21 before. For instance, I vaguely recall that numerical computations were more accurate with sm_21 than sm_20 (on the Caffe library if I recall). Considering the large number of CMake scripts out there that rely on this script, I hope the issue is fixed :)

Unfortunately, my knowledge about CMake is rather limited, otherwise I would've fixed it and submitted a pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant