[Solved] Passing cuda-arch when compiling for nvptx64-nvidia-cuda

kjetilkjeka · February 13, 2022, 6:47pm

I have experimented with the nvptx64-nvidia-cuda target for the last few weeks. It certainly works better than I would have initially expected.

Unfortunately, it seems there is currently no way to pass a CUDA arch (like sm_61) to LLVM. This has been possible with Clang for a while.

Is this something of interest to have in rustc as well?

If so, what would be the best way to implement this for Rust?

Are there any similar target-specific arch options already implemented in rustc?

Would it be interesting to also use this in the alternative backend implemented by @RDambrosio016 and the Rust GPU project?

ethindp · February 13, 2022, 7:07pm

I mean, if it works in Clang then it works in LLVM. Or that's the usual rule, at least. Is Clang doing something different for this?

kjetilkjeka · February 13, 2022, 7:29pm

The problem is that there seems to be no way to get this argument to LLVM. At least I cannot find anything, even when looking at rustc -C llvm-args=--help

Nemo157 · February 13, 2022, 8:06pm

> rustc --print target-cpus --target nvptx64-nvidia-cuda
Available CPUs for this target:
    sm_20
    sm_21
    sm_30
    sm_32
    sm_35
    sm_37
    sm_50
    sm_52
    sm_53
    sm_60
    sm_61
    sm_62
    sm_70
    sm_72
    sm_75
    sm_80
    sm_86

You can select these using the -Ctarget-cpu=sm_61 flag.
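For anyone scripting this, here is a minimal sketch of a helper that builds the rustc arguments for a chosen arch. The function name nvptx_rustc_args and the KNOWN_CPUS list are hypothetical; the list is simply copied from the output above and may differ on other rustc/LLVM versions, so treat it as an assumption rather than an authoritative set.

```rust
/// CPU names copied from the `rustc --print target-cpus` output quoted above.
/// Assumption: other rustc/LLVM versions may list different names.
const KNOWN_CPUS: [&str; 17] = [
    "sm_20", "sm_21", "sm_30", "sm_32", "sm_35", "sm_37", "sm_50", "sm_52", "sm_53",
    "sm_60", "sm_61", "sm_62", "sm_70", "sm_72", "sm_75", "sm_80", "sm_86",
];

/// Hypothetical helper: returns the rustc arguments that select `cpu` for the
/// nvptx64-nvidia-cuda target, or `None` if `cpu` is not in the list above.
fn nvptx_rustc_args(cpu: &str) -> Option<Vec<String>> {
    if !KNOWN_CPUS.contains(&cpu) {
        return None;
    }
    Some(vec![
        "--target".to_string(),
        "nvptx64-nvidia-cuda".to_string(),
        "-C".to_string(),
        format!("target-cpu={}", cpu),
    ])
}

fn main() {
    // Print the command line instead of running it, so the sketch works
    // even without the nvptx64 target installed.
    match nvptx_rustc_args("sm_61") {
        Some(args) => println!("rustc {}", args.join(" ")),
        None => eprintln!("unknown CUDA arch"),
    }
}
```

The resulting arguments could be handed to std::process::Command, or the same flag could be set through the RUSTFLAGS environment variable when building with Cargo.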

kjetilkjeka · February 13, 2022, 8:43pm

I didn't realize that was possible. That seems to work fine. Thanks!

system · May 14, 2022, 8:43pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.
