I have experimented with the nvptx64-nvidia-cuda target for the last few weeks. It certainly works better than I would have initially expected.
Unfortunately, It seems like there are currently no way to pass a cuda arch (like sm_61) to llvm. This is currently possible with Clang and has been for a while.
Is this something of interest to have in rustc as well?
If so, what would be the best way to implement this for rust?
Are there any similar kind of target specific arches implemented for Rustc?
Would it be interesting to also use this in the alternative backend implemented by @RDambrosio016 and the Rust GPU project?
The problem is that it seems like there's no way to get this argument to llvm. I atleast cannot find anything, even when looking at rustc -C llvm-args=--help