Improve error message when TORCH_CUDA_ARCH_LIST has many supported architectures. #686

pavanimajety · 2024-12-19T00:10:41Z

This small change restricts the CUDA architecture check, and the error it raises, to the FP8 KV-cache data types (kv_e4m3 and kv_e5m2), which is what the error message was intended for. Currently the check throws an error even when you request a supported data type. As a future to-do, consider compiling for the current GPU architecture only, rather than for every architecture specified in TORCH_CUDA_ARCH_LIST.
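A minimal sketch of the gating idea described above, for illustration only. The helper names `needs_arch_check` and `maybe_check_arch`, the `FP8_KV_MARKERS` tuple, and the example op names are hypothetical and are not part of flashinfer; only the substrings `kv_e4m3` and `kv_e5m2` come from the diff in this PR. The real `check_cuda_arch()` is passed in as a callback here so the sketch does not assume anything about its implementation.

```python
from typing import Callable

# Substrings taken from the PR diff: they mark op names that use an FP8 KV cache.
FP8_KV_MARKERS = ("kv_e4m3", "kv_e5m2")


def needs_arch_check(op_name: str) -> bool:
    """Return True only if the op name mentions an FP8 KV-cache dtype."""
    return any(marker in op_name for marker in FP8_KV_MARKERS)


def maybe_check_arch(op_name: str, check: Callable[[], None]) -> bool:
    """Run `check` only for FP8 ops; report whether it was run.

    `check` stands in for the project's architecture check (an assumption:
    it is expected to raise if an unsupported architecture is configured).
    """
    if needs_arch_check(op_name):
        check()
        return True
    return False


if __name__ == "__main__":
    # Hypothetical op names, used only to exercise the helpers.
    for name in ("decode_dtype_kv_f16", "decode_dtype_kv_e4m3"):
        ran = maybe_check_arch(name, lambda: None)
        print(f"{name}: arch check run = {ran}")
```

With this shape, non-FP8 ops skip the architecture check entirely, so a TORCH_CUDA_ARCH_LIST containing many architectures no longer triggers the error for supported data types.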

yzh119 · 2024-12-19T07:38:15Z

flashinfer/jit/core.py

@@ -99,7 +101,8 @@ def load_cuda_ops(
    cflags += extra_cflags
    cuda_cflags += extra_cuda_cflags
    logger.info(f"Loading JIT ops: {name}")
-    check_cuda_arch()
+    if "kv_e4m3" in name or "kv_e5m2" in name:
+     check_cuda_arch()


Hi @pavanimajety, would you mind formatting the code? The following commands should work.

pip install black
black flashinfer/jit/core.py

You can also consider setting up pre-commit, which helps check the formatting of all files:

pip install pre-commit
pre-commit install
pre-commit run --all-files

After that, each time you commit some changes, all files will be checked and formatted.

Improve error message

443e397

yzh119 reviewed Dec 19, 2024
