Benchmarking some dense matrix solvers to support PRs in Kokkos-kernels

CExA-project/bench-kernels

bench-kernels

Compile and Run

Installation

First, clone this repository together with its submodules:

git clone --recursive https://github.com/CExA-project/bench-kernels.git

CPU (OpenMP backend)

mkdir build_cpu && cd build_cpu
cmake -DCMAKE_CXX_COMPILER=g++ \
      -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_PREFIX_PATH=<path-to-kokkos> \
      -DKokkos_ENABLE_OPENMP=ON \
      -DKokkos_ARCH_SKX=ON \
      -DKokkosKernels_INST_DOUBLE=ON \
      -DKokkosKernels_INST_ORDINAL_INT=ON \
      -DKokkosKernels_INST_OFFSET_INT=ON \
      ..

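cmake --build . -j 8

Then run the benchmark from the build directory. If you are still inside build_cpu after the build, skip the cd:

cd build_cpu
perf_test/benchmark_kernels --benchmark_format=json --benchmark_out=tbsv_bench.json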

GPU (CUDA backend)

mkdir build_gpu && cd build_gpu
cmake -DCMAKE_CXX_COMPILER=g++ \
      -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_PREFIX_PATH=<path-to-kokkos> \
      -DKokkos_ENABLE_CUDA=ON \
      -DKokkos_ARCH_AMPERE80=ON \
      -DKokkosKernels_INST_DOUBLE=ON \
      -DKokkosKernels_INST_ORDINAL_INT=ON \
      -DKokkosKernels_INST_OFFSET_INT=ON \
      ..

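cmake --build . -j 8

Then run the benchmark from the build directory. If you are still inside build_gpu after the build, skip the cd:

cd build_gpu
perf_test/benchmark_kernels --benchmark_format=json --benchmark_out=tbsv_bench.json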

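Post-processing script

Once a run has written its JSON file, pass the corresponding build directory to the analysis script:

python analysis.py -dirname build_gpu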
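To inspect the raw numbers without the plotting step, the JSON file can be read directly. The sketch below is not the repository's analysis.py. It assumes that benchmark_kernels is built on Google Benchmark (the --benchmark_format and --benchmark_out flags suggest so) and that the file therefore holds a top-level "benchmarks" list whose entries carry "name", "real_time" and "time_unit". The function names load_benchmarks and summarize are hypothetical helpers introduced here for illustration.

```python
import json
from pathlib import Path


def load_benchmarks(path):
    """Return the list of benchmark entries from a Google Benchmark JSON file."""
    with Path(path).open() as f:
        report = json.load(f)
    # Assumption: Google Benchmark layout, one dict per measurement
    # stored under the top-level "benchmarks" key.
    return report.get("benchmarks", [])


def summarize(entries):
    """Map benchmark name -> (real_time, time_unit), skipping aggregate rows."""
    summary = {}
    for entry in entries:
        # Aggregate rows (mean, median, stddev) are marked run_type == "aggregate";
        # entries without a run_type field are treated as plain iterations.
        if entry.get("run_type", "iteration") != "iteration":
            continue
        # If a name occurs several times (repetitions), the last entry wins.
        summary[entry["name"]] = (entry["real_time"], entry.get("time_unit", "ns"))
    return summary
```

For example, summarize(load_benchmarks("build_gpu/tbsv_bench.json")) would return one (time, unit) pair per benchmark name for the GPU run, provided the file was produced by the command above.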

Results

Icelake (OpenMP)

A100 (CUDA)
