Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement batched serial pttrs #2277

Merged
merged 6 commits into from
Aug 28, 2024

Conversation

yasahi-hpc
Copy link
Contributor

This PR implements pttrs function.

Following files are added:

  1. KokkosBatched_Pttrs_Serial_Impl.hpp: Internal interfaces
  2. KokkosBatched_Pttrs_Serial_Internal.hpp: Implementation details
  3. KokkosBatched_Pttrs.hpp: APIs
  4. Test_Batched_SerialPttrs.hpp: Unit tests for that

Detailed description

It solves the equation A * x = b, where A is a real symmetric (complex Hermitian) positive definite tridiagonal matrix A, represented by vectors D and E.
Before solving the factorization A = U**H * D * U or A = L * D * L**H must be computed by Pttrf.

Here, the matrix has the following shape.

  • D: (batch_count, n)
    On entry, the n diagonal elements of the tridiagonal matrix
    A. On exit, the n diagonal elements of the diagonal matrix
    D from the L*D*L**T factorization of A.
  • E: (batch_count, n-1)
    On entry, the n-1 subdiagonal elements of the tridiagonal matrix
    A. On exit, the n-1 subdiagonal elements of the
    unit bidiagonal factor L from the L*D*L**T factorization of A.
    E can also be regarded as the superdiagonal of the unit
    bidiagonal factor U from the U**T*D*U factorization of A.
  • B: (batch_count, n)
    On entry, it contains the n element n right-hand side vector b. On exit, the solution vectors x.

Example of a single batch of matrix A n = 10. In this case, D is a length n array filled with 4 and E is a length n-1 array filled with 1.

A
4 1 0 0 0 0 0 0 0 0 
1 4 1 0 0 0 0 0 0 0 
0 1 4 1 0 0 0 0 0 0 
0 0 1 4 1 0 0 0 0 0 
0 0 0 1 4 1 0 0 0 0 
0 0 0 0 1 4 1 0 0 0 
0 0 0 0 0 1 4 1 0 0 
0 0 0 0 0 0 1 4 1 0 
0 0 0 0 0 0 0 1 4 1 
0 0 0 0 0 0 0 0 1 4

D
4 4 4 4 4 4 4 4 4 4

E
1 1 1 1 1 1 1 1 1

Parallelization would be made in the following manner. This is efficient only when
A is given in LayoutLeft for GPUs and LayoutRight for CPUs (parallelized over batch direction).

Kokkos::parallel_for('pttrs', 
    Kokkos::RangePolicy<execution_space> policy(0, n),
    [=](const int k) {
        auto dd = Kokkos::subview(_d, k, Kokkos::ALL());
        auto ee = Kokkos::subview(_e, k, Kokkos::ALL());
        auto bb = Kokkos::subview(_b, k, Kokkos::ALL());

        KokkosBatched::SerialPttrs<Uplo::Lower, AlgoTagType>::invoke(dd, ee, bb);
    });

Tests

  1. Simple and small analytical test to solve the following system of equations using pttrf and pttrs. Check if x = (1/5, 1/5).

    4 x0 +   x1 = 1
      x0 + 4 x1 = 1
    
    (x0, x1) = (1/5, 1/5)
  2. Make a real (complex Hermitian) symmetric positive definite tridiagonal matrix A from random D and E for n >= 2.
    Then, factorize D and E with pttrf to get L and D which satisfies A = L * D * L**T (A = L * D * L**H).
    Then, solve A * x = b with pttrs to get x, while keeping a copy of b as a reference.
    Finally, check if A * x = b is satisfied.

  3. Quick return test for n = 0 and n = 1. Solve A * x = b with pttrf and pttrs. Confirm that x = b / A(0, 0) is satisfied for n = 1. For n = 0, it is confirmed that info returns 0.

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
NO INSPECTION HAS BEEN PERFORMED ON THIS PULL REQUEST! - This PR must be inspected by setting label 'AT: PRE-TEST INSPECTED'.

KOKKOS_INLINE_FUNCTION static int checkPttrsInput(
[[maybe_unused]] const DViewType &d, [[maybe_unused]] const EViewType &e,
[[maybe_unused]] const BViewType &b) {
static_assert(Kokkos::is_view<DViewType>::value,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is fine but if you wanted to use the shorter: Kokkos::is_view_v<DViewType> that's also okay and saves you a bit of time

Copy link
Contributor

@lucbv lucbv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, could make things a little shorter but that's not necessary just a comment

template <>
template <typename ValueType>
KOKKOS_INLINE_FUNCTION int
SerialPttrsInternal<Uplo::Upper, Algo::Pttrs::Unblocked>::invoke(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since the matrix is symmetric Uplo::Lower and Uplo::Upper end up being the same implementation, maybe we can just put that in a single function and call it in both specializations? That's less code maintenance and potential bugs in the future?

@lucbv lucbv added feature request AT: PRE-TEST INSPECTED Mark this PR as approved for testing. labels Aug 27, 2024
@kokkos-devops-admin kokkos-devops-admin removed the AT: PRE-TEST INSPECTED Mark this PR as approved for testing. label Aug 27, 2024
@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED by label AT: PRE-TEST INSPECTED! Autotester is Removing Label; this inspection will remain valid until a new commit to source branch is performed.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1486
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 1074
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 728
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 715
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 716
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 720
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 692
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 1166
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 675
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (yasahi-hpc/kokkos-kernels)
  • Branch: implement-batched-serial-pttrs
  • SHA: 1d59f82
  • Mode: TEST_REPO

Pull Request Author: yasahi-hpc

@yasahi-hpc yasahi-hpc force-pushed the implement-batched-serial-pttrs branch from 1d59f82 to f5e406d Compare August 27, 2024 19:27
@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Error: Jenkins Jobs - A user has pushed a change to the PR before testing completed. NEW EVENT 'committed', ID C_kwDOBK7s5toAKGY1ZTQwNmQ4NGU2MjNhMGZlZjc4NzIzNzM4ZDYxZmJlNDk2OWYyNDI... The Jenkins Jobs will be shutdown; Testing of this PR must occur again.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: 1 or more Jobs FAILED

Note: Testing will normally be attempted again in approx. 2 Hrs 30 Mins. If a change to the PR source branch occurs, the testing will be attempted again on next available autotester run.

Pull Request Auto Testing has FAILED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1486
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 1074
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 728
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 715
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 716
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 720
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 692
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 1166
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 675
  • Status: ERROR

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 1d59f82
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA aaa634b
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS
Console Output (last 100 lines) : KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight # 1486 (click to expand)

b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/common/impl/KokkosKernels_NaN.hpp(39): warning: missing return statement at end of non-void function "KokkosKernels::Impl::quiet_NaN() [with T=std::remove_const_t>]"'
b'          detected during:'
b'            instantiation of "T KokkosKernels::Impl::quiet_NaN() [with T=std::remove_const_t>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(335): here'
b'            instantiation of "auto Test_Spmv_Bsr::random_vecs_for_spmv(const char *, const Bsr &, __nv_bool)->std::tuple::type, Test_Spmv_Bsr::VectorTypeFor::type> [with Bsr=KokkosSparse::Experimental::BsrMatrix, int, Kokkos::Device, void, int>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(357): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv_combos(const char *, const Bsr &, const Crs &, size_t) [with Bsr=KokkosSparse::Experimental::BsrMatrix, int, Kokkos::Device, void, int>, Crs=KokkosSparse::CrsMatrix, int, Kokkos::Device, void, int>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(408): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv_corner_cases() [with Scalar=kokkos_complex_double, Ordinal=int, Offset=int, Device=CudaSpaceDevice]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(459): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv() [with Scalar=kokkos_complex_double, Ordinal=int, Offset=int, Device=CudaSpaceDevice]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/test_common/Test_Common_Test_All_Type_Combos.hpp(103): here'
b''
b'[ 77%] Building CXX object sparse/unit_test/CMakeFiles/KokkosKernels_sparse_openmp.dir/backends/Test_OpenMP_Sparse.cpp.o'
b'[ 77%] Building CXX object sparse/unit_test/CMakeFiles/KokkosKernels_blocksparse_openmp.dir/backends/Test_OpenMP_BlockSparse.cpp.o'
b'[ 77%] Linking CXX executable KokkosKernels_common_cuda'
b'[ 77%] Built target KokkosKernels_common_cuda'
b'[ 77%] Building CXX object ode/unit_test/CMakeFiles/KokkosKernels_ode_cuda.dir/__/__/test_common/Test_Main.cpp.o'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/common/impl/KokkosKernels_NaN.hpp(39): warning: missing return statement at end of non-void function "KokkosKernels::Impl::quiet_NaN() [with T=kokkos_complex_double]"'
b'          detected during:'
b'            instantiation of "T KokkosKernels::Impl::quiet_NaN() [with T=kokkos_complex_double]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv.hpp(400): here'
b'            instantiation of "void test_spmv(KokkosSparse::SPMVAlgorithm, lno_t, size_type, lno_t, lno_t, __nv_bool) [with scalar_t=kokkos_complex_double, lno_t=int, size_type=int, Device=CudaSpaceDevice]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv.hpp(461): here'
b'            instantiation of "void test_spmv_algorithms(lno_t, size_type, lno_t, lno_t, __nv_bool) [with scalar_t=kokkos_complex_double, lno_t=int, size_type=int, Device=CudaSpaceDevice]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/test_common/Test_Common_Test_All_Type_Combos.hpp(103): here'
b''
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/common/impl/KokkosKernels_NaN.hpp(39): warning: missing return statement at end of non-void function "KokkosKernels::Impl::quiet_NaN() [with T=std::remove_const_t>]"'
b'          detected during:'
b'            instantiation of "T KokkosKernels::Impl::quiet_NaN() [with T=std::remove_const_t>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(335): here'
b'            instantiation of "auto Test_Spmv_Bsr::random_vecs_for_spmv(const char *, const Bsr &, __nv_bool)->std::tuple::type, Test_Spmv_Bsr::VectorTypeFor::type> [with Bsr=KokkosSparse::Experimental::BsrMatrix, int, Kokkos::DefaultHostExecutionSpace, void, int>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(357): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv_combos(const char *, const Bsr &, const Crs &, size_t) [with Bsr=KokkosSparse::Experimental::BsrMatrix, int, Kokkos::DefaultHostExecutionSpace, void, int>, Crs=KokkosSparse::CrsMatrix, int, Kokkos::DefaultHostExecutionSpace, void, int>]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(408): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv_corner_cases() [with Scalar=kokkos_complex_double, Ordinal=int, Offset=int, Device=Kokkos::OpenMP]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv_bsr.hpp(459): here'
b'            instantiation of "void Test_Spmv_Bsr::test_spmv() [with Scalar=kokkos_complex_double, Ordinal=int, Offset=int, Device=Kokkos::OpenMP]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/test_common/Test_Common_Test_All_Type_Combos.hpp(103): here'
b''
b'[ 77%] Building CXX object ode/unit_test/CMakeFiles/KokkosKernels_ode_cuda.dir/backends/Test_Cuda_ODE.cpp.o'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/ode/unit_test/Test_ODE_BDF.hpp(548): warning: variable "mySys" was set but never used'
b''
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/common/impl/KokkosKernels_NaN.hpp(39): warning: missing return statement at end of non-void function "KokkosKernels::Impl::quiet_NaN() [with T=kokkos_complex_double]"'
b'          detected during:'
b'            instantiation of "T KokkosKernels::Impl::quiet_NaN() [with T=kokkos_complex_double]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv.hpp(400): here'
b'            instantiation of "void test_spmv(KokkosSparse::SPMVAlgorithm, lno_t, size_type, lno_t, lno_t, __nv_bool) [with scalar_t=kokkos_complex_double, lno_t=int, size_type=int, Device=Kokkos::OpenMP]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/sparse/unit_test/Test_Sparse_spmv.hpp(461): here'
b'            instantiation of "void test_spmv_algorithms(lno_t, size_type, lno_t, lno_t, __nv_bool) [with scalar_t=kokkos_complex_double, lno_t=int, size_type=int, Device=Kokkos::OpenMP]" '
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/test_common/Test_Common_Test_All_Type_Combos.hpp(103): here'
b''
b'[ 77%] Linking CXX executable KokkosKernels_blas_openmp'
b'[ 77%] Built target KokkosKernels_blas_openmp'
b'[ 77%] Building CXX object ode/unit_test/CMakeFiles/KokkosKernels_ode_openmp.dir/__/__/test_common/Test_Main.cpp.o'
b'[ 78%] Linking CXX executable KokkosKernels_ode_cuda'
b'[ 78%] Building CXX object ode/unit_test/CMakeFiles/KokkosKernels_ode_openmp.dir/backends/Test_OpenMP_ODE.cpp.o'
b'[ 78%] Built target KokkosKernels_ode_cuda'
b'[ 78%] Building CXX object perf_test/batched/sparse/CG/CMakeFiles/KokkosBatched_Test_CG.dir/KokkosBatched_Test_CG.cpp.o'
b'/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/kokkos-kernels/ode/unit_test/Test_ODE_BDF.hpp(548): warning: variable "mySys" was set but never used'
b''
b'cc1plus: all warnings being treated as errors'
b'make[2]: *** [batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_openmp.dir/build.make:90: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_openmp.dir/backends/Test_OpenMP_Batched_Dense.cpp.o] Error 1'
b'make[1]: *** [CMakeFiles/Makefile2:1702: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_openmp.dir/all] Error 2'
b'make[1]: *** Waiting for unfinished jobs....'
b'[ 78%] Linking CXX executable KokkosKernels_graph_openmp'
b'[ 78%] Built target KokkosKernels_graph_openmp'
b'[ 78%] Linking CXX executable KokkosKernels_blocksparse_openmp'
b'[ 78%] Linking CXX executable KokkosKernels_blocksparse_cuda'
b'[ 78%] Linking CXX executable KokkosKernels_blas_cuda'
b'[ 78%] Built target KokkosKernels_blocksparse_openmp'
b'[ 78%] Built target KokkosKernels_blocksparse_cuda'
b'[ 78%] Built target KokkosKernels_blas_cuda'
b'[ 78%] Linking CXX executable KokkosKernels_ode_openmp'
b'[ 78%] Built target KokkosKernels_ode_openmp'
b'[ 78%] Linking CXX executable KokkosBatched_Test_CG'
b'[ 78%] Built target KokkosBatched_Test_CG'
b'[ 78%] Linking CXX executable KokkosKernels_batched_dla_cuda'
b'[ 78%] Built target KokkosKernels_batched_dla_cuda'
b'[ 78%] Linking CXX executable KokkosKernels_graph_cuda'
b'[ 78%] Built target KokkosKernels_graph_cuda'
b'[ 78%] Linking CXX executable KokkosKernels_sparse_openmp'
b'[ 78%] Built target KokkosKernels_sparse_openmp'
b'[ 79%] Linking CXX executable KokkosKernels_batched_gemm_cuda'
b'[ 79%] Built target KokkosKernels_batched_gemm_cuda'
b'[ 80%] Linking CXX executable KokkosKernels_sparse_cuda'
b'[ 80%] Built target KokkosKernels_sparse_cuda'
b'make: *** [Makefile:146: all] Error 2'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'#######################################################'
b'FAILED TESTS'
b'#######################################################'
b'cuda-11.2.2-gcc-8.3.1-Cuda_OpenMP-release (build failed)'
b'#######################################################'
b'  # Reproducer instructions:'
b'cat: /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight/KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight.1486/TestAll_2024-08-27_11.48.04/cuda-11.2.2-gcc/8.3.1/Cuda_OpenMP-release/reload_modules.sh: No such file or directory'
b"Build step 'Execute shell' marked build as failure"
b'Finished: FAILURE'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10 # 1074 (click to expand)

b'[ 74%] Building CXX object blas/unit_test/CMakeFiles/KokkosKernels_blas_cuda.dir/backends/Test_Cuda_Blas.cpp.o'
b'[ 74%] Building CXX object lapack/unit_test/CMakeFiles/KokkosKernels_lapack_serial.dir/backends/Test_Serial_Lapack.cpp.o'
b'[ 74%] Building CXX object lapack/unit_test/CMakeFiles/KokkosKernels_lapack_cuda.dir/backends/Test_Cuda_Lapack.cpp.o'
b'[ 74%] Building CXX object blas/unit_test/CMakeFiles/KokkosKernels_blas_serial.dir/backends/Test_Serial_Blas.cpp.o'
b'In file included from In file included from /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/backends/Test_Serial_Batched_Dense.cpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/backends/Test_Cuda_Batched_Dense.cpp::2020:'
b':'
b'In file included from In file included from /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/Test_Batched_Dense.hpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/Test_Batched_Dense.hpp::5555:'
b':'
b'In file included from In file included from /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/Test_Batched_SerialPttrs.hpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/unit_test/Test_Batched_SerialPttrs.hpp::2222:'
b':'
b'In file included from In file included from /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/src/KokkosBatched_Pttrs.hpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/src/KokkosBatched_Pttrs.hpp::5454:'
b':'
b'In file included from In file included from /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Impl.hpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Impl.hpp::2121:'
b':'
b"/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp::5151::6060::  error: error: unused parameter 'ldb' [-Werror,-Wunused-parameter]unused parameter 'ldb' [-Werror,-Wunused-parameter]"
b''
b'    ValueType *KOKKOS_RESTRICT b, const int bs0, const int ldb) {'
b'                                                           ^    ValueType *KOKKOS_RESTRICT b, const int bs0, const int ldb) {'
b''
b'                                                           ^'
b"/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp:72:60:/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp :error: 72unused parameter 'ldb' [-Werror,-Wunused-parameter]:"
b"60: error: unused parameter 'ldb' [-Werror,-Wunused-parameter]"
b'    ValueType *KOKKOS_RESTRICT b, const int bs0, const int ldb) {'
b'                                                           ^'
b'    ValueType *KOKKOS_RESTRICT b, const int bs0, const int ldb) {'
b'                                                           ^'
b"/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp:98:15:/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp :error: 98unused parameter 'ldb' [-Werror,-Wunused-parameter]:"
b'15:     const int ldb) {error: '
b"unused parameter 'ldb' [-Werror,-Wunused-parameter]              ^"
b''
b'    const int ldb) {'
b'              ^'
b"/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp:121:15:/home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/kokkos-kernels/batched/dense/impl/KokkosBatched_Pttrs_Serial_Internal.hpp :error: 121unused parameter 'ldb' [-Werror,-Wunused-parameter]:"
b'15:     const int ldb) {error: '
b"unused parameter 'ldb' [-Werror,-Wunused-parameter]              ^"
b''
b'    const int ldb) {'
b'              ^'
b'4 errors generated when compiling for sm_70.'
b'make[2]: *** [batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_serial.dir/build.make:90: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_serial.dir/backends/Test_Serial_Batched_Dense.cpp.o] Error 1'
b'make[1]: *** [CMakeFiles/Makefile2:1704: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_serial.dir/all] Error 2'
b'make[1]: *** Waiting for unfinished jobs....'
b'4 errors generated when compiling for sm_70.'
b'make[2]: *** [batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_cuda.dir/build.make:90: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_cuda.dir/backends/Test_Cuda_Batched_Dense.cpp.o] Error 1'
b'make[1]: *** [CMakeFiles/Makefile2:1650: batched/dense/unit_test/CMakeFiles/KokkosKernels_batched_dla_cuda.dir/all] Error 2'
b'[ 74%] Linking CXX executable KokkosKernels_lapack_cuda'
b'[ 74%] Built target KokkosKernels_lapack_cuda'
b'[ 74%] Linking CXX executable KokkosKernels_lapack_serial'
b'[ 74%] Built target KokkosKernels_lapack_serial'
b'[ 74%] Linking CXX executable KokkosKernels_batched_sla_serial'
b'[ 74%] Built target KokkosKernels_batched_sla_serial'
b'[ 74%] Linking CXX executable KokkosKernels_common_serial'
b'[ 74%] Built target KokkosKernels_common_serial'
b'[ 75%] Linking CXX executable KokkosKernels_batched_sla_cuda'
b'[ 75%] Built target KokkosKernels_batched_sla_cuda'
b'[ 75%] Linking CXX executable KokkosKernels_batched_gemm_serial'
b'[ 75%] Built target KokkosKernels_batched_gemm_serial'
b'[ 75%] Linking CXX executable KokkosKernels_common_cuda'
b'[ 75%] Built target KokkosKernels_common_cuda'
b'[ 75%] Linking CXX executable KokkosKernels_batched_gemm_cuda'
b'[ 75%] Built target KokkosKernels_batched_gemm_cuda'
b'[ 75%] Linking CXX executable KokkosKernels_blas_serial'
b'[ 75%] Built target KokkosKernels_blas_serial'
b'[ 76%] Linking CXX executable KokkosKernels_blas_cuda'
b'[ 76%] Built target KokkosKernels_blas_cuda'
b'make: *** [Makefile:146: all] Error 2'
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'#######################################################'
b'FAILED TESTS'
b'#######################################################'
b'clang-13.0.0-Cuda-release (build failed)'
b'#######################################################'
b'  # Reproducer instructions:'
b'  #   Load modules:'
b'        source /projects/ppc64le-pwr9-rhel8/legacy-env.sh'
b'        module purge'
b'        module load cmake/3.23.1 clang/13.0.0 openblas/0.3.20/gcc/9.3.0 cuda/10.1.243 gcc/9.3.0'
b'        export OMP_NUM_THREADS=8'
b'        export OMP_PROC_BIND=spread'
b'        export OMP_PLACES=cores'
b'        export KOKKOS_NUM_THREADS=8'
b''
b'  #     $KOKKOSKERNELS_PATH/cm_generate_makefile.bash --with-devices=Cuda --arch=Power9,Volta70 --compiler=/home/projects/ppc64le-pwr9-nvidia/spack/opt/spack/linux-rhel7-power9le/gcc-7.4.0/llvm-13.0.0-t6hzufjroylzhs7hg3dvmhrrcsvhygzv/bin/clang++ --cxxflags="-O3 -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized " --cxxstandard="17" --ldflags="" --with-cuda=/home/projects/ppc64le-pwr9-nvidia/cuda/10.1.243  --kokkos-path=$KOKKOS_PATH --kokkoskernels-path=$KOKKOSKERNELS_PATH --with-scalars=\'double,complex_double\' --with-ordinals=int --with-offsets=int,size_t --with-layouts=LayoutLeft --with-tpls=cusparse,cublas,blas --user-blas-path=/home/projects/ppc64le-pwr9/spack/opt/spack/linux-rhel7-power9le/gcc-9.3.0/openblas-0.3.20-wt32he2mqdzpqfzdbyhiwaqibx6j6s3l/lib --user-lapack-path=/home/projects/ppc64le-pwr9/spack/opt/spack/linux-rhel7-power9le/gcc-9.3.0/openblas-0.3.20-wt32he2mqdzpqfzdbyhiwaqibx6j6s3l/lib --user-blas-lib=blas --user-lapack-lib=lapack --extra-linker-flags=-lgfortran,-lm --with-options= --with-cuda-options=   --no-examples  --cmake-flags= --kokkos-cmake-flags= '
b''
b'  #  To reload modules, reconfigure, rebuild, and retest directly from this failing build do the following:'
b'      # Move to the build directory'
b'        cd /home/jenkins/kkw/workspace/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10/KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10.1074/TestAll_2024-08-27_11.48.03/clang/13.0.0/Cuda-release'
b'      # To reload modules'
b'        source ./reload_modules.sh'
b'      # To reconfigure'
b'        ./call_generate_makefile.sh'
b'      # To rebuild'
b'        make -j'
b'      # To retest'
b'        ctest -V'
b'#######################################################'
b"Build step 'Execute shell' marked build as failure"
b'Finished: FAILURE'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GNU1021 # 728 (click to expand)

b'--num=N: Number of jobs to run in parallel'
b'--spot-check: Minimal test set to issue pull request'
b'--spot-check-tpls: Minimal test set enabling blas and lapack tpls'
b'--timeout: Max time before ctest timeout (in seconds)'
b'--dry-run: Just print what would be executed'
b"--build-only: Just do builds, don't run anything"
b'--opt-flag=FLAG: Optimization flag (default: -O3)'
b'--cxxflags-extra=FLAGS: Extra flags to be added to CXX_FLAGS'
b'--ldflags-extra=FLAGS: Extra flags to be added to LD_FLAGS'
b''
b'--arch=ARCHITECTURE: overwrite architecture flags'
b'                     Provide a comma-separated list of arch codes (see available at link below):'
b'                       https://github.com/kokkos/kokkos/wiki/Compiling#table-43-architecture-variables'
b''
b'--with-cuda-options=OPT: set KOKKOS_CUDA_OPTIONS'
b'                         Provide a comma-separated list from the following valid items:'
b'                           force_uvm,use_ldg,enable_lambda,rdc'
b''
b'--with-options=OPT: set KOKKOS_OPTIONS'
b'                    Provide a comma-separated list from the following valid items:'
b'                      compiler_warnings'
b'                      aggressive_vectorization = add ivdep on loops'
b'                      disable_profiling = do not compile with profiling hooks'
b''
b'--build-list=BUILD,BUILD,BUILD...'
b'    Provide a comma-separated list of builds instead of running all builds'
b'    Valid items:'
b'      OpenMP, Threads, Serial, OpenMP_Serial, Threads_Serial'
b'      Cuda_OpenMP, Cuda_Threads, Cuda_Serial'
b''
b'--with-scalars=SCALARS: set KOKKOSKERNELS_SCALARS'
b'    Provide a comma-separated list scalar types'
b'    Valid items:'
b'      float, complex_float, double, complex_double'
b'        Example: SCALARS=double,complex_double'
b''
b'--with-ordinals=ORDS: set KOKKOSKERNELS_ORDINALS'
b'    Provide a comma-separated list ordinal types'
b'    Valid items:'
b'      int, int64_t'
b''
b'--with-offsets=OFFS: set KOKKOSKERNELS_OFFSETS'
b'    Provide a comma-separated list offset types'
b'    Valid items:'
b'      int, size_t'
b''
b'--with-layouts=LAYOUTS: set KOKKOSKERNELS_LAYOUTS'
b'    Provide a comma-separated list layouts'
b'    Valid items:'
b'      LayoutLeft,LayoutRight'
b''
b'--no-default-eti:  Do not include default ETI types for Kokkos Kernels'
b''
b'--disable-test-eti-only:  Do not restrict testing to ETI types for Kokkos Kernels'
b''
b'--with-spaces=SPACES:       Set spaces to be instantiated.'
b'                                Options: hostspace, cudaspace, cudauvmspace'
b''
b'--disable-perftests:  Do not build perftests for Kokkos Kernels'
b''
b'--enable-perftests:  build perftests for Kokkos Kernels (default)'
b''
b'--make-par-level=N:  Set parallelism level for builds (default: N=12)'
b''
b'--with-tpls=TPLS: set KOKKOSKERNELS_ENABLE_TPLS'
b'    Provide a comma-separated list of TPLs'
b'    Valid items:'
b'      blas, mkl, cublas, cusparse, cusolver, magma, armpl, rocblas, rocsparse, rocsolver'
b''
b'--cmake-flags=[CMAKE Command options]:  Set Kokkos Kernels cmake options not handled by script'
b'--kokkos-cmake-flags=[CMAKE Command options]:  Set Kokkos cmake options not handled by script'
b''
b'ARGS: list of expressions matching compilers to test'
b'  supported compilers sems'
b'    gnu/10.3.1'
b'    llvm/10.0.1'
b''
b'Examples:'
b'  Run all tests'
b'  % test_all_sandia'
b''
b'  Run all gcc tests'
b'  % test_all_sandia gcc'
b''
b'  Run all gcc/4.8.4 and all intel tests'
b'  % test_all_sandia gcc/4.8.4 intel'
b''
b'  Run all tests in debug'
b'  % test_all_sandia --debug'
b''
b'  Run gcc/4.8.4 and only do OpenMP and OpenMP_Serial builds'
b'  % test_all_sandia gcc/4.8.4 --build-list=OpenMP,OpenMP_Serial'
b''
b'If you want to kill the tests, do:'
b'  hit ctrl-z'
b'  % kill -9 %1'
b''
b'srun: error: solo44: task 0: Exited with exit code 1'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_GNU1021_Light_LayoutRight # 715 (click to expand)

b'--num=N: Number of jobs to run in parallel'
b'--spot-check: Minimal test set to issue pull request'
b'--spot-check-tpls: Minimal test set enabling blas and lapack tpls'
b'--timeout: Max time before ctest timeout (in seconds)'
b'--dry-run: Just print what would be executed'
b"--build-only: Just do builds, don't run anything"
b'--opt-flag=FLAG: Optimization flag (default: -O3)'
b'--cxxflags-extra=FLAGS: Extra flags to be added to CXX_FLAGS'
b'--ldflags-extra=FLAGS: Extra flags to be added to LD_FLAGS'
b''
b'--arch=ARCHITECTURE: overwrite architecture flags'
b'                     Provide a comma-separated list of arch codes (see available at link below):'
b'                       https://github.com/kokkos/kokkos/wiki/Compiling#table-43-architecture-variables'
b''
b'--with-cuda-options=OPT: set KOKKOS_CUDA_OPTIONS'
b'                         Provide a comma-separated list from the following valid items:'
b'                           force_uvm,use_ldg,enable_lambda,rdc'
b''
b'--with-options=OPT: set KOKKOS_OPTIONS'
b'                    Provide a comma-separated list from the following valid items:'
b'                      compiler_warnings'
b'                      aggressive_vectorization = add ivdep on loops'
b'                      disable_profiling = do not compile with profiling hooks'
b''
b'--build-list=BUILD,BUILD,BUILD...'
b'    Provide a comma-separated list of builds instead of running all builds'
b'    Valid items:'
b'      OpenMP, Threads, Serial, OpenMP_Serial, Threads_Serial'
b'      Cuda_OpenMP, Cuda_Threads, Cuda_Serial'
b''
b'--with-scalars=SCALARS: set KOKKOSKERNELS_SCALARS'
b'    Provide a comma-separated list scalar types'
b'    Valid items:'
b'      float, complex_float, double, complex_double'
b'        Example: SCALARS=double,complex_double'
b''
b'--with-ordinals=ORDS: set KOKKOSKERNELS_ORDINALS'
b'    Provide a comma-separated list ordinal types'
b'    Valid items:'
b'      int, int64_t'
b''
b'--with-offsets=OFFS: set KOKKOSKERNELS_OFFSETS'
b'    Provide a comma-separated list offset types'
b'    Valid items:'
b'      int, size_t'
b''
b'--with-layouts=LAYOUTS: set KOKKOSKERNELS_LAYOUTS'
b'    Provide a comma-separated list layouts'
b'    Valid items:'
b'      LayoutLeft,LayoutRight'
b''
b'--no-default-eti:  Do not include default ETI types for Kokkos Kernels'
b''
b'--disable-test-eti-only:  Do not restrict testing to ETI types for Kokkos Kernels'
b''
b'--with-spaces=SPACES:       Set spaces to be instantiated.'
b'                                Options: hostspace, cudaspace, cudauvmspace'
b''
b'--disable-perftests:  Do not build perftests for Kokkos Kernels'
b''
b'--enable-perftests:  build perftests for Kokkos Kernels (default)'
b''
b'--make-par-level=N:  Set parallelism level for builds (default: N=12)'
b''
b'--with-tpls=TPLS: set KOKKOSKERNELS_ENABLE_TPLS'
b'    Provide a comma-separated list of TPLs'
b'    Valid items:'
b'      blas, mkl, cublas, cusparse, cusolver, magma, armpl, rocblas, rocsparse, rocsolver'
b''
b'--cmake-flags=[CMAKE Command options]:  Set Kokkos Kernels cmake options not handled by script'
b'--kokkos-cmake-flags=[CMAKE Command options]:  Set Kokkos cmake options not handled by script'
b''
b'ARGS: list of expressions matching compilers to test'
b'  supported compilers sems'
b'    gnu/10.3.1'
b'    llvm/10.0.1'
b''
b'Examples:'
b'  Run all tests'
b'  % test_all_sandia'
b''
b'  Run all gcc tests'
b'  % test_all_sandia gcc'
b''
b'  Run all gcc/4.8.4 and all intel tests'
b'  % test_all_sandia gcc/4.8.4 intel'
b''
b'  Run all tests in debug'
b'  % test_all_sandia --debug'
b''
b'  Run gcc/4.8.4 and only do OpenMP and OpenMP_Serial builds'
b'  % test_all_sandia gcc/4.8.4 --build-list=OpenMP,OpenMP_Serial'
b''
b'If you want to kill the tests, do:'
b'  hit ctrl-z'
b'  % kill -9 %1'
b''
b'srun: error: solo153: task 0: Exited with exit code 1'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_GNU1021_Light_LayoutRight'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_GNU1021 # 716 (click to expand)

b'--num=N: Number of jobs to run in parallel'
b'--spot-check: Minimal test set to issue pull request'
b'--spot-check-tpls: Minimal test set enabling blas and lapack tpls'
b'--timeout: Max time before ctest timeout (in seconds)'
b'--dry-run: Just print what would be executed'
b"--build-only: Just do builds, don't run anything"
b'--opt-flag=FLAG: Optimization flag (default: -O3)'
b'--cxxflags-extra=FLAGS: Extra flags to be added to CXX_FLAGS'
b'--ldflags-extra=FLAGS: Extra flags to be added to LD_FLAGS'
b''
b'--arch=ARCHITECTURE: overwrite architecture flags'
b'                     Provide a comma-separated list of arch codes (see available at link below):'
b'                       https://github.com/kokkos/kokkos/wiki/Compiling#table-43-architecture-variables'
b''
b'--with-cuda-options=OPT: set KOKKOS_CUDA_OPTIONS'
b'                         Provide a comma-separated list from the following valid items:'
b'                           force_uvm,use_ldg,enable_lambda,rdc'
b''
b'--with-options=OPT: set KOKKOS_OPTIONS'
b'                    Provide a comma-separated list from the following valid items:'
b'                      compiler_warnings'
b'                      aggressive_vectorization = add ivdep on loops'
b'                      disable_profiling = do not compile with profiling hooks'
b''
b'--build-list=BUILD,BUILD,BUILD...'
b'    Provide a comma-separated list of builds instead of running all builds'
b'    Valid items:'
b'      OpenMP, Threads, Serial, OpenMP_Serial, Threads_Serial'
b'      Cuda_OpenMP, Cuda_Threads, Cuda_Serial'
b''
b'--with-scalars=SCALARS: set KOKKOSKERNELS_SCALARS'
b'    Provide a comma-separated list scalar types'
b'    Valid items:'
b'      float, complex_float, double, complex_double'
b'        Example: SCALARS=double,complex_double'
b''
b'--with-ordinals=ORDS: set KOKKOSKERNELS_ORDINALS'
b'    Provide a comma-separated list ordinal types'
b'    Valid items:'
b'      int, int64_t'
b''
b'--with-offsets=OFFS: set KOKKOSKERNELS_OFFSETS'
b'    Provide a comma-separated list offset types'
b'    Valid items:'
b'      int, size_t'
b''
b'--with-layouts=LAYOUTS: set KOKKOSKERNELS_LAYOUTS'
b'    Provide a comma-separated list layouts'
b'    Valid items:'
b'      LayoutLeft,LayoutRight'
b''
b'--no-default-eti:  Do not include default ETI types for Kokkos Kernels'
b''
b'--disable-test-eti-only:  Do not restrict testing to ETI types for Kokkos Kernels'
b''
b'--with-spaces=SPACES:       Set spaces to be instantiated.'
b'                                Options: hostspace, cudaspace, cudauvmspace'
b''
b'--disable-perftests:  Do not build perftests for Kokkos Kernels'
b''
b'--enable-perftests:  build perftests for Kokkos Kernels (default)'
b''
b'--make-par-level=N:  Set parallelism level for builds (default: N=12)'
b''
b'--with-tpls=TPLS: set KOKKOSKERNELS_ENABLE_TPLS'
b'    Provide a comma-separated list of TPLs'
b'    Valid items:'
b'      blas, mkl, cublas, cusparse, cusolver, magma, armpl, rocblas, rocsparse, rocsolver'
b''
b'--cmake-flags=[CMAKE Command options]:  Set Kokkos Kernels cmake options not handled by script'
b'--kokkos-cmake-flags=[CMAKE Command options]:  Set Kokkos cmake options not handled by script'
b''
b'ARGS: list of expressions matching compilers to test'
b'  supported compilers sems'
b'    intel/19.1'
b'    gnu/10.3.1'
b''
b'Examples:'
b'  Run all tests'
b'  % test_all_sandia'
b''
b'  Run all gcc tests'
b'  % test_all_sandia gcc'
b''
b'  Run all gcc/4.8.4 and all intel tests'
b'  % test_all_sandia gcc/4.8.4 intel'
b''
b'  Run all tests in debug'
b'  % test_all_sandia --debug'
b''
b'  Run gcc/4.8.4 and only do OpenMP and OpenMP_Serial builds'
b'  % test_all_sandia gcc/4.8.4 --build-list=OpenMP,OpenMP_Serial'
b''
b'If you want to kill the tests, do:'
b'  hit ctrl-z'
b'  % kill -9 %1'
b''
b'srun: error: solo157: task 0: Exited with exit code 1'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_GNU1021'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_Tpls_INTEL19_solo # 720 (click to expand)

b'--num=N: Number of jobs to run in parallel'
b'--spot-check: Minimal test set to issue pull request'
b'--spot-check-tpls: Minimal test set enabling blas and lapack tpls'
b'--timeout: Max time before ctest timeout (in seconds)'
b'--dry-run: Just print what would be executed'
b"--build-only: Just do builds, don't run anything"
b'--opt-flag=FLAG: Optimization flag (default: -O3)'
b'--cxxflags-extra=FLAGS: Extra flags to be added to CXX_FLAGS'
b'--ldflags-extra=FLAGS: Extra flags to be added to LD_FLAGS'
b''
b'--arch=ARCHITECTURE: overwrite architecture flags'
b'                     Provide a comma-separated list of arch codes (see available at link below):'
b'                       https://github.com/kokkos/kokkos/wiki/Compiling#table-43-architecture-variables'
b''
b'--with-cuda-options=OPT: set KOKKOS_CUDA_OPTIONS'
b'                         Provide a comma-separated list from the following valid items:'
b'                           force_uvm,use_ldg,enable_lambda,rdc'
b''
b'--with-options=OPT: set KOKKOS_OPTIONS'
b'                    Provide a comma-separated list from the following valid items:'
b'                      compiler_warnings'
b'                      aggressive_vectorization = add ivdep on loops'
b'                      disable_profiling = do not compile with profiling hooks'
b''
b'--build-list=BUILD,BUILD,BUILD...'
b'    Provide a comma-separated list of builds instead of running all builds'
b'    Valid items:'
b'      OpenMP, Threads, Serial, OpenMP_Serial, Threads_Serial'
b'      Cuda_OpenMP, Cuda_Threads, Cuda_Serial'
b''
b'--with-scalars=SCALARS: set KOKKOSKERNELS_SCALARS'
b'    Provide a comma-separated list scalar types'
b'    Valid items:'
b'      float, complex_float, double, complex_double'
b'        Example: SCALARS=double,complex_double'
b''
b'--with-ordinals=ORDS: set KOKKOSKERNELS_ORDINALS'
b'    Provide a comma-separated list ordinal types'
b'    Valid items:'
b'      int, int64_t'
b''
b'--with-offsets=OFFS: set KOKKOSKERNELS_OFFSETS'
b'    Provide a comma-separated list offset types'
b'    Valid items:'
b'      int, size_t'
b''
b'--with-layouts=LAYOUTS: set KOKKOSKERNELS_LAYOUTS'
b'    Provide a comma-separated list layouts'
b'    Valid items:'
b'      LayoutLeft,LayoutRight'
b''
b'--no-default-eti:  Do not include default ETI types for Kokkos Kernels'
b''
b'--disable-test-eti-only:  Do not restrict testing to ETI types for Kokkos Kernels'
b''
b'--with-spaces=SPACES:       Set spaces to be instantiated.'
b'                                Options: hostspace, cudaspace, cudauvmspace'
b''
b'--disable-perftests:  Do not build perftests for Kokkos Kernels'
b''
b'--enable-perftests:  build perftests for Kokkos Kernels (default)'
b''
b'--make-par-level=N:  Set parallelism level for builds (default: N=12)'
b''
b'--with-tpls=TPLS: set KOKKOSKERNELS_ENABLE_TPLS'
b'    Provide a comma-separated list of TPLs'
b'    Valid items:'
b'      blas, mkl, cublas, cusparse, cusolver, magma, armpl, rocblas, rocsparse, rocsolver'
b''
b'--cmake-flags=[CMAKE Command options]:  Set Kokkos Kernels cmake options not handled by script'
b'--kokkos-cmake-flags=[CMAKE Command options]:  Set Kokkos cmake options not handled by script'
b''
b'ARGS: list of expressions matching compilers to test'
b'  supported compilers sems'
b'    intel/19.1'
b'    gnu/10.3.1'
b''
b'Examples:'
b'  Run all tests'
b'  % test_all_sandia'
b''
b'  Run all gcc tests'
b'  % test_all_sandia gcc'
b''
b'  Run all gcc/4.8.4 and all intel tests'
b'  % test_all_sandia gcc/4.8.4 intel'
b''
b'  Run all tests in debug'
b'  % test_all_sandia --debug'
b''
b'  Run gcc/4.8.4 and only do OpenMP and OpenMP_Serial builds'
b'  % test_all_sandia gcc/4.8.4 --build-list=OpenMP,OpenMP_Serial'
b''
b'If you want to kill the tests, do:'
b'  hit ctrl-z'
b'  % kill -9 %1'
b''
b'srun: error: solo158: task 0: Exited with exit code 1'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_Tpls_INTEL19_solo'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_CLANG1001_solo # 692 (click to expand)

b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_swap_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_gemv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_ger_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr2_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_gemm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trmm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trsm_eti_spec_decl.hpp.in'
b' create mode 100644 graph/eti/generated_specializations_hpp/KokkosGraph_color_d1_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_gesv_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_svd_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_trtri_eti_spec_decl.hpp.in'
b' create mode 100644 perf_test/sparse/KokkosSparse_sort_crs.cpp'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_bspgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_apply_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gmres_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_jacobi_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_noreuse_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_solve_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_trsv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/impl/KokkosSparse_sort_crs_impl.hpp'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo'
b'srun: INFO: Adding filesystem licenses to job: qscratch:1,gpfs:1'
b'srun: job 3109701 queued and waiting for resources'
b'srun: job 3109701 has been allocated resources'
b'Running on machine: solo'
b"KokkosKernels Repository Status:  c7be848dada7a93f361ae77c2c8ece9fb6d8e012 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  6ecdf605e0f7639adec599d25cf0e206d7b8f9f5 Merge pull request #6994 from ndellingwood/master-release-4.3.01'
b''
b''
b'Going to test compilers:  llvm/10.0.1'
b'Testing compiler llvm/10.0.1'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'Unrecognized compiler llvm/10.0.1 when looking for Spack variants'
b'  FAILED llvm-10.0.1-Threads_Serial-release'
b'SETUP_ENV: compiler=llvm/10.0.1 modules=cmake llvm/10.0.1 gnu/10.3.1'
b'Lmod has detected the following error: The following module(s) are unknown:'
b'"gnu/10.2.1"'
b''
b'Please check the spelling or version number. Also try "module spider ..."'
b'It is also possible your cache file is out-of-date; it may help to try:'
b'  $ module --ignore_cache load "gnu/10.2.1"'
b''
b'Also make sure that all modulefiles written in TCL start with the string'
b'#%Module'
b''
b'Executing this command requires loading "gnu/10.2.1" which failed while'
b'processing the following module(s):'
b''
b'    Module fullname  Module Filename'
b'    ---------------  ---------------'
b'    llvm/10.0.1      /apps/modules/modulefiles-apps/llvm/10.0.1.lua'
b''
b''
b'#######################################################'
b'PASSED TESTS'
b'#######################################################'
b'#######################################################'
b'FAILED TESTS'
b'#######################################################'
b'llvm-10.0.1-Threads_Serial-release (configure failed)'
b'#######################################################'
b'  # Reproducer instructions:'
b'  #   Load modules:'
b'        module purge'
b'        module load cmake llvm/10.0.1 gnu/10.3.1'
b'        export OMP_NUM_THREADS=8'
b'        export OMP_PROC_BIND=spread'
b'        export OMP_PLACES=cores'
b'        export KOKKOS_NUM_THREADS=8'
b''
b'cat: /gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo/KokkosKernels_PullRequest_CLANG1001_solo.692/TestAll_2024-08-27_11.56.23/llvm/10.0.1/Threads_Serial-release/call_generate_makefile_genericpath.sh: No such file or directory'
b'srun: error: solo44: task 0: Exited with exit code 1'
b'/gpfs/jenkins/workspace/KokkosKernels_PullRequest_CLANG1001_solo'
b'Finished: SUCCESS'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_VEGA90A_ROCM561 # 1166 (click to expand)

b' create mode 100644 batched/eti/generated_specializations_hpp/KokkosBatched_Gemm_t_nt_bll_eti_spec_decl.hpp.in'
b' create mode 100644 batched/eti/generated_specializations_hpp/KokkosBatched_Gemm_t_nt_blr_eti_spec_decl.hpp.in'
b' create mode 100644 batched/eti/generated_specializations_hpp/KokkosBatched_Gemm_t_t_bll_eti_spec_decl.hpp.in'
b' create mode 100644 batched/eti/generated_specializations_hpp/KokkosBatched_Gemm_t_t_blr_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_abs_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_abs_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_axpby_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_axpby_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_dot_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_dot_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_iamax_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_iamax_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_mult_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_mult_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm1_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm1_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm2_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm2_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm2w_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm2w_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrminf_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrminf_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_reciprocal_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_reciprocal_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rot_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotg_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotmg_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_swap_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_gemv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_ger_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr2_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_gemm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trmm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trsm_eti_spec_decl.hpp.in'
b' create mode 100644 graph/eti/generated_specializations_hpp/KokkosGraph_color_d1_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_gesv_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_svd_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_trtri_eti_spec_decl.hpp.in'
b' create mode 100644 perf_test/sparse/KokkosSparse_sort_crs.cpp'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_bspgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_apply_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gmres_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_jacobi_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_noreuse_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_solve_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_trsv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/impl/KokkosSparse_sort_crs_impl.hpp'
b'/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_ROCM561'
b'Running on machine: vega90a_caraway'
b"KokkosKernels Repository Status:  d286cedfb1057883468fe8ca975689c41675d6c6 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  6ecdf605e0f7639adec599d25cf0e206d7b8f9f5 Merge pull request #6994 from ndellingwood/master-release-4.3.01'
b''
b''
b'Going to test compilers:  rocm/5.6.1'
b'Testing compiler rocm/5.6.1'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'  Starting job rocm-5.6.1-Hip_Serial-release'
b'Hip IS THE KOKKOS DEVICE'
b'kokkos devices: Hip,Serial'
b'kokkos arch: VEGA90A'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: '
b'Build was aborted'
b'Aborted by Evan Harvey'
b'Finished: ABORTED'
b''

Console Output (last 100 lines) : KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561 # 675 (click to expand)

b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrm2w_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrminf_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_nrminf_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_reciprocal_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_reciprocal_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rot_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotg_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_rotmg_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_scal_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_sum_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_swap_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas1_update_mv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_gemv_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_ger_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr2_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas2_syr_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_gemm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trmm_eti_spec_decl.hpp.in'
b' create mode 100644 blas/eti/generated_specializations_hpp/KokkosBlas3_trsm_eti_spec_decl.hpp.in'
b' create mode 100644 graph/eti/generated_specializations_hpp/KokkosGraph_color_d1_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_gesv_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_svd_eti_spec_decl.hpp.in'
b' create mode 100644 lapack/eti/generated_specializations_hpp/KokkosLapack_trtri_eti_spec_decl.hpp.in'
b' create mode 100644 perf_test/sparse/KokkosSparse_sort_crs.cpp'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_bspgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_apply_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gauss_seidel_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_gmres_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_par_ilut_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spadd_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_jacobi_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_noreuse_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spgemm_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_numeric_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spiluk_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_bsrmatrix_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_mv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_spmv_struct_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_solve_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_sptrsv_symbolic_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/eti/generated_specializations_hpp/KokkosSparse_trsv_eti_spec_decl.hpp.in'
b' create mode 100644 sparse/impl/KokkosSparse_sort_crs_impl.hpp'
b'/home/jenkins/caraway-new/workspace/KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561'
b''
b'----------------------------------------------------------------------------'
b'  rocm:'
b'----------------------------------------------------------------------------'
b'     Versions:'
b'        rocm/5.2.0'
b'        rocm/5.6.1'
b'        rocm/6.0.0'
b''
b'----------------------------------------------------------------------------'
b'  For detailed information about a specific "rocm" package (including how to load the modules) use the module\'s full name.'
b'  Note that names that have a trailing (E) are extensions provided by other modules.'
b'  For example:'
b''
b'     $ module spider rocm/6.0.0'
b'----------------------------------------------------------------------------'
b''
b' '
b''
b'Running on machine: vega90a_caraway'
b"KokkosKernels Repository Status:  95fc8d460127fe8c957f53a9419af69dd11609e3 Merge remote-tracking branch 'upstream/develop' into HEAD"
b''
b'Kokkos Repository Status:  6ecdf605e0f7639adec599d25cf0e206d7b8f9f5 Merge pull request #6994 from ndellingwood/master-release-4.3.01'
b''
b''
b'Going to test compilers:  rocm/5.6.1'
b'Testing compiler rocm/5.6.1'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'Unrecognized compiler rocm/5.6.1 when looking for Spack variants'
b'  Starting job rocm-5.6.1-Hip_Serial-release'
b'Hip IS THE KOKKOS DEVICE'
b'kokkos devices: Hip,Serial'
b'kokkos arch: VEGA908A'
b'kokkos options: '
b'kokkos cuda options: '
b'kokkos cxxflags: -O3  '
b'extra_args: '
b"kokkoskernels scalars: 'double,complex_double'"
b'kokkoskernels ordinals: int'
b'kokkoskernels offsets: int,size_t'
b'kokkoskernels layouts: LayoutLeft'
b'kokkoskernels tpls list: ,blas,rocblas,rocsparse,rocsolver'
b'Build was aborted'
b'Aborted by Evan Harvey'
b'Finished: ABORTED'
b''

Yuuichi Asahi added 2 commits August 28, 2024 04:31
@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - - This Pull Request Requires Inspection... The code must be inspected by a member of the Team before Testing/Merging
NO INSPECTION HAS BEEN PERFORMED ON THIS PULL REQUEST! - This PR must be inspected by setting label 'AT: PRE-TEST INSPECTED'.

Copy link
Contributor

@lucbv lucbv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thanks for the updates

@lucbv lucbv added the AT: PRE-TEST INSPECTED Mark this PR as approved for testing. label Aug 28, 2024
@kokkos-devops-admin kokkos-devops-admin removed the AT: PRE-TEST INSPECTED Mark this PR as approved for testing. label Aug 28, 2024
@kokkos-devops-admin
Copy link

Status Flag 'Pre-Test Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED by label AT: PRE-TEST INSPECTED! Autotester is Removing Label; this inspection will remain valid until a new commit to source branch is performed.

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Testing Jenkins Projects:

Pull Request Auto Testing STARTING (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1489
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 1077
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 731
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 718
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 719
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 723
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 695
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 1169
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 678
  • Status: STARTED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Using Repos:

Repo: KOKKOSKERNELS (yasahi-hpc/kokkos-kernels)
  • Branch: implement-batched-serial-pttrs
  • SHA: 9a3628c
  • Mode: TEST_REPO

Pull Request Author: yasahi-hpc

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Jenkins Testing: all Jobs PASSED

Pull Request Auto Testing has PASSED (click to expand)

Build Information

Test Name: KokkosKernels_PullRequest_CUDA11_CUDA11_LayoutRight

  • Build Num: 1489
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10

  • Build Num: 1077
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021

  • Build Num: 731
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_GNU1021_Light_LayoutRight

  • Build Num: 718
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_GNU1021

  • Build Num: 719
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_Tpls_INTEL19_solo

  • Build Num: 723
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_CLANG1001_solo

  • Build Num: 695
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_ROCM561

  • Build Num: 1169
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

Build Information

Test Name: KokkosKernels_PullRequest_VEGA90A_Tpls_ROCM561

  • Build Num: 678
  • Status: PASSED

Jenkins Parameters

Parameter Name Value
KOKKOSKERNELS_SOURCE_REPO https://github.com/yasahi-hpc/kokkos-kernels
KOKKOSKERNELS_SOURCE_SHA 9a3628c
KOKKOSKERNELS_TARGET_BRANCH develop
KOKKOSKERNELS_TARGET_REPO https://github.com/kokkos/kokkos-kernels
KOKKOSKERNELS_TARGET_SHA d4c2511
PR_LABELS feature request
PULLREQUESTNUM 2277
TEST_REPO_ALIAS KOKKOSKERNELS

@kokkos-devops-admin
Copy link

Status Flag 'Pre-Merge Inspection' - SUCCESS: The last commit to this Pull Request has been INSPECTED AND APPROVED by [ lucbv ]!

@kokkos-devops-admin
Copy link

Status Flag 'Pull Request AutoTester' - Pull Request MUST BE MERGED MANUALLY BY Project Team - This Repo does not support Automerge

@lucbv lucbv merged commit 69811cc into kokkos:develop Aug 28, 2024
18 of 19 checks passed
lucbv added a commit that referenced this pull request Nov 25, 2024
* Fix2130 (#2132)

* Fix #2130

- Do not call BsrMatrix spmv impl if block size is 1
- Instead, convert it to unmanaged CrsMatrix and call spmv again
  - cuSPARSE returned an error code in this case
  - Better performance

* Formatting

* Remove redundant remove_pointer_t

Handle is already a non-pointer type

* Benchmark: modifying spmv benchmark to run range of spmv tests (#2135)

This could be further automated to run on matrix from suite sparse

* Kokkos Kernels: update version guards to drop old version of Kokkos (#2133)

Since we are now in the 4.2 series we only support up to 4.1.00.
Older version of Kokkos Core will require older version of Kokkos
Kernels for compatibility. Once 4.3.00 is out we will move to
drop support for the 4.1 series and only keep 4.2 and 4.3 series.

* ODE: BDF methods (#1930)

* ODE: adding BDF algorithms

Implementing BDF formula for stiff ODEs.
Orders 1 to 5 are available and tested.
The integrators can be called on GPU to
solve multiple systems in parallel.

* ODE: fixing storage handling for start-up RK stack

* ODE: clang-format

* ODE: first adaptive version of BDF

The current implementation only allows for adaptivity in time,
at this point the BDF Step actually converges as expected with
first order integration!

* ODE: fixing issues with adaptive BDF

The unit-test BDF_adaptive now shows the integration
of the logistic equation using adaptive time steps and
increasing integration order from 1 to 5.

* ODE: running BDF on StiffChemistry problem

The problem runs fine and is solved but there are oscillations
while the behavior of the solution is smooth. More investigation
is needed...

* BDF: fixing types and template parameters in batched calls

Bascially we need template parameters to be more versatile
and cannot assume that all rank1 views will have the exact
same underlying type, for instance layouts can be different.

* More fixes for GPUs only in tests this time.

* ODE: BDF adaptive, fix small bug

After adding rhs and update vectors to temp the subviews taken for
other variables need to be offset appropriately...

* Revert "More fixes for GPUs only in tests this time."

This reverts commit 2f70432761485bc6a4c65a1833e7299dd2c340e2.

* Revert "Revert "More fixes for GPUs only in tests this time.""

This reverts commit 836012bb529551727b3f5913057acad94dfe60df.

* ODE: BDF small change to temporarily avoid compile time issue

True fix involving a KOKKOS_VERSION check is upcoming after more
tests on GPU side...

* ODE: BDF fix for some printf statements that will go away soon...

* ODE: adding benchmark for BDF

The benchmark helps us monitor the performance of the BDF
implementaiton across multiple platforms as well as impact of
changes over time.

* ODE: improve benchmark interface...

* ODE: BDF changes to use RMS norm and change some default values

Small changes to compare more closely with reference implementation.
Some of these might be reverted eventually but that's fine for now.

* ODE: BDF convergence more stable and results look pretty good now!

Changing the Newton solver convergence criteria as well as changing
a few default input parameters leads to a more stable algorithms
which can now integrate the stiff Henderson autocatalytic example
well in 66 time steps instead of 200k for fixed order integration...

* ODE: BDF fix bug in initial time step calculation

The initial step routine was overwriting the initial right hand side
which led to obvious issues further down the road... now things should
work fine. Need to figure out if I can re-initialize the variables in
the perf test while excluding that time from each iteration.

* ODE: BDF removing bad print statement...

std::cout in device code

* ODE - BDF: improving perf test

Basically adding new untimed setup within the main loop of the
benchmark to reset the intial conditions, buffers and vectors
ahead of each iteration.

* Modifying unit-test to catch proper return type

* Applying clang-format

* cm_test_all_sandia: update caraway compilers

add rocm/5.6.1 and rocm/6.0.0, and openblas/0.3.23 as tpl

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro (#2134)

* Sparse MKL: changing the location of the MKL_SAFE_CALL macro

Moving the macro outside of namespaces to ensure that it will be
interpreted correctly when called from any other location in the
library.

It does not make much sense to guard Impl code in the Experimental
namespace and in this case it cleans up a problem with namespace
disambiguation for the compiler...

* Sparse BsrSpMV: removing Experimental namespace from Impl namespace

* Applying clang-format

* Sparse SpMV: fixing more namespace issues!

* Fixing missing descriptor for bsr spmv

* Kokkos Kernels: change the default offset ETI from size_t to int (#2140)

This change makes it easier for customer to leverage TPL support
which almost always requires offset=int, ordinal=int to be enabled
meaning that no TPL support is available with our default ETI...

* KokkosSparse_spmv_bsrmatrix_spec: fix Bsr_TC_Precision namespacing

Resolve compilation errors in nightly cuda/12.2 A100 build

* Drop comment for cleaner clang-format fix

* Fix usage of RAII to set cusparse/rocsparse stream (#2141)

Temporary objects like "A()" get destructed immediately.
For the object to have scope lifetime, it needs a name like "A a();".
This was causing cusparse/rocsparse spmv to always execute on the default stream,
causing incorrect timing in the spmv perf test.

* Use execution space operator== (#2136)

It actually is part of the public interface

* cm_test_all_sandia: more caraway module updates and cleanup (#2145)

* Spmv perftest improvements (#2146)

* Spmv perf test improvements

- Add option to flush caches by filling a dummy buffer between
iterations
- Add option to call the non-reuse interface instead of handle/reuse
interface
- Fix modes T, H in nonsquare case (make x,y the correct length)

* Fix mode help text

* KokkosKernels Utils: cleaning the zero_vector interface

One of the overload requires an unused template, removing that
extraneous template and simplify how that function is called in
a second overload.

* Kokkos Kernels: fixing call to zero_vector in Gauss-Seidel

* CMakeLists.txt: Update develop to 4.3.99

* Address reviewer comments

* Update CMakeLists.txt

Co-authored-by: brian-kelley <[email protected]>

* cm_test_all_sandia: solo updates

module updates post TOSS upgrade

* Fix signed/unsigned comparison warnings (#2150)

This is only hit when spmv is called with integer scalars,
which doesn't happen in our CI but does often in Tpetra.

* SPMV tpl fixes, cusparse workaround (#2152)

* SPMV tpl fixes, workaround

* Avoid possible integer conversion warnings

* Document cusparseSpMM algos that were tested

* Applying clang-format

* Use default_size_type as default offset in matrix types (#2149)

Now a declaration like CrsMatrix<Scalar, Ordinal, Device>
will by default use an ETI'd type combination (as int is the default
ETI'd offset)

* spmv bsr perftest: move fences to right place (#2153)

* Kokkos Kernels: removing old code branches ahead of 4.3.00 release (#2139)

Some old code branches kept for compatibility with Kokkos Core
versions less than 4.2.00 are dropped after release 4.3.00.
The largest changes are the removal of support the Kokkos print
macro in favor of Kokkos::printf() and the removal of half support
from Kokkos Kernels since it is now in Kokkos Core.

* Code for running performance measurements on ger() (#2082)

* Correct flop count

* Addressing feedbacks from Luc

* Using 'zero()' instead of '0.'

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation (#2157)

* KokkosBlas1_axpby.hpp: change debug macro guard for printInformation

- resolves test failures in Trilinos (MueLu) that rely on gold file diff
comparisons by removing extra output in debug builds

* fix compilation error

* Add user tolerance to Serial SVD (#2120)

* Add user tolerance to Serial SVD
---------

Co-authored-by: whorne <[email protected]>
Co-authored-by: Carl Pearson <[email protected]>

* Add a simple Harwell-Boeing file reader (#2155)

* Add a simple Harwell-Boeing file reader

And a test that validates against the MM reader.

* Support for symmetrize

* This loop can be simplified, there's no diag duplication

* Improve IO test

* spmv tpls: use correct bool for eti template param (#2160)

(It should just be the default,
KokkosSpars::Impl::eti_spec_avail<..>::value)

* Fix sparse_ioutils test on kokkos-dev (#2162)

Adding a ss.sync call fixes it but I have no idea why it
was needed here. All the other stringstream reads work fine.

* Fix #2156 (#2164)

spmv: add special path for rank-2 x/y, but where both have 1 column
and a TPL is available for rank-1 but not rank-2.

Also call "subhandle->set_exec_space" correctly in the TPLs to ensure
proper synchronization between setup, spmv and cleanup (in the case that
different exec instances are used in different calls)

* Updates from feedback runnig Trilinos testing

- Update debug level to > 1 guarding `printInformation(...)` in
  KokkosBlas1_axpby.hpp to reduce noisy test output
- Loosen tolerance of lapack.svd test to avoid random failures that
  occur near prior tolerance level

* Fix #2167: classic MKL doesn't use space instance (#2168)

* CHANGELOG.md: 4.3.00 update

* KokkosLapack_svd_tpl_spec_decl: defer to MKL spec when LAPACK also enabled

Resolves redefintion of struct SVD compilation errors with both MKL and LAPACK are enabled
Reported by @maartenarnst in https://github.com/trilinos/Trilinos/issues/12891

Co-authored-by: brian-kelley <[email protected]>

* .github/workflows: Added bdw.yml

* .github/workflows: Added spr.yml

* .github/workflows: Added mi210.yml

* .github/workflows: Added h100.yml

* .github/workflows: Added volta70.yml

* .github/workflows: Added power9.yml and power9_tpls.yml

* Jgfouca/block spiluk fixes (#2172)

* Progress

* Attempt 1, fix multiplication order

* Converges in 1 step

* Various cleanups

* Be sure not to reduce performance of unblocked impl

Also add some comments.

* Remove test mangling

* Fixes for GPU

* Fix warning

* formatting

* Increase eps for floats

* This is no longer needed

* .github/workflows: Add PR_VEGA908_ROCM561_HIP_SERIAL_LEFT_OPENBLAS_OPENLAPACK_REL

* Add guard for cusparse spmv_mv_tpl_spec_avail

Address issue #2175
Configuring with magma tpl enabled and cusparse disabled mistakenly triggers the cusparse tpl avail check to be true
Guard the KOKKOSSPARSE_SPMV_MV_TPL_SPEC_AVAIL_CUSPARSE macros when CUSPARSE is enabled to prevent this

* .github/workflows: Remove OPENLAPACK from names

* .github/workflows: Remove power9 until we have hardware to test it on

* .github/workflows: Enable rocblas in rocm tpl check

* .github/workflows: Remove volta70 until we have hardware to test it on

* Add early return if numRows == 0 in trsv to avoid integer divide-by-zero error

* Resolves multiple definition of Magma and Cuda singletons (#2178)

Address issue #2175

* magma: fix linker errors for builds without cusolver (#2181)

* magma: fix linker errors for builds without cusolver

* BatchedGemm test: workaround testing cublas+magma

- temporary workaround to skip magma test when cublas enabled to avoid issues like #2177

* .github/workflows/mi210: Fix include paths

* workflows/osx.yml: test against most recent kokkos tag

- test against most recent kokkos release rather than develop branch, as
  done with AT CI, to avoid compatibility breakages

* Resolve vortex compilation issue by resolving (potentially) duplicate symbol (#2183)

Stick to pattern of removing leading 'c' or 'z' in method name and relying on the template type

Co-authored-by: malphil <[email protected]>

* Changes to enable OneAPI usage with Trilinos build (#2185)

- Get rid of SYCL_OVERRIDE setting
- Add MKL_PROVIDES_BLAS_LAPACK cmake variable and define so that
  code knows whether to use int, or MKL_INT
  - Trilinos builds might link with OneAPI for GPU but standard
    BLAS/LAPACK on CPU

* Fix macOS docs build (#2190)

* Fix docs build

* try docs fix

* make sphinx available at config time

* GH-Actions: adding security actions and scorecard (#2192)

Pretty much taking the new files from PR #2191 and re-creating
it in a clean PR on top of develop with small changes relevant
to our repository.

Cleaning up some workflows to tailor it for our needs

* Scorecard: adding manual dispatch and target default branch (#2195)

* BsrMatrix: Fix HostMirror typedef (#2196)

It needed to have size_type.

* update changelog for 4.3.1

* Update CHANGELOG.md

* docs.yml: change kokkos version to latest release

- avoid version range check issues for release tests

* Sparse - SpGEMM: labeling spgemm_symbolic in TPL layer a bit more clearly (#2193)

This just improves the readability of the output from the tools as it now
has a symbolic matching the numeric phase. Previously we only had spgemm
in the label which is a bit confusing as it could be the whole spgemm time
i.e. both symbolic and numeric, additionally we had symbolic in the MKL
path but not in cusparse, rocsparse...

* SpMV: Test NaN, fix NaN handling when beta=0 (#2188)

* Test_Sparse_spmv_bsr.hpp: add NaNs to tests

* handle NaN in spmv_beta_transpose when beta=0

* handle nan in SpmvMergeHierarchical when beta=0

* Test NaNs in Y, don't reuse modifed Y, catch NaNs in results test

* remove unused <iostream> include

* explicit casting of zero

* Test_sparse_spmv.hpp: remove unused nans parameter

* KokkosSparse_spmv.hpp: CUDA11 can't detect this function always returns

* Test_Sparse_spmv.hpp: remove unused variable

* Run unit tests in correct execution space

* Test_Sparse_spmv.hpp: remove unused type aliases

* Kokkos::nan() -> KokkosKernels::Impl::quiet_NaN()

* Disable cuBLAS dot wrapper (#2206)

(not deleted, just guarded with #if 0 and comments explaining)

It performs significantly worse than our native impl on 11.2, 11.8 and 12.0 on V100.
This is in the dot perf test with a warm-up call.

https://github.com/trilinos/Trilinos/issues/12982 was a symptom of this.

* Fix spmv regressions (#2204)

* Restore cusparse spmv ALG2 path for imbalanced

With correct version cutoffs

* spmv: use separate rank-1 and rank-2 tpl subhandles

* Remove redundant single-column path in native spmv_mv

* Fix unused param warning

* c++17: add [[fallthrough]] attribute (#1493)

* c++17: add [[fallthrough]] attribute

* cm_test_all_sandia: -Wimplicit-fallthrough

* Enable 3 at2 builds (#2210)

* .github/mi210: Enable on PRs

* .github/mi210: Disable non-tpl build

* .github/bdw: Enable PR_BDW_GNU1020_OPENMP_SERIAL_LEFT_OPENBLAS_REL

* .github/h100: Enable PR_HOPPER90_CUDA1180_CUDA_LEFT_RIGHT_REL

* Bump ossf/scorecard-action from 2.0.6 to 2.3.3 (#2214)

Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.0.6 to 2.3.3.
- [Release notes](https://github.com/ossf/scorecard-action/releases)
- [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md)
- [Commits](https://github.com/ossf/scorecard-action/compare/99c53751e09b9529366343771cc321ec74e9bd3d...dc50aa9510b46c811795eb24b2f1ba02a914e534)

---
updated-dependencies:
- dependency-name: ossf/scorecard-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 3.1.3 to 4.3.3 (#2215)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 3.1.3 to 4.3.3.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/a8a3f3ad30e3422c9c7b888a15615d19a852ae32...65462800fd760344b1a7b4382951275a0abb4808)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/dependency-review-action from 2.5.1 to 4.3.2 (#2217)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 2.5.1 to 4.3.2.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/0efb1d1d84fc9633afcdaad14c485cbbc90ef46c...0c155c5e8556a497adf53f2c18edabf945ed8e70)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.7.1 to 2.8.0 (#2218)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.7.1 to 2.8.0.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/a4aa98b93cab29d9b1101a6143fb8bce00e2eac4...f086349bfa2bd1361f7909c78558e816508cdc10)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 2.25.3 to 3.25.6 (#2216)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 2.25.3 to 3.25.6.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/ceaec5c11a131e0d282ff3b6f095917d234caace...9fdb3e49720b44c48891d036bb502feb25684276)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Kokkos Kernels: initial security policy (#2220)

We state were patches will be made and how to report security vulnerabilities to the project.

* Kokkos Kernels: adding SHA for github actions (#2221)

* Kokkos Kernels: adding SHA for github actions

* Adding sha for docker images

* More dependencies fix (#2222)

* workflows: fix sha for label checker docker image

* workflows: fixing Docker files dependencies.

* Fix Docker files (#2223)

* SPMV TPLs: improve profile region labels (#2219)

- Mark rank-2 versions as "spmv_mv" not "spmv" (the native impl has
  this, and it's useful to know which one is being run)
- Add missing commas separating "BSRMATRIX" and the scalar type name

* cusparse spgemm: provide non-null row-ptr (#2213)

* Bump github/codeql-action from 3.25.6 to 3.25.7 (#2225)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.6 to 3.25.7.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/9fdb3e49720b44c48891d036bb502feb25684276...f079b8493333aace61c81488f8bd40919487bd9f)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/checkout from 3.6.0 to 4.1.6 (#2226)

Bumps [actions/checkout](https://github.com/actions/checkout) from 3.6.0 to 4.1.6.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/f43a0e5ff2bd294095638e18286ca9a3d1956744...a5ac7e51b41094c92402da3b24376905380afc29)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* README: adding link to scorecard (#2230)

* spmv_mv wrappers for rocsparse (#2233)

* spmv_mv wrappers for rocsparse (rocsparse_spmm())

* Use consistent types for alpha/beta in spmv wrappers

* Bump step-security/harden-runner from 2.8.0 to 2.8.1 (#2236)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/f086349bfa2bd1361f7909c78558e816508cdc10...17d0e2bd7d51742c71671bd19fa12bdc9d40a3d6)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.7 to 3.25.8 (#2237)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.7 to 3.25.8.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/f079b8493333aace61c81488f8bd40919487bd9f...2e230e8fe0ad3a14a340ad0815ddb96d599d2aff)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/dependency-review-action from 4.3.2 to 4.3.3 (#2235)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.2 to 4.3.3.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/0c155c5e8556a497adf53f2c18edabf945ed8e70...72eb03d02c7872a771aacd928f3123ac62ad6d3a)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add testing for transpose corner cases (#2234)

* Add testing for transpose corner cases

crs, bsr, graph: test cases that are 0x0, 100x0 and 0x100. In these cases make
sure the matrix generator doesn't try to insert any entries (nnz = 0).

* Update sparse/unit_test/Test_Sparse_Transpose.hpp

Co-authored-by: Luc Berger <[email protected]>

* Update sparse/unit_test/Test_Sparse_Transpose.hpp

Co-authored-by: Luc Berger <[email protected]>

---------

Co-authored-by: Luc Berger <[email protected]>

* Graph: removing executable from repo (#2239)

* Fix logic around merge path with TPLs (#2240)

SPMV_MERGE_PATH is not always a native algorithm. Add
SPMV_NATIVE_MERGE_PATH to cover that case specifically. Test this new
option.

* spgemm unit test: change matrix value distribution (#2241)

Change the distribution A, B values are sampled from so that
values in C can't end up close to 0 (as the result of summing terms
that are larger). The relative error metric in is_same_matrix is sensitive
to this.

Fixes #2232

* kokkoskernels_tpls.cmake: remove duplicates arguments when creating argument for exported INTERFACE_INCLUDE_DIRECTORIES

Attempt to workaround issue #2238

* Sparse - BsrMatrix: adding new wiki example for documentation (#2228)

There is already an example for this but it uses a CrsMatrix as
starting point to build a BsrMatrix which is not really helpful in
general as the hope is that you can use the BsrMatrix without needing
the CrsMatrix as it would double the storage needed...

Addressing Kim's comments

* Sparse - CrsToBsr: fix type mismatch (#2242)

* Update rocsparse algo defaults (#2245)

* Update default spmv algorithms for rocsparse

- Use stream for common cases (default, fast setup) as it has nearly zero
  setup cost and performs well for somewhat balanced matrices
- Use adaptive (which is rocsparse's default) only if SPMV_MERGE_PATH
  is the algorithm, as it has a very high setup cost

* Re-enable rocsparse spmv for SPMV_FAST_SETUP

* In deprecated spmv, fix Controls algorithm mapping (#2246)

native -> SPMV_NATIVE
native-merge -> SPMV_NATIVE_MERGE_PATH
merge -> SPMV_MERGE_PATH
tpl -> SPMV_FAST_SETUP

* Add batched serial tbsv (#2202)

* Add batched serial tbsv

* remove incx argument and use strided views instead

* Add a new line at the end of files

* fix random number generation for complex numbers

* remove unused variables from internal tbsv serial functions

* remove allclose for testing

---------

Co-authored-by: Yuuichi Asahi <[email protected]>

* Bump actions/checkout from 4.1.6 to 4.1.7 (#2248)

Bumps [actions/checkout](https://github.com/actions/checkout) from 4.1.6 to 4.1.7.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/a5ac7e51b41094c92402da3b24376905380afc29...692973e3d937129bcbf40652eb9f2f61becf3332)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.8 to 3.25.10 (#2249)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.8 to 3.25.10.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/2e230e8fe0ad3a14a340ad0815ddb96d599d2aff...23acc5c183826b7a8a97bce3cecc52db901f8251)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* FindTPLROC*: updates to fix export of import targets

Changes for the Rocm tpls to match the handling as done with the Cuda tpls

Should resolve issue #2238

* Fix warning about memcpy (#2252)

When building Stokhos BlockCrs, this util function gave a warning
about memcpy modifying a non-trivially-copyable type. Silence it
by casting to void*

* RCM fixes, improvements (#2254)

* Fix RCM starting vertex issue, improve testing
* apply reversing as labels are computed
instead of at the end. Saves a loop over all the labels
* use min-degree starting vertex within each connected component

* spgemm: add profiling regions to native implementations (#2253)

* spgemm: add profiling regions to native implementations

* Add profiling region to KokkosSPGEMM::KokkosSPGEMM_symbolic

* sparse: replace macros with constexpr bools (#2260)

* Rename `Impl::alignPtr` to `Impl::alignPtrTo`, allow it to infer argument type (#2261)

* KokkosKernels::Impl::alignPtr infers argument type

* Rename KokkosKernels::Impl::alignPtr -> alignPtrTo

* Bump github/codeql-action from 3.25.10 to 3.25.11 (#2263)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.10 to 3.25.11.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/23acc5c183826b7a8a97bce3cecc52db901f8251...b611370bb5703a7efb587f9d136a52ea24c5c38c)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* sparse: spadd_symbolic fences before device values used on host (#2259)

* sparse: spadd_symbolic fences before device values used on host

* sparse: use prefix sum to remove explicit spadd fence

* sycl: use alternative `alignPtrTo` when SYCL is enabled (SpGEMM) (#2262)

* sycl: use alternative alignPtr when SYCL is enabled

The current alignPtr, as well as two other alternatives below, do not work on SYCL on Intel PVC.

unsigned int f1(unsigned int i, unsigned int align) {
    return ((i + align - 1) / align * align);
}

unsigned int f2(unsigned int i, unsigned int align) {
    return (i + align - 1) & (-align);
}

* alignPtrTo unit tests

* Help gcc/8.3 with ctad issue

Resolves #2264

Co-authored-by: Carl Pearson <[email protected]>

* Bump actions/upload-artifact from 4.3.3 to 4.3.4 (#2266)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.3.3 to 4.3.4.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/65462800fd760344b1a7b4382951275a0abb4808...0b2256b8c012f0828dc542b3febcab082c67f72b)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* handle_t* -> unique_ptr<handle_t> in Bsr SpMV unit tests (#2269)

* Workarounds for removed cusparse functions (#2270)

cusparse 12.5 removed some functions that were deprecated, like the ILU
factorizations and the legacy csrsv (sparse triangular solve) functions.
As a workaround, if the cusparse version is >= 12.5 then disable the paths
in perftests that call those.

* BLAS - gemv: using fallback when mode is 't' or 'c' and onemkl is used (#2272)

* Implement batched serial pttrf (#2256)

* Batched serial pttrf implementation

* fix: use GEMM to add matrices

* fix: initialization order

* fformat

* fix: temporary variable in a test code

* fix: docstring of pttrf

* check_positive_definitiveness only if KOKKOSKERNELS_DEBUG_LEVEL > 0

* Improve the test for pttrf

* fix: int type

* fix: cleanup tests for SerialPttrf

* cleanup: remove unused deep_copies

* fix: docstrings and comments for pttrf

* ConjTranspose with conj and Transpose

* quick return in pttrf for size 1 or 0 matrix

* Add tests for invalid input

* fix: info computation

---------

Co-authored-by: Yuuichi Asahi <[email protected]>

* A little sptrsv cleanup before the main block effort (#2247)

* Some cleanup and refactoring

* First round of cleanup complete

* Fix a couple warnings

* formatting

* Sparse - SpMV: removing calls to unsuported oneapi - MKL functions (#2274)

* Sycl gemv beta (#2276)

* BLAS - GEMV: zero out Y when beta == 0 in SYCL TPL code path

* BLAS - GEMV: reverting wrong change from previous PR, my bad.

* Applying clang-format

* Unify alignPtrTo implementation (#2275)

* init (#2273)

* Bigger sptrsv cleanup (#2280)

* Some cleanup and refactoring
* Remove Upper/Lower TriLvlSchedTP2SolverFunctors
* Remove Upper/Lower single block functors
* Remove unused TriLvlSchedTP1SingleBlockFunctorDiagValues and merge upper/lower tri_solve_cg
* Merge two big upper/lower branch of tri_solve_chain
* Merge upper/lower tri_solve_streams
* Switch over block spiluk precond test to use new block sptrsv

* Bump actions/dependency-review-action from 4.3.3 to 4.3.4 (#2279)

Bumps [actions/dependency-review-action](https://github.com/actions/dependency-review-action) from 4.3.3 to 4.3.4.
- [Release notes](https://github.com/actions/dependency-review-action/releases)
- [Commits](https://github.com/actions/dependency-review-action/compare/72eb03d02c7872a771aacd928f3123ac62ad6d3a...5a2ce3f5b92ee19cbb1541a4984c76d921601d7c)

---
updated-dependencies:
- dependency-name: actions/dependency-review-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.11 to 3.25.12 (#2278)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.11 to 3.25.12.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/b611370bb5703a7efb587f9d136a52ea24c5c38c...4fa2a7953630fd2f3fb380f21be14ede0169dd4f)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Propose increasing column limit to 120. (#2255)

* Change key files

* Full reformat

* Update format.yml

* Update ubuntu version for format checker

* A couple platforms do not correctly handle static complexes

* .github/workflows: Auto trigger AT2 ci upon review (#2243)

* Bump github/codeql-action from 3.25.12 to 3.25.13 (#2284)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.12 to 3.25.13.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/4fa2a7953630fd2f3fb380f21be14ede0169dd4f...2d790406f505036ef40ecba973cc774a50395aac)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.8.1 to 2.9.0 (#2283)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.8.1 to 2.9.0.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/17d0e2bd7d51742c71671bd19fa12bdc9d40a3d6...0d381219ddf674d61a7572ddd19d7941e271515c)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add big reformat commits to ignore revs for blame (#2286)

* Bump github/codeql-action from 3.25.13 to 3.25.15 (#2288)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.13 to 3.25.15.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/2d790406f505036ef40ecba973cc774a50395aac...afb54ba388a7dca6ecae48f608c4ff05ff4cc77a)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump ossf/scorecard-action from 2.3.3 to 2.4.0 (#2287)

Bumps [ossf/scorecard-action](https://github.com/ossf/scorecard-action) from 2.3.3 to 2.4.0.
- [Release notes](https://github.com/ossf/scorecard-action/releases)
- [Changelog](https://github.com/ossf/scorecard-action/blob/main/RELEASE.md)
- [Commits](https://github.com/ossf/scorecard-action/compare/dc50aa9510b46c811795eb24b2f1ba02a914e534...62b2cac7ed8198b15735ed49ab1e5cf35480ba46)

---
updated-dependencies:
- dependency-name: ossf/scorecard-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Sparse - coo2csr: deactivating test on PVC (#2290)

* Sparse - coo2csr: deactivating test on PVC

With this test deactivated everything in Kokkos Kernels passes.
Let us confirm that with a nightly build and potentially create
of SYCL CI-build.

* Protecting SYCL check with preprocessor guard

* cmake: add CMake language support for CUDA/HIP (#2173)

* cmake: add CMake language support for CUDA/HIP

* Apply suggestions of @brian-kelley

* Make sure clang is using a newer GCC on weaver (#2289)

It's clear from the name of the AT job GCC930_Light_Tpls_GCC930_Tpls_CLANG13CUDA10 that
the intent was to use gcc-9.3 but gcc-7.4.0 was the one actually being
used because the clang-13 module loads that gcc.

* Bump actions/upload-artifact from 4.3.4 to 4.3.5 (#2295)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.3.4 to 4.3.5.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/0b2256b8c012f0828dc542b3febcab082c67f72b...89ef406dd8d7e03cfd12d9e0a4a378f454709029)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* SpAdd handle: delete sort_option getter/setter (#2296)

SpAdd handle was originally a copy-paste of the spgemm
handle way back in #122, and included get_sort_option() and
set_sort_option() from spgemm. But these try to use the member
bool sort_option, which doesn't exist. Somehow these functions never
produced compile errors until someone tried to call them.

* Add support for BSRs to sptrsv (#2281)

* Some cleanup and refactoring

* First round of cleanup complete

* Fix a couple warnings

* formatting

* Cleanup progress

* Fixes

* Remove Upper/Lower TriLvlSchedTP2SolverFunctors

* Remove Upper/Lower single block functors

* Remove unused TriLvlSchedTP1SingleBlockFunctorDiagValues and merge upper/lower tri_solve_cg

* Merge two big upper/lower branch of tri_solve_chain

* Merge upper/lower tri_solve_streams

* progres

* progress

* Progress, test added

* progress and fixes

* prog

* prog but broken

* Prog with debug prints

* Remove extreme debug printing

* works

* all working

* Remove test mangling

* Switch over block spiluk precond test to use new block sptrsv

* More test cleanup

* Fixes for GPU warnings

* Conflicts resolved, still work needed

* builds

* Formatting

* Update work to latest format style

* Remove unused functions. Remove prints. Add barriers

* Minor fixes

* lset is not used

* Fix for clang

* formatting

* New impl approach

* Unset macro

* There's no reason to limit team-policy alg to MAX_VEC_SIZE

* formatting

* Add missing kokkos-inline-funcs

* Fix warnings

* format

* Some compilers throw shadow warnings in static functions (#2297)

* Update Solo GCC version: 10.2.1 -> 10.3.1 (#2300)

* github workflow: adding release workflow to have provenance for release artifacts (#2304)

This adds a workflow based on the scorecard recommendation for software provenance.
The workflow requires some manual actions to verify that the signiture in the release is correct.

* Improve crs/bsr sorting performance (#2293)

* CRS sorting improvements

- Wrote bulk sort/permutation based sorting for CRS graph, matrix, and
  BSR matrix (bulk = one large sort of all the entries, using row-major
  dense index as keys)
  - This is more performant for imbalanced entries per row
- If matrix dimensions are too large to do bulk sort, fall back to
  sorting within each row with a thread.

* Add perf test for sort_crs_matrix
* sort_crs: improve parallel labels
* Work around kokkos issue 7036
* sort_crs: replace radix sort lambda with functor
(Lambda segfaults with nvcc+openmp)
---------
Signed-off-by: Brian Kelley <[email protected]>

* Don't error out if graph unit tests disabled (#2305)

Graph unit tests, with TEST_ETI_ONLY=ON, require double and/or
float to be enabled as scalars. Instead of erroring out the configure,
just give a warning and disable the graph tests.

* Update version to 4.4.99 (#2302)

* Update version to 4.4.99

Update min kokkos version to 4.3.01

* update workflows yaml files to test with [email protected]

* Deprecate redundant team-level sort functions (#2306)

* Deprecate redundant team-level sort functions

These were moved into Kokkos core a long time ago with a nicer
interface and better testing. Replace our implementations with calls
to the Kokkos functions like Kokkos::Experimental::sort_team.

* Formatting

Signed-off-by: Brian Kelley <[email protected]>

* Use our own DefaultComparator, not use Kokkos impl

---------

Signed-off-by: Brian Kelley <[email protected]>

* Bump actions/upload-artifact from 4.3.5 to 4.3.6 (#2307)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.3.5 to 4.3.6.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/89ef406dd8d7e03cfd12d9e0a4a378f454709029...834a144ee995460fba8ed112a2fc961b36a5ec5a)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.9.0 to 2.9.1 (#2309)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.9.0 to 2.9.1.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/0d381219ddf674d61a7572ddd19d7941e271515c...5c7944e73c4c2a096b17a9cb74d65b6c2bbafbde)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.25.15 to 3.26.0 (#2308)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.15 to 3.26.0.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/afb54ba388a7dca6ecae48f608c4ff05ff4cc77a...eb055d739abdc2e8de2e5f4ba1a8b246daa779aa)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Eti extern marking (#2292)

* ETI: adding headers for template instantiation declarations

* ETI: adding more decl files to eti

Now all the sparse kernels have an ETI decl file to prevent
multiple template instantiation.

* ETI: adding BLAS kernels that are called from Sparse kernels.

* Fixing CMAKE logic

* clang-format

* ETI work: finish the declarations for BLAS and adding GRAPH, LAPACK and BATCHED

* clang-format

* Fixing two headers and rename Nrm2w<2-D> as Nrm2w_mv

* update changelog for 4.4.00 (#2310)

* Bump github/codeql-action from 3.26.0 to 3.26.2 (#2311)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.0 to 3.26.2.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/eb055d739abdc2e8de2e5f4ba1a8b246daa779aa...429e1977040da7a23b6822b13c129cd1ba93dbb2)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Restore size_t as default offset, in Tribits builds (#2313)

If building KokkosKernels standalone, leave int as the default offset
(this was the case since #2140). But if building KokkosKernels as a
Trilinos/Tribits package, then make size_t the default offset because
this is what the Tpetra stack currently uses.

Signed-off-by: Brian Kelley <[email protected]>

* Improve GH action to produce release artifacts (#2312)

* Fix minor typing error. (#2314)

* Bump github/codeql-action from 3.26.2 to 3.26.5 (#2315)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.2 to 3.26.5.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/429e1977040da7a23b6822b13c129cd1ba93dbb2...2c779ab0d087cd7fe7b826087247c2c81f27bfa6)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Implement batched serial pttrs (#2277)

* Implement batched serial pttrs

* Add tests for pttrs

* Add tag for pttrs

* fix: remove unnecessary specialization for pttrs internal

* format

* format

---------

Co-authored-by: Yuuichi Asahi <[email protected]>

* .github/workflows: Group jobs under common github-AT2 name (#2320)

* coo2csr: add parens to function calls (#2318)

* Bump github/codeql-action from 3.26.5 to 3.26.6 (#2323)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.5 to 3.26.6.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/2c779ab0d087cd7fe7b826087247c2c81f27bfa6...4dd16135b69a43b6c8efb853346f8437d92d3c93)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump actions/upload-artifact from 4.3.6 to 4.4.0 (#2324)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.3.6 to 4.4.0.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/834a144ee995460fba8ed112a2fc961b36a5ec5a...50769540e7f4bd5e21e526ee35c689e35e0d6874)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Add MAGMA TPL support for GESV on HIP backend (#2326)

* Small changes for MAGMA GESV on HIP

* Apply clang-format

* Relax eps to 1e-8 for multi-rhs tests

---------

Co-authored-by: Vinh Quang Dang (-EXP) <[email protected]>

* implement batched serial pbtrf (#2322)

* implement batched serial pbtrf

* fix: docstring

* Add tests for info

* fix CodeQL

* fix: type

* Add a analytical test case for pbtrf

---------

Co-authored-by: Yuuichi Asahi <[email protected]>

* remove unneeded volatile qualifier for Kokkos::Single (#2333)

* AT-2: adding non-TPL build for HIP backend (#2329)

* AT-2: adding non-TPL build for HIP backend

Clean-up a few things in the mi210 workflow as well should
make it a little bit more readable and maintainable.

* AT-2: updating hip no tpl runs-on tag

By reusing the same tag this should allow the job to run without issues
with the same runner as the tpl job.

* removing tabs and putting whitespaces instead

* hopefully fixing paths in mi210.yml

* AT-2: using env variable to get abs path, let's see...

* replacing tabs with whitespaces

* AST-2: Fixing mkdir?

* removing a tab...

* Fix typo in working-directory for kokkos build step

* another fix to a path

* Fixing quotation...

* Fixing working-dir which is already in $GITHUB_WORKSPACE

* fixing path in the kokkos-kernels configure step

* Fixing configure step for tpl build

* Bump github/codeql-action from 3.26.6 to 3.26.7 (#2338)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.6 to 3.26.7.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/4dd16135b69a43b6c8efb853346f8437d92d3c93...8214744c546c1e5c8f03dde8fab3a7353211988d)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump step-security/harden-runner from 2.9.1 to 2.10.1 (#2339)

Bumps [step-security/harden-runner](https://github.com/step-security/harden-runner) from 2.9.1 to 2.10.1.
- [Release notes](https://github.com/step-security/harden-runner/releases)
- [Commits](https://github.com/step-security/harden-runner/compare/5c7944e73c4c2a096b17a9cb74d65b6c2bbafbde...91182cccc01eb5e619899d80e4e971d6181294a7)

---
updated-dependencies:
- dependency-name: step-security/harden-runner
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Update changelog for 4.4.01 (#2340)

* Bump github/codeql-action from 3.26.7 to 3.26.8 (#2343)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.7 to 3.26.8.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/8214744c546c1e5c8f03dde8fab3a7353211988d...294a9d92911152fe08befb9ec03e240add280cb3)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.26.8 to 3.26.9 (#2348)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.8 to 3.26.9.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/294a9d92911152fe08befb9ec03e240add280cb3...461ef6c76dfe95d5c364de2f431ddbd31a417628)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* .github/workflows: Add remaining spr and bdw checks (#2321)

* .github/workflows: Add llvm1001 and inteloneapi AT2 checks

* .github/workflows: Enable gcc10 AT2 builds

* .github/workflows: Stage volta70 checks

* .github/workflows: Test spr against kokkos 4.3.01

* .github/workflows: Specify ETI types for spr

* .github/workflows: Disable two of the GNU1020 builds

* Add KokkosKernels::eager_initialize() to common (#2317)

* Add KokkosKernels::eager_initialize() to common

* Formatting

* TPL singletons: allow query of whether initialized

And test KokkosKernels::eager_initialize() using this

* Fix magma build

- build TPL singleton file into library when magma's enabled
- include correct magma TPL header with declarations, not definitions

* Modify validity checks for output views sizes in svd (#2350)

* Modify validity check for output views sizes in svd

* Apply clang format

* Apply clang format

* kk_is_gpu_exec_space() -> is_gpu_exec_space_v (#2354)

* Fix #2344: SVD hanging (#2345)

* Fix #2344: hang in SerialSVD

Add test for the case that failed.

* Add more SVD tests to exercise corner cases

Used temporary printfs in the two zero-diagonal codepaths to make
sure they are now covered by testing

* Revert replacing spadd example with reproducer

* Clean up debug printouts

* Fix -Wshadow

* SVD: Add more rank-deficient test cases

* Last fix for SVD, test cleanup

* SVD test fixes

* Comment both test cases from issue 2344

* workflow: adding permissions in workflow files (#2357)

This should make the OSS scorecard a little happier?

Signed-off-by: Luc <[email protected]>

* Bypass bulk sort in KokkosSparse::sort_crs_matrix sometimes (#2353)

* set [[maybe_unused]] attribute on mySys (#2355)

* Fix compilation error, kk_is_gpu_exec_space->is_gpu_exec_space_v (#2358)

Apply change consistent with PR #2354

* Bump github/codeql-action from 3.26.9 to 3.26.11 (#2360)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.9 to 3.26.11.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/461ef6c76dfe95d5c364de2f431ddbd31a417628...6db8d6351fd0be61f9ed8ebd12ccd35dcec51fea)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* sparse_sort_crs: fix column shuffle indices (#2346)

* random_shuffle -> shuffle

random_shuffle is deprecated / removed

* sparse_sort_crs: Fix row begin/end offset in entry shuffle

* workflows: fixing some action versions using current sha (#2361)

* Bump actions/checkout from 4.1.7 to 4.2.1 (#2363)

Bumps [actions/checkout](https://github.com/actions/checkout) from 4.1.7 to 4.2.1.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/692973e3d937129bcbf40652eb9f2f61becf3332...eef61447b9ff4aafe5dcd4e0bbf5d482be7e7871)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* workflow: update the documentation workflow to use pinned python modules (#2366)

* Workflow volta70 (#2356)

* worflow - volta70: cleaning up cmake recipes

* Adding volta70 to at2.yml

* gemm perf_test: print matrix sizes (#2362)

* workflows: fixing dependency for checkout action to make dependabot happy (#2369)

Signed-off-by: Luc <[email protected]>

* workflows: limit the permissions in volta70 build (#2370)

Reducing the permissions keeps the workflow and repository
safe from potential malicious deeds.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* OpenMPSmartStatic_SPMV.hpp: throw if posix_memalign fails (#2368)

* Bump actions/upload-artifact from 4.4.0 to 4.4.3 (#2378)

Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 4.4.0 to 4.4.3.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](https://github.com/actions/upload-artifact/compare/50769540e7f4bd5e21e526ee35c689e35e0d6874...b4b15b8c7c6ac21ea08fcf65892d2ee8f75cf882)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* Bump github/codeql-action from 3.26.11 to 3.26.12 (#2377)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.11 to 3.26.12.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/6db8d6351fd0be61f9ed8ebd12ccd35dcec51fea...c36620d31ac7c881962c3d9dd939c40ec9434f2b)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* set KokkosKernels_ENABLE_COMPONENT variables to value instead of variable name (#2380)

Signed-off-by: Carl Pearson <[email protected]>

* Block Sptrsv fixes (#2376)

* Go back to original impl with vector reduce

Signed-off-by: James Foucar <[email protected]>

* Disable tp1 test for clang+cuda+complex

Signed-off-by: James Foucar <[email protected]>

* Fixes and warn users about broken case

Signed-off-by: James Foucar <[email protected]>

* formatting

Signed-off-by: James Foucar <[email protected]>

* Fix comments and print skip message

Signed-off-by: James Foucar <[email protected]>

---------

Signed-off-by: James Foucar <[email protected]>

* Do not specify template argument when using Kokkos atomics (#2382)

Let Function Template Argument Deduction do its job, do not interfere.

Signed-off-by: Damien L-G <[email protected]>

* Blas1 asum: work around for openblas error with short vectors (#2384)

Signed-off-by: Carl William Pearson <[email protected]>

* Prefer atomic_assign(ptr, val) -> atomic_store(ptr, val) (#2383)

We are looking at deprecating atomic_assign() https://github.com/kokkos/kokkos/issues/7449
Use atomic_store() instead.

Signed-off-by: Damien L-G <[email protected]>

* search/replace KOKKOS_-prefixed macros (#2372)

Signed-off-by: Carl Pearson <[email protected]>

* Replace atomic_{inc, dec}[rement] (#2386)

Signed-off-by: Damien L-G <[email protected]>

* put default types in KokkosKernels namespace (#2341)

* put default types in KokkosKernels namespace

Signed-off-by: Carl Pearson <[email protected]>

* Deprecate global namespace default type aliases

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* perf_tests: remove false dependence on google test (#2385)

* perf_tests: remove false dependence on google test

Signed-off-by: Carl Pearson <[email protected]>

* KokkosBlas3_gemm_perf_test.hpp: remove comment

Signed-off-by: Carl Pearson <[email protected]>

* Test::epsilon -> Kokkos::ArithTraits<>::eps

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* Prefer `expected == atomic_compare_exchange(ptr, expected, desired)` (#2387)

* Prefer expected == atomic_compare_exchange(ptr, expected, desired)

Signed-off-by: Damien L-G <[email protected]>

* Reintroduce break statement that was accidentally removed

Signed-off-by: Damien L-G <[email protected]>

---------

Signed-off-by: Damien L-G <[email protected]>

* workflows: adding variable to set kokkos version to test against (#2389)

* workflows: adding variable to set kokkos version to test against

The variable is set directly in at2.yml so it can be uniformly
applied to all the "sub-workflows" making maintenance when a
new release of kokkos comes out easier.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* workflow: fix for the osx builds

The new variable to set the version of Kokkos Core to build against
needs to use the "in script syntax" not the bash syntax.

Signed-off-by: Luc Berger-Vergiat <[email protected]>

---------

Signed-off-by: Luc Berger-Vergiat <[email protected]>

* Bump github/codeql-action from 3.26.12 to 3.26.13 (#2391)

Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.26.12 to 3.26.13.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/github/codeql-action/compare/c36620d31ac7c881962c3d9dd939c40ec9434f2b...f779452ac5af1c261dce0346a8f964149f49322b)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* remove deprecation message on intel 19 (#2393)

Signed-off-by: Carl Pearson <[email protected]>

* deprecate `__KOKKOSBATCHED_PROMOTION__` (#2392)

* deprecate __KOKKOSBATCHED_PROMOTION__

Signed-off-by: Carl Pearson <[email protected]>

* __KOKKOSBATCHED_PROMOTION___DEPRECATED_MACRO -> KOKKOSBATCHED_IMPL_PROMOTION

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* Reduce duplicated code in trsv (#2388)

* Reduce duplicated code in trsv
A slightly different for loop can avoid having to
separate out the last iteration.

* Fix inner loops underflow
* Change to a more-readable loop

Signed-off-by: James Foucar <[email protected]>

* Implement batched serial laswp (#2395)

* Implement batched serial laswp

Signed-off-by: Yuuichi Asahi <[email protected]>

* fix: integer types in laswp test

Signed-off-by: Yuuichi Asahi <[email protected]>

* change loop order based on base layout

Signed-off-by: Yuuichi Asahi <[email protected]>

* update serial laswp test based on review

Signed-off-by: Yuuichi Asahi <[email protected]>

---------

Signed-off-by: Yuuichi Asahi <[email protected]>
Co-authored-by: Yuuichi Asahi <[email protected]>

* 2392 missed a case where __KOKKOSBATCHED_PROMOTION__ was defined (#2396)

Signed-off-by: Carl Pearson <[email protected]>

* Rename reserved identifiers (#2373)

* clang-tidy file for reserved identifiers

Signed-off-by: Carl Pearson <[email protected]>

* Batched: don't use reserved names in include guards

Signed-off-by: Carl Pearson <[email protected]>

* BLAS: don't use reserved names in include guards

Signed-off-by: Carl Pearson <[email protected]>

---------

Signed-off-by: Carl Pearson <[email protected]>

* implement batched serial pbtrs (#2330)

* implement …
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants