Commit Graph

2739 Commits (d01b8145c6f4de5b3beb4ebabc88c6ec69e52bfe)
 

Author SHA1 Message Date
sfilippone d01b8145c6 Fix cuda makefile dependencies 8 months ago
sfilippone d8ed01218d Cleanup hash_map using new indx_map%set_lc 8 months ago
sfilippone 7ec394ce1c Rename indx_map_mod and put SET_LR/C under ifdef 8 months ago
sfilippone 7dc64692cc Fix for OpenMP runs in hash_map_mod 8 months ago
Salvatore Filippone e711c53fab Make sure we compile when LPK /= IPK 8 months ago
Salvatore Filippone b5a32a59f9 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 8 months ago
Salvatore Filippone 773b79e7bc OpenMP in repl_map 8 months ago
Salvatore Filippone 98a9005602 Further advances on OpenMP versions of various index maps. 8 months ago
Salvatore Filippone fa86c91411 Fix OpenMP version of hash_map and hash 8 months ago
Salvatore Filippone 188dee6842 Add indx_map%inc_lc() method 8 months ago
sfilippone b99aa7a90f Switch off OMP in HASH g2l_ins 8 months ago
sfilippone 4e0a9e5db8 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 8 months ago
sfilippone e72c0f0bf9 Fix OMP impl of sparse-sparse product 8 months ago
Salvatore Filippone d444a12879 Condition call to x%sync() in vect_mv 8 months ago
Salvatore Filippone 5e2e1e34fd Introduce set_host() in inner_vect_sv 8 months ago
sfilippone 025350a361 Make sure realloc is always called with size >0 8 months ago
sfilippone ba8c32c507 Define merge_nd method 8 months ago
sfilippone aca1848401 New timings in CG 8 months ago
sfilippone e18de650f2 Take out debug print 8 months ago
sfilippone 6f92a5c37a Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 9 months ago
sfilippone 553531eefb Take out obsolete ilu_fct source files 9 months ago
sfilippone 2f575894fc Fix --with-cudacc in configure 9 months ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 9 months ago
sfilippone 4347c663c2 Change conftest **argv to recognize CUDA_VERSION. 9 months ago
sfilippone a2f92e616f Put VOLATILE under ifdef for FLANG 9 months ago
sfilippone 59e6df73a4 Make sure configure recognizes FLANG 9 months ago
sfilippone 0023b8ac78 Compile adjcncy_fnd_owner 9 months ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 9 months ago
sfilippone 373d841bce Don't need renaming of psi_gth and psi_sct 9 months ago
sfilippone 472f16f0df Fix compilation with --enable-serial 9 months ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 10 months ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 10 months ago
sfilippone 48455190ec Add GPU version of XYZW 10 months ago
sfilippone a11f328e62 Added CUDA version of XYZW 10 months ago
sfilippone 86be8ebcd0 New method W%XYZW() 10 months ago
sfilippone b5d5f97661 Improve cuda%zero() 10 months ago
sfilippone 0e269ed641 typo in Cabgdxyz 10 months ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 10 months ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 10 months ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 10 months ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 10 months ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 10 months ago
sfilippone 0568a83734 Fix ifdef and old code 10 months ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 10 months ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 10 months ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 10 months ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 10 months ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 10 months ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 10 months ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 10 months ago