Commit Graph

2778 Commits (development)
 

Author SHA1 Message Date
sfilippone e18de650f2 Take out debug print 9 months ago
sfilippone eed4c574bc Take out obsolete ilu_fct files 9 months ago
sfilippone 6f92a5c37a Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 9 months ago
sfilippone 553531eefb Take out obsolete ilu_fct source files 9 months ago
sfilippone 2f575894fc Fix --with-cudacc in configure 9 months ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 9 months ago
sfilippone 4347c663c2 Change conftest **argv to recognize CUDA_VERSION. 9 months ago
sfilippone a2f92e616f Put VOLATILE under ifdef for FLANG 10 months ago
sfilippone 59e6df73a4 Make sure configure recognizes FLANG 10 months ago
sfilippone 0023b8ac78 Compile adjcncy_fnd_owner 10 months ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 10 months ago
sfilippone 373d841bce Don't need renaming of psi_gth and psi_sct 10 months ago
sfilippone 472f16f0df Fix compilation with --enable-serial 10 months ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 10 months ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 10 months ago
sfilippone 48455190ec Add GPU version of XYZW 10 months ago
sfilippone a11f328e62 Added CUDA version of XYZW 10 months ago
sfilippone 86be8ebcd0 New method W%XYZW() 10 months ago
sfilippone b5d5f97661 Improve cuda%zero() 11 months ago
sfilippone 0e269ed641 typo in Cabgdxyz 11 months ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 11 months ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 11 months ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 11 months ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 11 months ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 11 months ago
sfilippone 0568a83734 Fix ifdef and old code 11 months ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 11 months ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 11 months ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 11 months ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 11 months ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 11 months ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 11 months ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 11 months ago
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 11 months ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 11 months ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 11 months ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 11 months ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 11 months ago
Salvatore Filippone 3121c43582 Silly bug in abgdxyz implementation 11 months ago
Salvatore Filippone 5c3d5f0235 Silly bug in abgdxyz implementation 11 months ago
Salvatore Filippone 29669b56a2 Implementation of psb_abgdxyz 11 months ago
Salvatore Filippone a942b47f7c Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 11 months ago
Salvatore Filippone 6c53b6ec79 Fix typo in interface for psb_abgdxyz 11 months ago
sfilippone 83ededd02b Implementatino of abgd_xyz 11 months ago
Salvatore Filippone 92a95699ba Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 11 months ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 11 months ago
sfilippone 45f00e6e19 Fixed comments 11 months ago
Salvatore Filippone 14c4ff0f32 Added new methd for two combined axpbys 11 months ago
Salvatore Filippone b49ce6b610 Merge branch 'repackage' into nond-rep 11 months ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 11 months ago