Commit Graph

2740 Commits (beb418e00ba0857ea6bea2b7647f70610e7dc4fb)
 

Author SHA1 Message Date
gabrielequatrana beb418e00b Fixed SpMM for ELG (AXPBY GPU not working) 11 months ago
gabrielequatrana c807d88c57 SpMM using Cusparse dedicated routine (CSRG) 11 months ago
Salvatore Filippone 399818a482 Never do arithmetic on a (void *) 11 months ago
Salvatore Filippone 1ecf36c9c3 Merge branch 'psblas-bgmres' of github.com:sfilippone/psblas3 into psblas-bgmres 11 months ago
Salvatore Filippone 7d74ebf5c4 Make multivectors work 11 months ago
gabrielequatrana 82715fec9b Fixed SpMM for CSRG 11 months ago
gabrielequatrana 409b51e609 Try to fix SpMM 11 months ago
gabrielequatrana ee140bc8dd Read/Write multivect fixed (SpMM bug) 11 months ago
gabrielequatrana a624b7098b Cuda multivect methods implementation 11 months ago
sfilippone c1e4f9c2b1 Merge branch 'repackage' into psblas-bgmres, fixes to resolve merge 11 months ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 11 months ago
sfilippone 4347c663c2 Change conftest **argv to recognize CUDA_VERSION. 11 months ago
sfilippone a2f92e616f Put VOLATILE under ifdef for FLANG 11 months ago
sfilippone 59e6df73a4 Make sure configure recognizes FLANG 11 months ago
gabrielequatrana 02fb43ba82 Fixed convergence 11 months ago
gabrielequatrana 6b785d66c6 Check convergence now working 11 months ago
gabrielequatrana 0839165bdc Added convergence check 11 months ago
sfilippone 0023b8ac78 Compile adjcncy_fnd_owner 11 months ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 11 months ago
sfilippone 373d841bce Don't need renaming of psi_gth and psi_sct 11 months ago
sfilippone 472f16f0df Fix compilation with --enable-serial 11 months ago
gabrielequatrana 1b79939255 Fixed some bugs (QR_fact serial) 11 months ago
gabrielequatrana 676652fcff Working parallel (QR_fact serial) 11 months ago
gabrielequatrana d10631530f Init Parallelize 11 months ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 11 months ago
gabrielequatrana 6987582c30 Done SERIAL 12 months ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 12 months ago
sfilippone 48455190ec Add GPU version of XYZW 12 months ago
sfilippone a11f328e62 Added CUDA version of XYZW 12 months ago
sfilippone 86be8ebcd0 New method W%XYZW() 12 months ago
sfilippone b5d5f97661 Improve cuda%zero() 1 year ago
sfilippone 0e269ed641 typo in Cabgdxyz 1 year ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 1 year ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 1 year ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 1 year ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 1 year ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 1 year ago
sfilippone 0568a83734 Fix ifdef and old code 1 year ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 1 year ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 1 year ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 1 year ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 1 year ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 1 year ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 1 year ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 1 year ago
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 1 year ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 1 year ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 1 year ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 1 year ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 1 year ago