Commit Graph

2733 Commits (ee140bc8dd89dff57a24c32c98f3b27af767ba4e)
 

Author SHA1 Message Date
gabrielequatrana ee140bc8dd Read/Write multivect fixed (SpMM bug) 2 years ago
gabrielequatrana a624b7098b Cuda multivect methods implementation 2 years ago
sfilippone c1e4f9c2b1 Merge branch 'repackage' into psblas-bgmres, fixes to resolve merge 2 years ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 2 years ago
sfilippone 4347c663c2 Change conftest **argv to recognize CUDA_VERSION. 2 years ago
sfilippone a2f92e616f Put VOLATILE under ifdef for FLANG 2 years ago
sfilippone 59e6df73a4 Make sure configure recognizes FLANG 2 years ago
gabrielequatrana 02fb43ba82 Fixed convergence 2 years ago
gabrielequatrana 6b785d66c6 Check convergence now working 2 years ago
gabrielequatrana 0839165bdc Added convergence check 2 years ago
sfilippone 0023b8ac78 Compile adjcncy_fnd_owner 2 years ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 2 years ago
sfilippone 373d841bce Don't need renaming of psi_gth and psi_sct 2 years ago
sfilippone 472f16f0df Fix compilation with --enable-serial 2 years ago
gabrielequatrana 1b79939255 Fixed some bugs (QR_fact serial) 2 years ago
gabrielequatrana 676652fcff Working parallel (QR_fact serial) 2 years ago
gabrielequatrana d10631530f Init Parallelize 2 years ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 2 years ago
gabrielequatrana 6987582c30 Done SERIAL 2 years ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 2 years ago
sfilippone 48455190ec Add GPU version of XYZW 2 years ago
sfilippone a11f328e62 Added CUDA version of XYZW 2 years ago
sfilippone 86be8ebcd0 New method W%XYZW() 2 years ago
sfilippone b5d5f97661 Improve cuda%zero() 2 years ago
sfilippone 0e269ed641 typo in Cabgdxyz 2 years ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 2 years ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 2 years ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 2 years ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 2 years ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 2 years ago
sfilippone 0568a83734 Fix ifdef and old code 2 years ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 2 years ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 2 years ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 2 years ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 2 years ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 2 years ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 2 years ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 2 years ago
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 2 years ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 2 years ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 2 years ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 2 years ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 2 years ago
Salvatore Filippone 3121c43582 Silly bug in abgdxyz implementation 2 years ago
Salvatore Filippone 5c3d5f0235 Silly bug in abgdxyz implementation 2 years ago
Salvatore Filippone 29669b56a2 Implementation of psb_abgdxyz 2 years ago
Salvatore Filippone a942b47f7c Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 2 years ago
Salvatore Filippone 6c53b6ec79 Fix typo in interface for psb_abgdxyz 2 years ago
sfilippone 83ededd02b Implementatino of abgd_xyz 2 years ago
Salvatore Filippone 92a95699ba Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 2 years ago