Commit Graph

34 Commits (ef82b975e3d28147ff399c385d703d1371ecae13)

Author SHA1 Message Date
sfilippone ef82b975e3 Track CUDA allocation 11 months ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 11 months ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 12 months ago
sfilippone 48455190ec Add GPU version of XYZW 12 months ago
sfilippone a11f328e62 Added CUDA version of XYZW 12 months ago
sfilippone b5d5f97661 Improve cuda%zero() 12 months ago
sfilippone 0e269ed641 typo in Cabgdxyz 12 months ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 12 months ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 12 months ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 12 months ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 12 months ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 12 months ago
sfilippone 0568a83734 Fix ifdef and old code 1 year ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 1 year ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 1 year ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 1 year ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 1 year ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 1 year ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 1 year ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 1 year ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 1 year ago
Salvatore Filippone 14c4ff0f32 Added new method for two combined axpbys 1 year ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 1 year ago
sfilippone 097d63147a Fix cuda dir makefile 1 year ago
Salvatore Filippone 20a01d4d71 Attempt at fixing CSRG in CUDA 10.2. Not complete yet. 1 year ago
sfilippone 1bc2a884e2 Adjust conditional compilation on CUDA version 1 year ago
Salvatore Filippone 62db7c0449 Fix spsv with CSRG handling of descriptors. 1 year ago
Salvatore Filippone d28ea462d9 Modified CSRG to work with latest versions; cusparse docs are unclear 1 year ago
sfilippone 0230fbb7af Identified problems with CSRG. Will fix in a branch 1 year ago
sfilippone b2b7b074df Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared) 1 year ago
sfilippone 655c86caed Updated docs. 1 year ago
sfilippone 9b713c177b Fix cuda interfaces for renaming 1 year ago
sfilippone 6fa0bf7fe7 Complete cuda renaming 1 year ago
sfilippone 6aa7987d52 Rename GPU into cuda, and merge SPGPU code. 1 year ago