sfilippone
|
c8cc2275d0
|
Fix cuda/makefile for make -j
|
7 months ago |
sfilippone
|
d01b8145c6
|
Fix cuda makefile dependencies
|
8 months ago |
sfilippone
|
e18de650f2
|
Take out debug print
|
8 months ago |
sfilippone
|
0760e4d553
|
Fix C function declarations for compilation with LLVM/clang in CUDA
|
9 months ago |
sfilippone
|
3a25d7b04a
|
Fixes for LLVM compilation
|
9 months ago |
sfilippone
|
e0a4d362fa
|
Define flag TRACK_CUDA_MALLOC
|
10 months ago |
Salvatore Filippone
|
b5f1442ac8
|
Merge branch 'nond-rep' into repackage
|
10 months ago |
sfilippone
|
48455190ec
|
Add GPU version of XYZW
|
10 months ago |
sfilippone
|
a11f328e62
|
Added CUDA version of XYZW
|
10 months ago |
sfilippone
|
b5d5f97661
|
Improve cuda%zero()
|
10 months ago |
sfilippone
|
0e269ed641
|
typo in Cabgdxyz
|
10 months ago |
Salvatore Filippone
|
d95077ffd6
|
Fix typo in vectordev_mod
|
10 months ago |
Salvatore Filippone
|
2d3773df98
|
CUDA kernels for ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
2a75d677d0
|
ABGDXYZ in vectordev_mod
|
10 months ago |
sfilippone
|
2391f64df6
|
X_cuda_vect%abgdxyz
|
10 months ago |
sfilippone
|
93c71c4316
|
Fix %ZERO() on cuda
|
10 months ago |
sfilippone
|
0568a83734
|
Fix ifdef and old code
|
10 months ago |
Salvatore Filippone
|
35d68aa4e3
|
Reuse calls to getDeviceProperties done at init time
|
10 months ago |
Salvatore Filippone
|
1ba8dfc7b7
|
Switch FOR and IF in AXPBY
|
10 months ago |
Salvatore Filippone
|
f9677bc892
|
Enabled new CUDA version of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
4681767ef8
|
New implementation for ABGDXYZ in CUDA
|
10 months ago |
Salvatore Filippone
|
105aa3c570
|
Intermediate impl of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
864872ecac
|
Intermediate implementation of abgdxyz on cuda
|
10 months ago |
Salvatore Filippone
|
a41b209144
|
Better AXPBY implementation in CUDA.
|
10 months ago |
Salvatore Filippone
|
ebc7c6b3b4
|
Fix call to base%abgdxyz
|
11 months ago |
Salvatore Filippone
|
14c4ff0f32
|
Added new methd for two combined axpbys
|
11 months ago |
sfilippone
|
6433dc797e
|
Fix CUDA implementation of %set_scal and %zero
|
11 months ago |
sfilippone
|
097d63147a
|
Fix cuda dir makefile
|
11 months ago |
Salvatore Filippone
|
20a01d4d71
|
Attempt at fixing CSRG in CUDA 10.2. Not complete yet.
|
1 year ago |
sfilippone
|
1bc2a884e2
|
Adjust conditional compilation on CUDA version
|
1 year ago |
Salvatore Filippone
|
62db7c0449
|
Fix spsv with CSRG handling of descriptors.
|
1 year ago |
Salvatore Filippone
|
d28ea462d9
|
Modified CSRG to work with latest versions; cusparse docs are unclear
|
1 year ago |
sfilippone
|
0230fbb7af
|
Identufied problems with CSRG. Will fix in a branch
|
1 year ago |
sfilippone
|
b2b7b074df
|
Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared)
|
1 year ago |
sfilippone
|
655c86caed
|
Updated docs.
|
1 year ago |
sfilippone
|
9b713c177b
|
Fix cuda interfaces for renaming
|
1 year ago |
sfilippone
|
6fa0bf7fe7
|
Complete cuda renaming
|
1 year ago |
sfilippone
|
6aa7987d52
|
Rename GPU into cuda, and merge SPGPU code.
|
1 year ago |