Salvatore Filippone
|
2d3773df98
|
CUDA kernels for ABGDXYZ
|
9 months ago |
Salvatore Filippone
|
2a75d677d0
|
ABGDXYZ in vectordev_mod
|
9 months ago |
sfilippone
|
2391f64df6
|
X_cuda_vect%abgdxyz
|
9 months ago |
sfilippone
|
93c71c4316
|
Fix %ZERO() on cuda
|
9 months ago |
sfilippone
|
0568a83734
|
Fix ifdef and old code
|
9 months ago |
Salvatore Filippone
|
35d68aa4e3
|
Reuse calls to getDeviceProperties done at init time
|
9 months ago |
Salvatore Filippone
|
1ba8dfc7b7
|
Switch FOR and IF in AXPBY
|
9 months ago |
Salvatore Filippone
|
f9677bc892
|
Enabled new CUDA version of ABGDXYZ
|
9 months ago |
Salvatore Filippone
|
4681767ef8
|
New implementation for ABGDXYZ in CUDA
|
9 months ago |
Salvatore Filippone
|
105aa3c570
|
Intermediate impl of ABGDXYZ
|
9 months ago |
Salvatore Filippone
|
864872ecac
|
Intermediate implementation of abgdxyz on cuda
|
9 months ago |
Salvatore Filippone
|
a41b209144
|
Better AXPBY implementation in CUDA.
|
9 months ago |
Salvatore Filippone
|
f4c7604f61
|
Fix base implementation of abgdxyz to call set_host
|
9 months ago |
Salvatore Filippone
|
b8f9badf95
|
Fix interface between vect and base_vect%ABGD
|
9 months ago |
Salvatore Filippone
|
2a40b82b58
|
Fix typo in base_vect_mod
|
9 months ago |
Salvatore Filippone
|
4e611bb078
|
Enable psi_abgdxyz
|
9 months ago |
Salvatore Filippone
|
9ced67634d
|
Fix KIND for NR in axpby
|
9 months ago |
Salvatore Filippone
|
3121c43582
|
Silly bug in abgdxyz implementation
|
9 months ago |
Salvatore Filippone
|
5c3d5f0235
|
Silly bug in abgdxyz implementation
|
9 months ago |
Salvatore Filippone
|
29669b56a2
|
Implementation of psb_abgdxyz
|
9 months ago |
Salvatore Filippone
|
a942b47f7c
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
9 months ago |
Salvatore Filippone
|
6c53b6ec79
|
Fix typo in interface for psb_abgdxyz
|
9 months ago |
sfilippone
|
83ededd02b
|
Implementation of abgd_xyz
|
9 months ago |
Salvatore Filippone
|
92a95699ba
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
9 months ago |
Salvatore Filippone
|
ebc7c6b3b4
|
Fix call to base%abgdxyz
|
9 months ago |
sfilippone
|
45f00e6e19
|
Fixed comments
|
9 months ago |
Salvatore Filippone
|
14c4ff0f32
|
Added new method for two combined axpbys
|
9 months ago |
Salvatore Filippone
|
b49ce6b610
|
Merge branch 'repackage' into nond-rep
|
9 months ago |
sfilippone
|
097d63147a
|
Fix cuda dir makefile
|
10 months ago |
sfilippone
|
3aa3c795e9
|
Refactor assembly and cnv
|
11 months ago |
sfilippone
|
4d051c777d
|
Fix makefile and test program
|
11 months ago |
sfilippone
|
49e99a3e82
|
Fix conversion and product to enable overlap with GPU
|
11 months ago |
sfilippone
|
74cf138a6c
|
Merge branch 'repackage' into non-diag
|
11 months ago |
sfilippone
|
be7571f568
|
Fix missing directive
|
11 months ago |
sfilippone
|
e9d1238b43
|
Add detailed measurements.
|
11 months ago |
Salvatore Filippone
|
20a01d4d71
|
Attempt at fixing CSRG in CUDA 10.2. Not complete yet.
|
11 months ago |
sfilippone
|
1bc2a884e2
|
Adjust conditional compilation on CUDA version
|
11 months ago |
Salvatore Filippone
|
62db7c0449
|
Fix spsv with CSRG handling of descriptors.
|
11 months ago |
Salvatore Filippone
|
d28ea462d9
|
Modified CSRG to work with latest versions; cusparse docs are unclear
|
11 months ago |
sfilippone
|
6b65199afb
|
Check CUDA version for -dopt=on only from 11.7
|
11 months ago |
sfilippone
|
0230fbb7af
|
Identified problems with CSRG. Will fix in a branch
|
11 months ago |
sfilippone
|
41491f7b9c
|
Fix HAVE_CUDA in test programs
|
11 months ago |
sfilippone
|
b2b7b074df
|
Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared)
|
11 months ago |
sfilippone
|
e373ed7e0b
|
Modify configure to only use HAVE_CUDA, since SPGPU is recompiled.
|
11 months ago |
sfilippone
|
a6016f00fa
|
Bump PSBLAS version to 3.9
|
11 months ago |
sfilippone
|
ab8631439f
|
Update configure script
|
11 months ago |
sfilippone
|
6c9ca58282
|
Silly bug in coo insert
|
11 months ago |
sfilippone
|
d3b2b7816d
|
Fix coo insert OpenMP. Fix Make.inc.in
|
11 months ago |
sfilippone
|
655c86caed
|
Updated docs.
|
11 months ago |
sfilippone
|
9b713c177b
|
Fix cuda interfaces for renaming
|
11 months ago |