sfilippone
|
b99aa7a90f
|
Switch off OMP in HASH g2l_ins
|
8 months ago |
sfilippone
|
4e0a9e5db8
|
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
|
8 months ago |
sfilippone
|
e72c0f0bf9
|
Fix OMP impl of sparse-sparse product
|
8 months ago |
Salvatore Filippone
|
d444a12879
|
Condition call to x%sync() in vect_mv
|
8 months ago |
Salvatore Filippone
|
5e2e1e34fd
|
Introduce set_host() in inner_vect_sv
|
8 months ago |
sfilippone
|
025350a361
|
Make sure realloc is always called with size >0
|
8 months ago |
sfilippone
|
ba8c32c507
|
Define merge_nd method
|
8 months ago |
sfilippone
|
aca1848401
|
New timings in CG
|
8 months ago |
sfilippone
|
e18de650f2
|
Take out debug print
|
8 months ago |
sfilippone
|
6f92a5c37a
|
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
|
8 months ago |
sfilippone
|
553531eefb
|
Take out obsolete ilu_fct source files
|
8 months ago |
sfilippone
|
2f575894fc
|
Fix --with-cudacc in configure
|
8 months ago |
sfilippone
|
0760e4d553
|
Fix C function declarations for compilation with LLVM/clang in CUDA
|
9 months ago |
sfilippone
|
4347c663c2
|
Change conftest **argv to recognize CUDA_VERSION.
|
9 months ago |
sfilippone
|
a2f92e616f
|
Put VOLATILE under ifdef for FLANG
|
9 months ago |
sfilippone
|
59e6df73a4
|
Make sure configure recognizes FLANG
|
9 months ago |
sfilippone
|
0023b8ac78
|
Compile adjcncy_fnd_owner
|
9 months ago |
sfilippone
|
3a25d7b04a
|
Fixes for LLVM compilation
|
9 months ago |
sfilippone
|
373d841bce
|
Don't need renaming of psi_gth and psi_sct
|
9 months ago |
sfilippone
|
472f16f0df
|
Fix compilation with --enable-serial
|
9 months ago |
sfilippone
|
e0a4d362fa
|
Define flag TRACK_CUDA_MALLOC
|
10 months ago |
Salvatore Filippone
|
b5f1442ac8
|
Merge branch 'nond-rep' into repackage
|
10 months ago |
sfilippone
|
48455190ec
|
Add GPU version of XYZW
|
10 months ago |
sfilippone
|
a11f328e62
|
Added CUDA version of XYZW
|
10 months ago |
sfilippone
|
86be8ebcd0
|
New method W%XYZW()
|
10 months ago |
sfilippone
|
b5d5f97661
|
Improve cuda%zero()
|
10 months ago |
sfilippone
|
0e269ed641
|
typo in Cabgdxyz
|
10 months ago |
Salvatore Filippone
|
d95077ffd6
|
Fix typo in vectordev_mod
|
10 months ago |
Salvatore Filippone
|
2d3773df98
|
CUDA kernels for ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
2a75d677d0
|
ABGDXYZ in vectordev_mod
|
10 months ago |
sfilippone
|
2391f64df6
|
X_cuda_vect%abgdxyz
|
10 months ago |
sfilippone
|
93c71c4316
|
Fix %ZERO() on cuda
|
10 months ago |
sfilippone
|
0568a83734
|
Fix ifdef and old code
|
10 months ago |
Salvatore Filippone
|
35d68aa4e3
|
Reuse calls to getDeviceProperties done at init time
|
10 months ago |
Salvatore Filippone
|
1ba8dfc7b7
|
Switch FOR and IF in AXPBY
|
10 months ago |
Salvatore Filippone
|
f9677bc892
|
Enabled new CUDA version of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
4681767ef8
|
New implementation for ABGDXYZ in CUDA
|
10 months ago |
Salvatore Filippone
|
105aa3c570
|
Intermediate impl of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
864872ecac
|
Intermediate implementation of abgdxyz on cuda
|
10 months ago |
Salvatore Filippone
|
a41b209144
|
Better AXPBY implementation in CUDA.
|
10 months ago |
Salvatore Filippone
|
f4c7604f61
|
Fix base implementation of abgdxyz to call set_host
|
10 months ago |
Salvatore Filippone
|
b8f9badf95
|
Fix interface between vect and base_vect%ABGD
|
10 months ago |
Salvatore Filippone
|
2a40b82b58
|
Fix typo in base_vect_mod
|
11 months ago |
Salvatore Filippone
|
4e611bb078
|
Enable psi_abgdxyz
|
11 months ago |
Salvatore Filippone
|
9ced67634d
|
Fix KIND for NR in axpby
|
11 months ago |
Salvatore Filippone
|
3121c43582
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
5c3d5f0235
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
29669b56a2
|
Implementation of psb_abgdxyz
|
11 months ago |
Salvatore Filippone
|
a942b47f7c
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
11 months ago |
Salvatore Filippone
|
6c53b6ec79
|
Fix typo in interface for psb_abgdxyz
|
11 months ago |