sfilippone
|
d01b8145c6
|
Fix cuda makefile dependencies
|
8 months ago |
sfilippone
|
d8ed01218d
|
Cleanup hash_map using new indx_map%set_lc
|
8 months ago |
sfilippone
|
7ec394ce1c
|
Rename indx_map_mod and put SET_LR/C under ifdef
|
8 months ago |
sfilippone
|
7dc64692cc
|
Fix for OpenMP runs in hash_map_mod
|
8 months ago |
Salvatore Filippone
|
e711c53fab
|
Make sure we compile when LPK /= IPK
|
8 months ago |
Salvatore Filippone
|
b5a32a59f9
|
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
|
8 months ago |
Salvatore Filippone
|
773b79e7bc
|
OpenMP in repl_map
|
8 months ago |
Salvatore Filippone
|
98a9005602
|
Further advances on OpenMP versions of various index maps.
|
8 months ago |
Salvatore Filippone
|
fa86c91411
|
Fix OpenMP version of hash_map and hash
|
8 months ago |
Salvatore Filippone
|
188dee6842
|
Add indx_map%inc_lc() method
|
8 months ago |
sfilippone
|
b99aa7a90f
|
Switch off OMP in HASH g2l_ins
|
8 months ago |
sfilippone
|
4e0a9e5db8
|
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
|
8 months ago |
sfilippone
|
e72c0f0bf9
|
Fix OMP impl of sparse-sparse product
|
8 months ago |
Salvatore Filippone
|
d444a12879
|
Condition call to x%sync() in vect_mv
|
8 months ago |
Salvatore Filippone
|
5e2e1e34fd
|
Introduce set_host() in inner_vect_sv
|
8 months ago |
sfilippone
|
025350a361
|
Make sure realloc is always called with size >0
|
8 months ago |
sfilippone
|
ba8c32c507
|
Define merge_nd method
|
8 months ago |
sfilippone
|
aca1848401
|
New timings in CG
|
8 months ago |
sfilippone
|
e18de650f2
|
Take out debug print
|
8 months ago |
sfilippone
|
6f92a5c37a
|
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
|
9 months ago |
sfilippone
|
553531eefb
|
Take out obsolete ilu_fct source files
|
9 months ago |
sfilippone
|
2f575894fc
|
Fix --with-cudacc in configure
|
9 months ago |
sfilippone
|
0760e4d553
|
Fix C function declarations for compilation with LLVM/clang in CUDA
|
9 months ago |
sfilippone
|
4347c663c2
|
Change conftest **argv to recognize CUDA_VERSION.
|
9 months ago |
sfilippone
|
a2f92e616f
|
Put VOLATILE under ifdef for FLANG
|
9 months ago |
sfilippone
|
59e6df73a4
|
Make sure configure recognizes FLANG
|
9 months ago |
sfilippone
|
0023b8ac78
|
Compile adjcncy_fnd_owner
|
9 months ago |
sfilippone
|
3a25d7b04a
|
Fixes for LLVM compilation
|
9 months ago |
sfilippone
|
373d841bce
|
Don't need renaming of psi_gth and psi_sct
|
9 months ago |
sfilippone
|
472f16f0df
|
Fix compilation with --enable-serial
|
9 months ago |
sfilippone
|
e0a4d362fa
|
Define flag TRACK_CUDA_MALLOC
|
10 months ago |
Salvatore Filippone
|
b5f1442ac8
|
Merge branch 'nond-rep' into repackage
|
10 months ago |
sfilippone
|
48455190ec
|
Add GPU version of XYZW
|
10 months ago |
sfilippone
|
a11f328e62
|
Added CUDA version of XYZW
|
10 months ago |
sfilippone
|
86be8ebcd0
|
New method W%XYZW()
|
10 months ago |
sfilippone
|
b5d5f97661
|
Improve cuda%zero()
|
10 months ago |
sfilippone
|
0e269ed641
|
typo in Cabgdxyz
|
10 months ago |
Salvatore Filippone
|
d95077ffd6
|
Fix typo in vectordev_mod
|
10 months ago |
Salvatore Filippone
|
2d3773df98
|
CUDA kernels for ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
2a75d677d0
|
ABGDXYZ in vectordev_mod
|
10 months ago |
sfilippone
|
2391f64df6
|
X_cuda_vect%abgdxyz
|
10 months ago |
sfilippone
|
93c71c4316
|
Fix %ZERO() on cuda
|
10 months ago |
sfilippone
|
0568a83734
|
Fix ifdef and old code
|
10 months ago |
Salvatore Filippone
|
35d68aa4e3
|
Reuse calls to getDeviceProperties done at init time
|
10 months ago |
Salvatore Filippone
|
1ba8dfc7b7
|
Switch FOR and IF in AXPBY
|
10 months ago |
Salvatore Filippone
|
f9677bc892
|
Enabled new CUDA version of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
4681767ef8
|
New implementation for ABGDXYZ in CUDA
|
10 months ago |
Salvatore Filippone
|
105aa3c570
|
Intermediate impl of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
864872ecac
|
Intermediate implementation of abgdxyz on cuda
|
10 months ago |
Salvatore Filippone
|
a41b209144
|
Better AXPBY implementation in CUDA.
|
10 months ago |