sfilippone
|
a2f92e616f
|
Put VOLATILE under ifdef for FLANG
|
9 months ago |
sfilippone
|
59e6df73a4
|
Make sure configure recognizes FLANG
|
9 months ago |
sfilippone
|
0023b8ac78
|
Compile adjcncy_fnd_owner
|
9 months ago |
sfilippone
|
3a25d7b04a
|
Fixes for LLVM compilation
|
9 months ago |
sfilippone
|
373d841bce
|
Don't need renaming of psi_gth and psi_sct
|
9 months ago |
sfilippone
|
472f16f0df
|
Fix compilation with --enable-serial
|
9 months ago |
sfilippone
|
e0a4d362fa
|
Define flag TRACK_CUDA_MALLOC
|
10 months ago |
Salvatore Filippone
|
b5f1442ac8
|
Merge branch 'nond-rep' into repackage
|
10 months ago |
sfilippone
|
48455190ec
|
Add GPU version of XYZW
|
10 months ago |
sfilippone
|
a11f328e62
|
Added CUDA version of XYZW
|
10 months ago |
sfilippone
|
86be8ebcd0
|
New method W%XYZW()
|
10 months ago |
sfilippone
|
b5d5f97661
|
Improve cuda%zero()
|
10 months ago |
sfilippone
|
0e269ed641
|
typo in Cabgdxyz
|
10 months ago |
Salvatore Filippone
|
d95077ffd6
|
Fix typo in vectordev_mod
|
10 months ago |
Salvatore Filippone
|
2d3773df98
|
CUDA kernels for ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
2a75d677d0
|
ABGDXYZ in vectordev_mod
|
10 months ago |
sfilippone
|
2391f64df6
|
X_cuda_vect%abgdxyz
|
10 months ago |
sfilippone
|
93c71c4316
|
Fix %ZERO() on cuda
|
10 months ago |
sfilippone
|
0568a83734
|
Fix ifdef and old code
|
10 months ago |
Salvatore Filippone
|
35d68aa4e3
|
Reuse calls to getDeviceProperties done at init time
|
10 months ago |
Salvatore Filippone
|
1ba8dfc7b7
|
Switch FOR and IF in AXPBY
|
10 months ago |
Salvatore Filippone
|
f9677bc892
|
Enabled new CUDA version of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
4681767ef8
|
New implementation for ABGDXYZ in CUDA
|
10 months ago |
Salvatore Filippone
|
105aa3c570
|
Intermediate impl of ABGDXYZ
|
10 months ago |
Salvatore Filippone
|
864872ecac
|
Intermediate implementation of abgdxyz on cuda
|
10 months ago |
Salvatore Filippone
|
a41b209144
|
Better AXPBY implementation in CUDA.
|
10 months ago |
Salvatore Filippone
|
f4c7604f61
|
Fix base implementation of abgdxyz to call set_host
|
10 months ago |
Salvatore Filippone
|
b8f9badf95
|
Fix interface between vect and base_vect%ABGD
|
11 months ago |
Salvatore Filippone
|
2a40b82b58
|
Fix typo in base_vect_mod
|
11 months ago |
Salvatore Filippone
|
4e611bb078
|
Enable psi_abgdxyz
|
11 months ago |
Salvatore Filippone
|
9ced67634d
|
Fix KIND for NR in axpby
|
11 months ago |
Salvatore Filippone
|
3121c43582
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
5c3d5f0235
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
29669b56a2
|
Implementation of psb_abgdxyz
|
11 months ago |
Salvatore Filippone
|
a942b47f7c
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
11 months ago |
Salvatore Filippone
|
6c53b6ec79
|
Fix typo in interface for psb_abgdxyz
|
11 months ago |
sfilippone
|
83ededd02b
|
Implementation of abgd_xyz
|
11 months ago |
Salvatore Filippone
|
92a95699ba
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
11 months ago |
Salvatore Filippone
|
ebc7c6b3b4
|
Fix call to base%abgdxyz
|
11 months ago |
sfilippone
|
45f00e6e19
|
Fixed comments
|
11 months ago |
Salvatore Filippone
|
14c4ff0f32
|
Added new method for two combined axpbys
|
11 months ago |
Salvatore Filippone
|
b49ce6b610
|
Merge branch 'repackage' into nond-rep
|
11 months ago |
sfilippone
|
6433dc797e
|
Fix CUDA implementation of %set_scal and %zero
|
11 months ago |
sfilippone
|
097d63147a
|
Fix cuda dir makefile
|
11 months ago |
sfilippone
|
3aa3c795e9
|
Refactor assembly and cnv
|
1 year ago |
sfilippone
|
4d051c777d
|
Fix makefile and test program
|
1 year ago |
sfilippone
|
49e99a3e82
|
Fix conversion and product to enable overlap with GPU
|
1 year ago |
sfilippone
|
74cf138a6c
|
Merge branch 'repackage' into non-diag
|
1 year ago |
sfilippone
|
be7571f568
|
Fix missing directive
|
1 year ago |
sfilippone
|
e9d1238b43
|
Add detailed measurements.
|
1 year ago |