Salvatore Filippone
|
9ced67634d
|
Fix KIND for NR in axpby
|
11 months ago |
Salvatore Filippone
|
3121c43582
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
5c3d5f0235
|
Silly bug in abgdxyz implementation
|
11 months ago |
Salvatore Filippone
|
29669b56a2
|
Implementation of psb_abgdxyz
|
11 months ago |
Salvatore Filippone
|
a942b47f7c
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
11 months ago |
Salvatore Filippone
|
6c53b6ec79
|
Fix typo in interface for psb_abgdxyz
|
11 months ago |
sfilippone
|
83ededd02b
|
Implementation of abgd_xyz
|
11 months ago |
Salvatore Filippone
|
92a95699ba
|
Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep
|
11 months ago |
Salvatore Filippone
|
ebc7c6b3b4
|
Fix call to base%abgdxyz
|
11 months ago |
sfilippone
|
45f00e6e19
|
Fixed comments
|
11 months ago |
Salvatore Filippone
|
14c4ff0f32
|
Added new method for two combined axpbys
|
11 months ago |
Salvatore Filippone
|
b49ce6b610
|
Merge branch 'repackage' into nond-rep
|
11 months ago |
sfilippone
|
6433dc797e
|
Fix CUDA implementation of %set_scal and %zero
|
11 months ago |
sfilippone
|
097d63147a
|
Fix cuda dir makefile
|
11 months ago |
sfilippone
|
3aa3c795e9
|
Refactor assembly and cnv
|
1 year ago |
sfilippone
|
4d051c777d
|
Fix makefile and test program
|
1 year ago |
sfilippone
|
49e99a3e82
|
Fix conversion and product to enable overlap with GPU
|
1 year ago |
sfilippone
|
74cf138a6c
|
Merge branch 'repackage' into non-diag
|
1 year ago |
sfilippone
|
be7571f568
|
Fix missing directive
|
1 year ago |
sfilippone
|
e9d1238b43
|
Add detailed measurements.
|
1 year ago |
Salvatore Filippone
|
20a01d4d71
|
Attempt at fixing CSRG in CUDA 10.2. Not complete yet.
|
1 year ago |
sfilippone
|
1bc2a884e2
|
Adjust conditional compilation on CUDA version
|
1 year ago |
Salvatore Filippone
|
62db7c0449
|
Fix spsv with CSRG handling of descriptors.
|
1 year ago |
Salvatore Filippone
|
d28ea462d9
|
Modified CSRG to work with latest versions; cusparse docs are unclear
|
1 year ago |
sfilippone
|
6b65199afb
|
Check CUDA version for -dopt=on only from 11.7
|
1 year ago |
sfilippone
|
0230fbb7af
|
Identified problems with CSRG. Will fix in a branch
|
1 year ago |
sfilippone
|
41491f7b9c
|
Fix HAVE_CUDA in test programs
|
1 year ago |
sfilippone
|
b2b7b074df
|
Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared)
|
1 year ago |
sfilippone
|
e373ed7e0b
|
Modify configry to only use HAVE_CUDA, since SPGPU is recompiled.
|
1 year ago |
sfilippone
|
a6016f00fa
|
Bump PSBLAS version to 3.9
|
1 year ago |
sfilippone
|
ab8631439f
|
Update configure script
|
1 year ago |
sfilippone
|
6c9ca58282
|
Silly bug in coo insert
|
1 year ago |
sfilippone
|
d3b2b7816d
|
Fix coo insert OpenMP. Fix Make.inc.in
|
1 year ago |
sfilippone
|
655c86caed
|
Updated docs.
|
1 year ago |
sfilippone
|
9b713c177b
|
Fix cuda interfaces for renaming
|
1 year ago |
sfilippone
|
6fa0bf7fe7
|
Complete cuda renaming
|
1 year ago |
sfilippone
|
ae7fad95d4
|
Merge branch 'development' into non-diag
|
1 year ago |
sfilippone
|
a6ec655a97
|
Prepare merge
|
1 year ago |
sfilippone
|
a2788bdf0b
|
New version with ND product
|
1 year ago |
sfilippone
|
d718ef1e6d
|
Always allocate szs in psb_gather
|
1 year ago |
sfilippone
|
baf18cebd7
|
Further fix for gather.
|
1 year ago |
sfilippone
|
5caee551e5
|
Fixed IN_PLACE option for collectives.
|
1 year ago |
sfilippone
|
d82b090289
|
Fix makefile for psi_acx & friends
|
1 year ago |
Salvatore Filippone
|
25e9183e50
|
Fix SHFT implementation, step 2
|
1 year ago |
Salvatore Filippone
|
250a6300ba
|
Fix SHFT implementation
|
1 year ago |
Salvatore Filippone
|
0b184e4313
|
Merge branch 'shift' into development
|
1 year ago |
Salvatore Filippone
|
d3fcd566d9
|
Define a SHIFT argument to compute ILU( A+shft I)
|
1 year ago |
sfilippone
|
6aa7987d52
|
Rename GPU into cuda, and merge SPGPU code.
|
1 year ago |
sfilippone
|
2732336915
|
Fix gpu/makefile
|
1 year ago |
sfilippone
|
81e9121c91
|
Add GPULDLIBS into Make.inc (and fix configry)
|
1 year ago |