Commit Graph

2739 Commits (d01b8145c6f4de5b3beb4ebabc88c6ec69e52bfe)
 

Author SHA1 Message Date
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 11 months ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 11 months ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 11 months ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 11 months ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 11 months ago
Salvatore Filippone 3121c43582 Silly bug in abgdxyz implementation 11 months ago
Salvatore Filippone 5c3d5f0235 Silly bug in abgdxyz implementation 11 months ago
Salvatore Filippone 29669b56a2 Implementation of psb_abgdxyz 11 months ago
Salvatore Filippone a942b47f7c Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 11 months ago
Salvatore Filippone 6c53b6ec79 Fix typo in interface for psb_abgdxyz 11 months ago
sfilippone 83ededd02b Implementatino of abgd_xyz 11 months ago
Salvatore Filippone 92a95699ba Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 11 months ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 11 months ago
sfilippone 45f00e6e19 Fixed comments 11 months ago
Salvatore Filippone 14c4ff0f32 Added new methd for two combined axpbys 11 months ago
Salvatore Filippone b49ce6b610 Merge branch 'repackage' into nond-rep 11 months ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 11 months ago
sfilippone 097d63147a Fix cuda dir makefile 11 months ago
sfilippone 3aa3c795e9 Refactor assembly and cnv 1 year ago
sfilippone 4d051c777d Fix makefile and test program 1 year ago
sfilippone 49e99a3e82 Fix conversion and product to enable overlap with GPU 1 year ago
sfilippone 74cf138a6c Merge branch 'repackage' into non-diag 1 year ago
sfilippone be7571f568 Fix missing directive 1 year ago
sfilippone e9d1238b43 Add detailed measurements. 1 year ago
Salvatore Filippone 20a01d4d71 Attempt at fixing CSRG in CUDA 10.2. Not complete yet. 1 year ago
sfilippone 1bc2a884e2 Adjust conditional compilation on CUDA version 1 year ago
Salvatore Filippone 62db7c0449 Fix spsv with CSRG handling of descriptors. 1 year ago
Salvatore Filippone d28ea462d9 Modified CSRG to work with latest versions; cusparse docs are unclear 1 year ago
sfilippone 6b65199afb Check CUDA version for -dopt=on only from 11.7 1 year ago
sfilippone 0230fbb7af Identufied problems with CSRG. Will fix in a branch 1 year ago
sfilippone 41491f7b9c Fix HAVE_CUDA in test programs 1 year ago
sfilippone b2b7b074df Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared) 1 year ago
sfilippone e373ed7e0b Modify configry to only use HAVE_CUDA, since SPGU is recompiled. 1 year ago
sfilippone a6016f00fa Bump PSBLAS version to 3.9 1 year ago
sfilippone ab8631439f Update configure script 1 year ago
sfilippone 6c9ca58282 Silly bug in coo insert 1 year ago
sfilippone d3b2b7816d Fix coo insert OpenMP. Fix Make.inc.in 1 year ago
sfilippone 655c86caed Updated docs. 1 year ago
sfilippone 9b713c177b Fix cuda interfaces for renaming 1 year ago
sfilippone 6fa0bf7fe7 Complete cuda renaming 1 year ago
sfilippone ae7fad95d4 Merge branch 'development' into non-diag 1 year ago
sfilippone a6ec655a97 Prepare merge 1 year ago
sfilippone a2788bdf0b New version with ND product 1 year ago
sfilippone d718ef1e6d Always allocate szs in psb_gather 1 year ago
sfilippone baf18cebd7 Further fix for gather. 1 year ago
sfilippone 5caee551e5 Fixed IN_PLACE option for collectives. 1 year ago
sfilippone d82b090289 Fix makefile for psi_acx & friends 1 year ago
Salvatore Filippone 25e9183e50 Fix SHFT implementation, step 2 1 year ago
Salvatore Filippone 250a6300ba Fix SHFT implementation 1 year ago
Salvatore Filippone 0b184e4313 Merge branch 'shift' into development 1 year ago