Commit Graph

2750 Commits (ccb4f73dcad95152ad6b479f8ae71af003febd27)
 

Author SHA1 Message Date
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 1 year ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 1 year ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 1 year ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 1 year ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 1 year ago
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 1 year ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 1 year ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 1 year ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 1 year ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 1 year ago
Salvatore Filippone 3121c43582 Silly bug in abgdxyz implementation 1 year ago
Salvatore Filippone 5c3d5f0235 Silly bug in abgdxyz implementation 1 year ago
Salvatore Filippone 29669b56a2 Implementation of psb_abgdxyz 1 year ago
Salvatore Filippone a942b47f7c Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 1 year ago
Salvatore Filippone 6c53b6ec79 Fix typo in interface for psb_abgdxyz 1 year ago
sfilippone 83ededd02b Implementatino of abgd_xyz 1 year ago
Salvatore Filippone 92a95699ba Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 1 year ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 1 year ago
sfilippone 45f00e6e19 Fixed comments 1 year ago
Salvatore Filippone 14c4ff0f32 Added new methd for two combined axpbys 1 year ago
Salvatore Filippone b49ce6b610 Merge branch 'repackage' into nond-rep 1 year ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 1 year ago
sfilippone 097d63147a Fix cuda dir makefile 1 year ago
sfilippone 3aa3c795e9 Refactor assembly and cnv 1 year ago
sfilippone 4d051c777d Fix makefile and test program 1 year ago
sfilippone 49e99a3e82 Fix conversion and product to enable overlap with GPU 1 year ago
sfilippone 74cf138a6c Merge branch 'repackage' into non-diag 1 year ago
sfilippone be7571f568 Fix missing directive 1 year ago
sfilippone e9d1238b43 Add detailed measurements. 1 year ago
Salvatore Filippone 20a01d4d71 Attempt at fixing CSRG in CUDA 10.2. Not complete yet. 1 year ago
sfilippone 1bc2a884e2 Adjust conditional compilation on CUDA version 1 year ago
Salvatore Filippone 62db7c0449 Fix spsv with CSRG handling of descriptors. 1 year ago
Salvatore Filippone d28ea462d9 Modified CSRG to work with latest versions; cusparse docs are unclear 1 year ago
sfilippone 6b65199afb Check CUDA version for -dopt=on only from 11.7 1 year ago
sfilippone 0230fbb7af Identufied problems with CSRG. Will fix in a branch 1 year ago
sfilippone 41491f7b9c Fix HAVE_CUDA in test programs 1 year ago
sfilippone b2b7b074df Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared) 1 year ago
sfilippone e373ed7e0b Modify configry to only use HAVE_CUDA, since SPGU is recompiled. 1 year ago
sfilippone a6016f00fa Bump PSBLAS version to 3.9 1 year ago
sfilippone ab8631439f Update configure script 1 year ago
sfilippone 6c9ca58282 Silly bug in coo insert 1 year ago
sfilippone 8633e76cb0 Silly bug in coo insert 1 year ago
sfilippone d3b2b7816d Fix coo insert OpenMP. Fix Make.inc.in 1 year ago
sfilippone 492b28f342 Fix wrong insert without OpenMP 1 year ago
sfilippone 655c86caed Updated docs. 1 year ago
sfilippone 9b713c177b Fix cuda interfaces for renaming 1 year ago
sfilippone 6fa0bf7fe7 Complete cuda renaming 1 year ago
sfilippone cce3103bb4 Fix CXXDEFINES 1 year ago
sfilippone a082cdb1b6 Deactivate OpenMP in hash_g2lv_ins for the time being. 1 year ago
sfilippone b850c0ef6a Fix typo 1 year ago