Commit Graph

2671 Commits (3aa3c795e98fc2a8a4dc9f8ee3aa36e74f13fef8)
 

Author SHA1 Message Date
sfilippone 3aa3c795e9 Refactor assembly and cnv
sfilippone 4d051c777d Fix makefile and test program
sfilippone 49e99a3e82 Fix conversion and product to enable overlap with GPU
sfilippone 74cf138a6c Merge branch 'repackage' into non-diag
sfilippone be7571f568 Fix missing directive
sfilippone e9d1238b43 Add detailed measurements.
Salvatore Filippone 20a01d4d71 Attempt at fixing CSRG in CUDA 10.2. Not complete yet.
sfilippone 1bc2a884e2 Adjust conditional compilation on CUDA version
Salvatore Filippone 62db7c0449 Fix spsv with CSRG handling of descriptors.
Salvatore Filippone d28ea462d9 Modified CSRG to work with latest versions; cusparse docs are unclear
sfilippone 6b65199afb Check CUDA version for -dopt=on only from 11.7
sfilippone 0230fbb7af Identufied problems with CSRG. Will fix in a branch
sfilippone 41491f7b9c Fix HAVE_CUDA in test programs
sfilippone b2b7b074df Fix usage of HAVE_CUDA/HAVE_GPU (mostly disappeared)
sfilippone e373ed7e0b Modify configry to only use HAVE_CUDA, since SPGU is recompiled.
sfilippone a6016f00fa Bump PSBLAS version to 3.9
sfilippone ab8631439f Update configure script
sfilippone 6c9ca58282 Silly bug in coo insert
sfilippone d3b2b7816d Fix coo insert OpenMP. Fix Make.inc.in
sfilippone 655c86caed Updated docs.
sfilippone 9b713c177b Fix cuda interfaces for renaming
sfilippone 6fa0bf7fe7 Complete cuda renaming
sfilippone ae7fad95d4 Merge branch 'development' into non-diag
sfilippone a6ec655a97 Prepare merge
sfilippone a2788bdf0b New version with ND product
sfilippone d718ef1e6d Always allocate szs in psb_gather
sfilippone baf18cebd7 Further fix for gather.
sfilippone 5caee551e5 Fixed IN_PLACE option for collectives.
sfilippone d82b090289 Fix makefile for psi_acx & friends
Salvatore Filippone 25e9183e50 Fix SHFT implementation, step 2
Salvatore Filippone 250a6300ba Fix SHFT implementation
Salvatore Filippone 0b184e4313 Merge branch 'shift' into development
Salvatore Filippone d3fcd566d9 Define a SHIFT argument to compute ILU( A+shft I)
sfilippone 6aa7987d52 Rename GPU into cuda, and merge SPGPU code.
sfilippone 2732336915 Fix gpu/makefile
sfilippone 81e9121c91 Add GPULDLIBS into Make.inc (and fix configry)
sfilippone d0d4e45877 Fix for I gpu types from template: use psb_sizeof_ip
sfilippone 3a9116bc42 Merge psblas-ext into psblas step 2: GPU formats and configry.
sfilippone 1d5faa388d Merge psblas-ext into psblas, step 1: ext storage formats.
sfilippone d1bf46b0b1 Merge branch 'dev-openmp' into development
sfilippone e31dd52c41 Fixed CRITICAL in hash_mod
sfilippone def0635c53 More OMP directives in cd_inloc
sfilippone efa29bc3ad Fix configure for GCC when using --with-fcopt
sfilippone 3e29e603d2 Fix necessary flags for GCC
sfilippone 41be1357c3 Set defaults for SPSPMM depending on OpenMP compilation.
Salvatore Filippone 0d8a5d3dc2 New SPSPMM implementation
Salvatore Filippone d0cacda995 Moved various modules related to RB around, into auxil, update Makefile.
Salvatore Filippone 7b45994b70 Setter/getter for SPSPMM algorithm in base_mat_mod
wlthr 2322a9ce61 using end_idx to copy data from threads in gustavson and gustavson_1d
wlthr 0185b79b2a added setter for d_csr_spspmm implementation