Commit Graph

1458 Commits (fc65ca4ea0d9366e550c368cc634f2b0f14b4cd4)

Author SHA1 Message Date
sfilippone 2982aaee27 Implementation in OpenACC for ELL and HLL into templates. Merge from development 2 years ago
sfilippone 4461b44eda Change name abgdxyz into upd_xyz 2 years ago
sfilippone 39cfcd3893 Fix allocation in coo_impl 2 years ago
sfilippone a38867be25 Fix allocation in coo_impl 2 years ago
sfilippone b9ad357648 Improve temp memory allocation in fix_coo 2 years ago
Salvatore Filippone 42293c62b6 Fix usage of sync() 2 years ago
Salvatore Filippone a177e94ba5 Fix comments 2 years ago
sfilippone 12a4c21fed Fixes for OpenMP compilation in map_mod 2 years ago
sfilippone e19284eb6c Small omp addition 2 years ago
sfilippone 35096a2ef9 Cosmetic changes to coo_impl 2 years ago
sfilippone 70f51b9da8 Improve handling of fix_coo buffers with OpenMP 2 years ago
sfilippone ecccb13914 Fix COO fix_coo_inner_rowmajor not to overflow on integers. 2 years ago
sfilippone a613e963db First step in fix for coo_impl on OpenMP 2 years ago
sfilippone d8ed01218d Cleanup hash_map using new indx_map%set_lc 2 years ago
sfilippone 7ec394ce1c Rename indx_map_mod and put SET_LR/C under ifdef 2 years ago
sfilippone 7dc64692cc Fix for OpenMP runs in hash_map_mod 2 years ago
Salvatore Filippone e711c53fab Make sure we compile when LPK /= IPK 2 years ago
Salvatore Filippone b5a32a59f9 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 2 years ago
Salvatore Filippone 773b79e7bc OpenMP in repl_map 2 years ago
Salvatore Filippone 98a9005602 Further advances on OpenMP versions of various index maps. 2 years ago
Salvatore Filippone fa86c91411 Fix OpenMP version of hash_map and hash 2 years ago
Salvatore Filippone 188dee6842 Add indx_map%inc_lc() method 2 years ago
sfilippone b99aa7a90f Switch off OMP in HASH g2l_ins 2 years ago
sfilippone 4e0a9e5db8 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage 2 years ago
sfilippone e72c0f0bf9 Fix OMP impl of sparse-sparse product 2 years ago
Salvatore Filippone d444a12879 Condition call to x%sync() in vect_mv 2 years ago
Salvatore Filippone 5e2e1e34fd Introduce set_host() in inner_vect_sv 2 years ago
sfilippone 025350a361 Make sure realloc is always called with size >0 2 years ago
sfilippone ba8c32c507 Define merge_nd method 2 years ago
sfilippone aca1848401 New timings in CG 2 years ago
sfilippone a2f92e616f Put VOLATILE under ifdef for FLANG 2 years ago
sfilippone 0023b8ac78 Compile adjcncy_fnd_owner 2 years ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 2 years ago
sfilippone 373d841bce Don't need renaming of psi_gth and psi_sct 2 years ago
sfilippone 472f16f0df Fix compilation with --enable-serial 2 years ago
sfilippone 86be8ebcd0 New method W%XYZW() 2 years ago
Salvatore Filippone f4c7604f61 Fix base implementation of abgdxyz to call set_host 2 years ago
Salvatore Filippone b8f9badf95 Fix interface between vect and base_vect%ABGD 2 years ago
Salvatore Filippone 2a40b82b58 Fix typo in base_vect_mod 2 years ago
Salvatore Filippone 4e611bb078 Enable psi_abgdxyz 2 years ago
Salvatore Filippone 9ced67634d Fix KIND for NR in axpby 2 years ago
Salvatore Filippone 3121c43582 Silly bug in abgdxyz implementation 2 years ago
Salvatore Filippone 5c3d5f0235 Silly bug in abgdxyz implementation 2 years ago
Salvatore Filippone 29669b56a2 Implementation of psb_abgdxyz 2 years ago
Salvatore Filippone a942b47f7c Merge branch 'nond-rep' of github.com:sfilippone/psblas3 into nond-rep 2 years ago
Salvatore Filippone 6c53b6ec79 Fix typo in interface for psb_abgdxyz 2 years ago
sfilippone 83ededd02b Implementation of abgd_xyz 2 years ago
sfilippone 45f00e6e19 Fixed comments 2 years ago
Salvatore Filippone 14c4ff0f32 Added new method for two combined axpbys 2 years ago
sfilippone 3aa3c795e9 Refactor assembly and cnv 2 years ago
sfilippone 49e99a3e82 Fix conversion and product to enable overlap with GPU 2 years ago
sfilippone 74cf138a6c Merge branch 'repackage' into non-diag 2 years ago
sfilippone be7571f568 Fix missing directive 2 years ago
sfilippone e9d1238b43 Add detailed measurements. 2 years ago
sfilippone a6016f00fa Bump PSBLAS version to 3.9 3 years ago
sfilippone 6c9ca58282 Silly bug in coo insert 3 years ago
sfilippone d3b2b7816d Fix coo insert OpenMP. Fix Make.inc.in 3 years ago
sfilippone ae7fad95d4 Merge branch 'development' into non-diag 3 years ago
sfilippone a6ec655a97 Prepare merge 3 years ago
sfilippone a2788bdf0b New version with ND product 3 years ago
sfilippone d718ef1e6d Always allocate szs in psb_gather 3 years ago
sfilippone baf18cebd7 Further fix for gather. 3 years ago
sfilippone 5caee551e5 Fixed IN_PLACE option for collectives. 3 years ago
sfilippone d82b090289 Fix makefile for psi_acx & friends 3 years ago
sfilippone e31dd52c41 Fixed CRITICAL in hash_mod 3 years ago
sfilippone def0635c53 More OMP directives in cd_inloc 3 years ago
sfilippone 41be1357c3 Set defaults for SPSPMM depending on OpenMP compilation. 3 years ago
Salvatore Filippone 0d8a5d3dc2 New SPSPMM implementation 3 years ago
Salvatore Filippone d0cacda995 Moved various modules related to RB around, into auxil, update Makefile. 3 years ago
Salvatore Filippone 7b45994b70 Setter/getter for SPSPMM algorithm in base_mat_mod 3 years ago
wlthr 2322a9ce61 using end_idx to copy data from threads in gustavson and gustavson_1d 3 years ago
wlthr 0185b79b2a added setter for d_csr_spspmm implementation 3 years ago
wlthr 0fe95c3c76 added use statement 3 years ago
wlthr 979a3da95f merged dev-openmp into omp-walther 3 years ago
wlthr 1af76c067c added parallel double precision spspmm implementations 3 years ago
sfilippone f001ebbad3 Final fix for COO on OMP 3 years ago
sfilippone 26bf4c5d69 Fixed COO csput for OMP/not OMP 3 years ago
sfilippone 3aa748b0e3 Finish dual OMP/notOMP g2lv1_ins 3 years ago
sfilippone 08c1ab0cd1 Fix tril/triu in COO for non-OMP paths. 3 years ago
sfilippone ca82520b88 Reworked CSR TRIL/TRIU for OpenMP 3 years ago
sfilippone 5e691d5bff Some improvements for openmp vector updates 3 years ago
sfilippone bb9f213551 Define and implement OMP version of TRIL/TRIU 3 years ago
sfilippone 2f403e0df7 Rework cp_{from|to}_fmt for better OpenMP performance 3 years ago
sfilippone d378266f33 Fix syntax error 3 years ago
sfilippone a66778f270 Improve coo and merge development 3 years ago
sfilippone 347352fe1e Make spins work in OpenMP from either par or serial 3 years ago
sfilippone db0e4db507 Minimize debug statements in hash_ins 3 years ago
sfilippone 1941affe7a Exposed error in AMG test when not parallelizing generation loop 3 years ago
sfilippone 494e29dd2e Cosmetic adjustments to COO and BSRCH 3 years ago
sfilippone 739dc78a75 Merge branch 'development' into omp-threadsafe 3 years ago
sfilippone 7e5dc20e03 Define new options for BSRCH, clean interface 3 years ago
sfilippone 40cc78854a Improve implementation of fix_coo using exscan 3 years ago
sfilippone 91d3e66547 Merge branch 'omp-threadsafe' of github.com:sfilippone/psblas3 into omp-threadsafe 3 years ago
sfilippone 74a8217520 Fixed silly bug in EXSCAN and usage in CSR_IMPL 3 years ago
Salvatore Filippone 5bc02fb2e6 Take out redundant statements in SPINS 3 years ago
sfilippone f3efea0a89 Take out IBASE from exscan, makes no sense. 3 years ago
sfilippone 05b684ddbb Updated use of exscan in CSC 3 years ago
sfilippone 9c248a31e2 Refactored EXSCAN and its OpenMP usage. 3 years ago
sfilippone 02dd204351 Implement psi_exscan and use in _from_coo 3 years ago
sfilippone dbd55321f8 Fixed CSR mv and cp _from_coo with OpenMP. 3 years ago