-
4e71fa971c
Modify clean_zeros and have it only at base%
repackage
sfilippone
2025-01-08 17:18:54 +0100
-
66a9d5e50a
Interface psb_sizeof_X size computation
Salvatore Filippone
2025-01-04 09:44:58 +0100
-
c83137cf78
Constants psb_sizeof_X should be 8 bytes
Salvatore Filippone
2025-01-04 09:40:30 +0100
-
2af0d56938
Malloc and transfers with CUDA should use (size_t) casts
sfilippone
2025-01-02 11:49:08 +0100
-
20958f654a
Make CUDAKERN tests lighter
sfilippone
2025-01-02 11:46:51 +0100
-
488f5dfd86
CMAKE should look for util/psb_metis_int.c
cmake
sfilippone
2025-01-01 12:32:12 +0100
-
7669f2ee26
Hot fix
Luca Pepè Sciarria
2024-12-12 10:51:06 +0100
-
ef71a32484
Merge branch 'cmake2' into cmake
Luca Pepè Sciarria
2024-12-12 10:11:45 +0100
-
-
61976812be
Add cmake building for base, cbind, ext, linsolve, prec, util
Luca Pepè Sciarria
2024-12-12 10:08:47 +0100
-
-
-
5b81cbac12
Add CMake building for base, prec, ext, util, krylov, cbind
Luca Pepè Sciarria
2024-12-09 15:09:25 +0100
-
f904764d0b
Merge branch 'openacc' of github.com:sfilippone/psblas3 into openacc
openacc
sfilippone
2024-12-03 10:54:55 +0100
-
-
288805671f
New test in openacc
sfilippone
2024-12-03 10:54:33 +0100
-
63fb828528
Fix Makefile for I2
Salvatore Filippone
2024-12-03 10:36:52 +0100
-
-
-
61456cd42a
Fix dependencies for I2
Salvatore Filippone
2024-12-03 10:34:17 +0100
-
-
-
fa6b8d5e33
Enable I2 send/receive/collectives
sfilippone
2024-11-25 12:35:39 +0100
-
7e73900703
Fix clean_zeros' description
sfilippone
2024-11-19 12:16:46 +0100
-
999830f225
Update description of clean_zeros
sfilippone
2024-11-19 12:11:26 +0100
-
283e849c94
Cleanup use of RICHARDSON
sfilippone
2024-11-18 16:24:01 +0100
-
5936c05eb4
Fix UG
sfilippone
2024-11-18 15:58:53 +0100
-
931000cc68
Update README.md
Fabio Durastante
2024-11-18 10:15:20 +0100
-
f825bf37a1
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
sfilippone
2024-11-17 15:12:15 +0100
-
-
8907a7600a
Fix krylov-> linsolve in test/openacc
sfilippone
2024-11-17 15:10:39 +0100
-
7187575915
Doc fixes
Salvatore Filippone
2024-11-16 16:20:55 +0100
-
9c4f4c4d15
Doc fixes
Salvatore Filippone
2024-11-16 12:03:28 +0100
-
eefc67bbdd
Docs and krylov->linsolve changes
Salvatore Filippone
2024-11-16 10:50:31 +0100
-
-
ef1cc4b321
Update CUDA description in README
sfilippone
2024-11-15 17:48:11 +0100
-
6743913690
Update README Fortran 2003 to 2008
sfilippone
2024-11-15 17:28:48 +0100
-
f35a9ec13c
Update README.md
Fabio Durastante
2024-11-15 16:42:43 +0100
-
961a315936
Update README.md
Fabio Durastante
2024-11-15 16:41:05 +0100
-
d7cb13a283
Final fix for log_conv in rgmres
sfilippone
2024-11-13 18:12:34 +0100
-
e2605d5e48
Final fix for log_conv in rgmres
sfilippone
2024-11-13 17:47:49 +0100
-
0ed2c105ac
Fix RGMRES log_conv
sfilippone
2024-11-13 16:16:07 +0100
-
29f72195ef
Fix GMRES convergence logs
sfilippone
2024-11-13 14:08:13 +0100
-
ecc59b8a4b
Fix test makefile for linsolve
sfilippone
2024-11-13 13:21:37 +0100
-
a02440afff
Update docs for linsolve
repack-newsolve
sfilippone
2024-11-11 17:48:49 +0100
-
4f4006cf6b
Configure fixes
sfilippone
2024-11-10 17:30:51 +0100
-
e9aa9a5237
Restructuring linsolve.
sfilippone
2024-11-10 10:59:28 +0100
-
98d5db7377
Krylov into linsolve, step 4
sfilippone
2024-11-10 10:47:05 +0100
-
14dce3eefd
Krylov into linsolve, step 3.
sfilippone
2024-11-10 10:27:31 +0100
-
ea8c526bf2
Rename krylov into linsolve step 2.
sfilippone
2024-11-10 10:12:31 +0100
-
ceac2faad0
Rename krylov into linsolve where needed, step 1.
sfilippone
2024-11-10 10:08:50 +0100
-
029903dbad
New Richardson method.
sfilippone
2024-11-08 18:22:43 +0100
-
f1d21b1c95
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
Salvatore Filippone
2024-11-02 12:29:08 +0100
-
-
f10c6c1822
Fix GEPRT
Salvatore Filippone
2024-11-02 12:28:24 +0100
-
5430ba0e22
Fix multivect constructor in CUDA
Salvatore Filippone
2024-11-02 12:28:08 +0100
-
ade79bcc7e
Fixes for compilation with CUDA
sfilippone
2024-10-09 14:27:17 +0200
-
abdf7fc05a
Fix constructor name for multivector
sfilippone
2024-10-09 13:32:29 +0200
-
-
6972c50542
Updated readme
sfilippone
2024-10-09 10:03:37 +0200
-
744f14d2f5
Merge branch 'oacc_loloum' into repackage
sfilippone
2024-10-08 17:43:40 +0200
-
-
5903c0b272
Fix DOT in OpenACC
oacc_loloum
sfilippone
2024-10-08 17:41:42 +0200
-
4822861b73
Merge branch 'oacc_loloum' of github.com:sfilippone/psblas3 into oacc_loloum
sfilippone
2024-10-08 17:37:28 +0200
-
-
49469ce021
Various changes into openacc
sfilippone
2024-10-08 17:36:44 +0200
-
3d9fee2dd7
Fix DOT on CUDA vectors.
sfilippone
2024-10-08 17:07:10 +0200
-
949499265e
Simplify clean_zeros
sfilippone
2024-10-08 11:48:48 +0200
-
c74be820ea
Rework configry for CUDA
sfilippone
2024-10-08 11:48:15 +0200
-
ee56c6be3c
Cosmetic changes to OpenACC vectors
sfilippone
2024-10-08 11:47:37 +0200
-
740609a4d8
Fix present() clauses
sfilippone
2024-10-07 12:45:18 +0200
-
68f20c0e7a
Modify init
sfilippone
2024-10-07 12:44:45 +0200
-
-
9601a837f5
Define --enable-cuda --with-cudadir for CUDA configry
sfilippone
2024-10-04 11:18:39 +0200
-
1c235f9281
Improve clean_zeros
sfilippone
2024-10-04 10:28:52 +0200
-
108d544fc1
Fix clean_zeros to always keep the diagonal
sfilippone
2024-10-04 08:37:45 +0200
-
8ab5cef448
OpenACC environment fixes
sfilippone
2024-10-04 08:37:22 +0200
-
174a8e7aef
Merge branch 'oacc_loloum' into repackage
sfilippone
2024-09-13 11:05:28 +0200
-
-
a8dcba2964
Merge branch 'development' into repackage
sfilippone
2024-09-12 19:09:01 +0200
-
-
-
-
736f3cc629
Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage
sfilippone
2024-09-12 19:08:53 +0200
-
-
e5504ddddc
Fix memory traffic in GTH/SCT
sfilippone
2024-09-12 15:08:21 +0200
-
096bce08c1
Merged changes from V4 OpenACC
sfilippone
2024-09-11 12:43:46 +0200
-
bcbe0c89c7
Backporting fixes from version 4
sfilippone
2024-09-10 17:37:35 +0200
-
6236f3489c
Remove obsolete files
sfilippone
2024-08-30 16:05:08 +0200
-
f783478df3
Merge updates from V4
sfilippone
2024-08-30 16:03:04 +0200
-
fbb974fb8b
Change name sync|free space, unify allocate impl
sfilippone
2024-08-30 10:20:58 +0200
-
479135c62d
Merge some changes from V4
sfilippone
2024-08-29 16:36:24 +0200
-
95c546aadd
Fix OpenACC version of ELL vect_mv
sfilippone
2024-08-26 08:22:35 +0200
-
fa5e7ff945
Fixes for vector methods and sync()
sfilippone
2024-08-20 19:38:34 +0200
-
9314b2cf53
Fix missing method in oacc_ell
sfilippone
2024-08-08 15:08:45 +0200
-
8220140729
Merged recent changes from development
sfilippone
2024-08-08 14:59:09 +0200
-
cf2cc6cab9
Precedence of oacc_vect modules
sfilippone
2024-08-08 14:34:28 +0200
-
3168b7e8f7
Merge branch 'development' into oacc_loloum
sfilippone
2024-08-08 14:24:21 +0200
-
-
-
-
7857015923
Cosmetic changes to vect_mod
development
sfilippone
2024-08-08 14:23:03 +0200
-
2709aa9f16
Fix upd_xyz name
sfilippone
2024-08-08 11:10:04 +0200
-
2982aaee27
Implementation in OpenACC for ELL and HLL into templates. Merge from development
sfilippone
2024-08-08 11:07:28 +0200
-
ff8513b4c6
Ignore .smod files in git
sfilippone
2024-08-07 17:04:29 +0200
-
03aaa090db
New AX_OPENACC macro and supporting flags
sfilippone
2024-08-07 17:04:04 +0200
-
464baceb13
HLL loop nest cannot run with collapse
sfilippone
2024-08-07 10:54:19 +0200
-
a28d3b048b
Fix configure for OpenACC (added warning message)
sfilippone
2024-08-07 10:30:11 +0200
-
9c244964db
Merge branch 'development' into oacc_merge merge development into oacc
tloloum
2024-08-06 11:44:12 +0200
-
-
-
-
55d1067ec2
collapse loop
tloloum
2024-08-06 11:42:52 +0200
-
e6fa1d17a2
oacc hll
tloloum
2024-07-30 14:25:01 +0200
-
4461b44eda
Change name abgdxyz into upd_xyz
sfilippone
2024-07-29 16:59:27 +0200
-
10ec5eafab
ELL oacc impl
tloloum
2024-07-26 09:56:32 +0200
-
162b2fc78f
beginning of oacc_ell
tloloum
2024-07-22 15:27:33 +0200
-
08a9744413
moving test
tloloum
2024-07-19 13:01:07 +0200
-
b5a8c549dd
psb_d_oacc_pde3d draft
tloloum
2024-07-19 11:35:11 +0200
-
e8491380e2
Take out obsolete test targets from makefile
sfilippone
2024-07-19 09:40:14 +0200
-
9e18545151
Fix typos
sfilippone
2024-07-17 13:04:07 +0200
-
686bac4224
Account for S/D/C/Z variants
sfilippone
2024-07-17 13:03:46 +0200
-
b6fe0f3344
New version for modules and methods
sfilippone
2024-07-17 08:52:43 +0200
-
db558cace3
Merge branch 'development' of github.com:sfilippone/psblas3 into development
Salvatore Filippone
2024-07-16 13:53:34 +0200
-
-
ab38a91d10
Fix metis interfacing
Salvatore Filippone
2024-07-16 13:53:00 +0200
-
de27c8f616
Merge branch 'repackage' into development
sfilippone
2024-07-16 13:23:16 +0200
-
-
-