Commit Graph

  • ade79bcc7e Fixes for compilation with CUDA repackage sfilippone 2024-10-09 14:27:17 +0200
  • abdf7fc05a Fix constructor name for multivector sfilippone 2024-10-09 13:32:29 +0200
  • 6972c50542 Updated readme sfilippone 2024-10-09 10:03:37 +0200
  • 744f14d2f5 Merge branch 'oacc_loloum' into repackage sfilippone 2024-10-08 17:43:40 +0200
  • 5903c0b272 Fix DOT in OpenACC oacc_loloum sfilippone 2024-10-08 17:41:42 +0200
  • 4822861b73 Merge branch 'oacc_loloum' of github.com:sfilippone/psblas3 into oacc_loloum sfilippone 2024-10-08 17:37:28 +0200
  • 49469ce021 Various changes into openacc sfilippone 2024-10-08 17:36:44 +0200
  • 3d9fee2dd7 Fix DOT on CUDA vectors. sfilippone 2024-10-08 17:07:10 +0200
  • 949499265e Simplify clean_zeros sfilippone 2024-10-08 11:48:48 +0200
  • c74be820ea Rework configry for CUDA sfilippone 2024-10-08 11:48:15 +0200
  • ee56c6be3c Cosmetic changes to OpenACC vectors sfilippone 2024-10-08 11:47:37 +0200
  • 740609a4d8 Fix present() clauses sfilippone 2024-10-07 12:45:18 +0200
  • 68f20c0e7a Modify init sfilippone 2024-10-07 12:44:45 +0200
  • 9601a837f5 Define --enable-cuda --with-cudadir for CUDA configry sfilippone 2024-10-04 11:18:39 +0200
  • 1c235f9281 Improve clean_zeros sfilippone 2024-10-04 10:28:52 +0200
  • 108d544fc1 Fix clean_zeros to always keep the diagonal sfilippone 2024-10-04 08:37:45 +0200
  • 8ab5cef448 OpenACC environment fixes sfilippone 2024-10-04 08:37:22 +0200
  • 174a8e7aef Merge branch 'oacc_loloum' into repackage sfilippone 2024-09-13 11:05:28 +0200
  • a8dcba2964 Merge branch 'development' into repackage sfilippone 2024-09-12 19:09:01 +0200
  • 736f3cc629 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-09-12 19:08:53 +0200
  • e5504ddddc Fix memory traffic in GTH/SCT sfilippone 2024-09-12 15:08:21 +0200
  • 096bce08c1 Merged changes from V4 OpenACC sfilippone 2024-09-11 12:43:46 +0200
  • bcbe0c89c7 Backporting fixes from version 4 sfilippone 2024-09-10 17:37:35 +0200
  • 6236f3489c Remove obsolete files sfilippone 2024-08-30 16:05:08 +0200
  • f783478df3 Merge updates from V4 sfilippone 2024-08-30 16:03:04 +0200
  • fbb974fb8b Change name sync|free space, unify allocate impl sfilippone 2024-08-30 10:20:58 +0200
  • 479135c62d Merge some changes from V4 sfilippone 2024-08-29 16:36:24 +0200
  • 95c546aadd Fix OpenACC version of ELL vect_mv sfilippone 2024-08-26 08:22:35 +0200
  • fa5e7ff945 Fixes for vector methods and sync() sfilippone 2024-08-20 19:38:34 +0200
  • 9314b2cf53 Fix missing method in oacc_ell sfilippone 2024-08-08 15:08:45 +0200
  • 8220140729 Merged recent changes from development sfilippone 2024-08-08 14:59:09 +0200
  • cf2cc6cab9 Precedence of oacc_vect modules sfilippone 2024-08-08 14:34:28 +0200
  • 3168b7e8f7 Merge branch 'development' into oacc_loloum sfilippone 2024-08-08 14:24:21 +0200
  • 7857015923 Cosmetic changes to vect_mod development sfilippone 2024-08-08 14:23:03 +0200
  • 2709aa9f16 Fix upd_xyz name sfilippone 2024-08-08 11:10:04 +0200
  • 2982aaee27 Implementation in OpenACC for ELL and HLL into templates. Merge from development sfilippone 2024-08-08 11:07:28 +0200
  • ff8513b4c6 Ignore .smod files in git sfilippone 2024-08-07 17:04:29 +0200
  • 03aaa090db New AX_OPENACC macro and supporting flags sfilippone 2024-08-07 17:04:04 +0200
  • 464baceb13 HLL loop nest cannot run with collapse sfilippone 2024-08-07 10:54:19 +0200
  • a28d3b048b Fix configure for OpenACC (added warning message) sfilippone 2024-08-07 10:30:11 +0200
  • 9c244964db Merge branch 'development' into oacc_merge merge development into oacc tloloum 2024-08-06 11:44:12 +0200
  • 55d1067ec2 collapse loop tloloum 2024-08-06 11:42:52 +0200
  • e6fa1d17a2 oacc hll tloloum 2024-07-30 14:25:01 +0200
  • 4461b44eda Change name abgdxyz into upd_xyz sfilippone 2024-07-29 16:59:27 +0200
  • 10ec5eafab ELL oacc impl tloloum 2024-07-26 09:56:32 +0200
  • 162b2fc78f beginning of oacc_ell tloloum 2024-07-22 15:27:33 +0200
  • 08a9744413 moving test tloloum 2024-07-19 13:01:07 +0200
  • b5a8c549dd psb_d_oacc_pde3d draft tloloum 2024-07-19 11:35:11 +0200
  • e8491380e2 Take out obsolete test targets from makefile sfilippone 2024-07-19 09:40:14 +0200
  • 9e18545151 Fix typos sfilippone 2024-07-17 13:04:07 +0200
  • 686bac4224 Account for S/D/C/Z variants sfilippone 2024-07-17 13:03:46 +0200
  • b6fe0f3344 New version for modules and methods sfilippone 2024-07-17 08:52:43 +0200
  • db558cace3 Merge branch 'development' of github.com:sfilippone/psblas3 into development Salvatore Filippone 2024-07-16 13:53:34 +0200
  • ab38a91d10 Fix metis interfacing Salvatore Filippone 2024-07-16 13:53:00 +0200
  • de27c8f616 Merge branch 'repackage' into development sfilippone 2024-07-16 13:23:16 +0200
  • 08a69985c8 Take out unneeded file sfilippone 2024-07-16 13:22:47 +0200
  • 13a402031e Fixed docs. Salvatore Filippone 2024-07-16 13:20:36 +0200
  • 497cd31018 Fix configure sfilippone 2024-07-16 13:14:02 +0200
  • 7006665d82 Introduce submodules, adjust Makefile sfilippone 2024-07-12 16:34:48 +0200
  • 0707cc0a72 Take out reference to elldev_mod sfilippone 2024-07-12 13:01:32 +0200
  • 9e8294066d Fix Makefile sfilippone 2024-07-12 13:01:22 +0200
  • 7c256df451 Merge branch 'oacc_loloum' of https://github.com/sfilippone/psblas3 into oacc_loloum tloloum 2024-07-12 09:28:35 +0200
  • 2b5f09ddf9 all methods implementations for psb_d_oacc_csr_sparse_mat tloloum 2024-07-12 09:26:06 +0200
  • 99a334d93d Merge branch 'repackage' into oacc_loloum sfilippone 2024-07-11 14:13:14 +0200
  • e9285c7aad Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-07-11 13:12:37 +0200
  • 1911fec97b Update docs Salvatore Filippone 2024-07-11 13:12:17 +0200
  • 0aa5c9409b Remove spurious pdf file. sfilippone 2024-07-11 12:12:42 +0200
  • e9147c089e Update docs sfilippone 2024-07-11 12:11:07 +0200
  • 681ea2fff7 Updated docs sfilippone 2024-07-11 12:10:14 +0200
  • c7bbfb8b68 new Makefile, test compile well + data vect tloloum 2024-07-10 10:06:27 +0200
  • a81d1d9b68 Now oacc_vect compiles cleanly sfilippone 2024-07-09 13:45:16 +0200
  • 3bb05fab54 Merge branch 'oacc_loloum' of github.com:sfilippone/psblas3 into oacc_loloum sfilippone 2024-07-09 13:18:34 +0200
  • 1f0e591827 Reworked oacc_vect_mod. oacc_mlt_v_X generates ICE, to be fixed sfilippone 2024-07-09 13:16:36 +0200
  • 6be998ac66 oacc_env_mod tloloum 2024-07-09 11:22:28 +0200
  • e0d7091ecc Makefile in openacc/impl sfilippone 2024-07-09 10:04:32 +0200
  • 93c9df0277 Adjust Makefiles & source sfilippone 2024-07-09 10:03:57 +0200
  • e3a3e39caf Modify configure --enable-openacc --with-fcopenacc=..... sfilippone 2024-07-09 10:03:30 +0200
  • 40e40e69f5 Merge branch 'repackage' into development sfilippone 2024-07-09 09:39:48 +0200
  • 2b8671fba6 oacc : d_vect + i_vect + d_csr Théophane Loloum 2024-07-08 10:14:42 +0200
  • d25a746067 Some bugs fixed psblas-bgmres gabrielequatrana 2024-07-07 23:27:00 +0200
  • b22cba4413 Added distributed MGS QR working on GPU gabrielequatrana 2024-07-06 15:14:40 +0200
  • 3f22276aff Added Givens rotations to compute Y one time gabrielequatrana 2024-06-24 19:05:14 +0200
  • 9f2b8a2623 Cleanup sfilippone 2024-06-24 08:18:07 +0200
  • e3a55967a5 Modify CUDA code to compile with 12.4/12.5 sfilippone 2024-06-23 16:03:10 +0200
  • f7d44a70d6 Multivector Product GPU gabrielequatrana 2024-06-13 18:49:03 +0200
  • 34a2a7ddbc Update dpdegenmm.F90 gabrielequatrana 2024-05-31 13:14:27 +0200
  • 39cfcd3893 Fix allocation in coo_impl sfilippone 2024-05-29 16:54:10 +0200
  • a38867be25 Fix allocation in coo_impl sfilippone 2024-05-29 16:51:58 +0200
  • b9ad357648 Improve temp memory allocation in fix_coo sfilippone 2024-05-29 16:25:49 +0200
  • cfa0a785c5 Fixed multiple bugs gabrielequatrana 2024-05-28 18:38:19 +0200
  • fe87ca52e3 Missing impl files repack-csga Salvatore Filippone 2024-05-27 07:15:00 -0400
  • 173ffec2d3 First working version of CSGA. To be tested and refined. Salvatore Filippone 2024-05-27 07:02:00 -0400
  • 3ba2002e60 First attempt at running CSGA. To be debugged. sfilippone 2024-05-24 17:03:46 +0200
  • c35b3b9ef3 Merge branch 'repackage' into repack-csga sfilippone 2024-05-21 17:57:59 +0200
  • 42293c62b6 Fix usage of sync() Salvatore Filippone 2024-05-21 17:49:48 +0200
  • a177e94ba5 Fix comments, Salvatore Filippone 2024-05-21 17:49:40 +0200
  • a3f839ad62 Push some intermediate mods. sfilippone 2024-05-21 17:44:44 +0200
  • c547de218d Merge branch 'repackage' into repack-csga sfilippone 2024-05-21 13:01:59 +0200
  • 15477c9eb2 Fix csga_mod sfilippone 2024-05-21 13:00:53 +0200
  • d71d355b68 Refactor cusparse includes.. sfilippone 2024-05-21 12:59:44 +0200