Commit Graph

  • 4e71fa971c Modify clean_zeros and have it only at base% repackage sfilippone 2025-01-08 17:18:54 +0100
  • 66a9d5e50a Interface psb_sizeof_X size computation Salvatore Filippone 2025-01-04 09:44:58 +0100
  • c83137cf78 Constants psb_sizeof_X should be 8 bytes Salvatore Filippone 2025-01-04 09:40:30 +0100
  • 2af0d56938 Malloc and trasnfers with CUDA should use (size_t) casts sfilippone 2025-01-02 11:49:08 +0100
  • 20958f654a Make CUDAKERN tests lighter sfilippone 2025-01-02 11:46:51 +0100
  • 488f5dfd86 CMAKE should look for util/psb_metis_int.c cmake sfilippone 2025-01-01 12:32:12 +0100
  • 7669f2ee26 Hot fix Luca Pepè Sciarria 2024-12-12 10:51:06 +0100
  • ef71a32484 Merge branch 'cmake2' into cmake Luca Pepè Sciarria 2024-12-12 10:11:45 +0100
  • 61976812be Add cmake building for base, cbind, ext, linsolve, prec, util Luca Pepè Sciarria 2024-12-12 10:08:47 +0100
  • 5b81cbac12 Add CMake building for base, prec, ext, util, krylov, cbind Luca Pepè Sciarria 2024-12-09 15:09:25 +0100
  • f904764d0b Merge branch 'openacc' of github.com:sfilippone/psblas3 into openacc openacc sfilippone 2024-12-03 10:54:55 +0100
  • 288805671f New test in openacc sfilippone 2024-12-03 10:54:33 +0100
  • 63fb828528 Fix Makefile for I2 Salvatore Filippone 2024-12-03 10:36:52 +0100
  • 61456cd42a Fix dependencies for I2 Salvatore Filippone 2024-12-03 10:34:17 +0100
  • fa6b8d5e33 Enable I2 send/receive/collectives sfilippone 2024-11-25 12:35:39 +0100
  • 7e73900703 Fix clean_zeros' description sfilippone 2024-11-19 12:16:46 +0100
  • 999830f225 Update description of clean_zeros sfilippone 2024-11-19 12:11:26 +0100
  • 283e849c94 Cleanup use of RICHARDSON sfilippone 2024-11-18 16:24:01 +0100
  • 5936c05eb4 Fix UG sfilippone 2024-11-18 15:58:53 +0100
  • 931000cc68
    Update README.md Fabio Durastante 2024-11-18 10:15:20 +0100
  • f825bf37a1 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-11-17 15:12:15 +0100
  • 8907a7600a Fix krylov-> linsolve in test/openacc sfilippone 2024-11-17 15:10:39 +0100
  • 7187575915 Doc fixes Salvatore Filippone 2024-11-16 16:20:55 +0100
  • 9c4f4c4d15 Doc fixes Salvatore Filippone 2024-11-16 12:03:28 +0100
  • eefc67bbdd Docs and krylov->linsolve changes Salvatore Filippone 2024-11-16 10:50:31 +0100
  • ef1cc4b321 Update CUDA description in README sfilippone 2024-11-15 17:48:11 +0100
  • 6743913690 Update README Fortran 2003 to 2008 sfilippone 2024-11-15 17:28:48 +0100
  • f35a9ec13c
    Update README.md Fabio Durastante 2024-11-15 16:42:43 +0100
  • 961a315936
    Update README.md Fabio Durastante 2024-11-15 16:41:05 +0100
  • d7cb13a283 Final fix for log_conv in rgmres sfilippone 2024-11-13 18:12:34 +0100
  • e2605d5e48 Final fix for log_conv in rgmres sfilippone 2024-11-13 17:47:49 +0100
  • 0ed2c105ac Fix RGMRES log_conv sfilippone 2024-11-13 16:16:07 +0100
  • 29f72195ef Fix GMRES convergence logs sfilippone 2024-11-13 14:08:13 +0100
  • ecc59b8a4b Fix test makefile for linsolve sfilippone 2024-11-13 13:21:37 +0100
  • a02440afff Updatex docs for linsolve repack-newsolve sfilippone 2024-11-11 17:48:49 +0100
  • 4f4006cf6b Configure fixes sfilippone 2024-11-10 17:30:51 +0100
  • e9aa9a5237 Restructuring linsolve. sfilippone 2024-11-10 10:59:28 +0100
  • 98d5db7377 Krylov into linsolve, step 4 sfilippone 2024-11-10 10:47:05 +0100
  • 14dce3eefd Krylov into linsolve, step 3. sfilippone 2024-11-10 10:27:31 +0100
  • ea8c526bf2 Rename krylov into linsolve step 2. sfilippone 2024-11-10 10:12:31 +0100
  • ceac2faad0 Rename krylov into linsolve where needed, step 1. sfilippone 2024-11-10 10:08:50 +0100
  • 029903dbad New Richardson method. sfilippone 2024-11-08 18:22:43 +0100
  • f1d21b1c95 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage Salvatore Filippone 2024-11-02 12:29:08 +0100
  • f10c6c1822 Fix GEPRT Salvatore Filippone 2024-11-02 12:28:24 +0100
  • 5430ba0e22 Fix multivect constructor in CUDA Salvatore Filippone 2024-11-02 12:28:08 +0100
  • ade79bcc7e Fixes for compilation with CUDA sfilippone 2024-10-09 14:27:17 +0200
  • abdf7fc05a Fix constructor name for multivector sfilippone 2024-10-09 13:32:29 +0200
  • 6972c50542 Updated readme sfilippone 2024-10-09 10:03:37 +0200
  • 744f14d2f5 Merge branch 'oacc_loloum' into repackage sfilippone 2024-10-08 17:43:40 +0200
  • 5903c0b272 Fix DOT in OpenACC oacc_loloum sfilippone 2024-10-08 17:41:42 +0200
  • 4822861b73 Merge branch 'oacc_loloum' of github.com:sfilippone/psblas3 into oacc_loloum sfilippone 2024-10-08 17:37:28 +0200
  • 49469ce021 Various changes into openacc sfilippone 2024-10-08 17:36:44 +0200
  • 3d9fee2dd7 Fix DOT on CUDA vectors. sfilippone 2024-10-08 17:07:10 +0200
  • 949499265e Simplify clean_zeros sfilippone 2024-10-08 11:48:48 +0200
  • c74be820ea Rework configry for CUDA sfilippone 2024-10-08 11:48:15 +0200
  • ee56c6be3c Cosmetic changes to OpenACC vectors sfilippone 2024-10-08 11:47:37 +0200
  • 740609a4d8 Fix present() clauses sfilippone 2024-10-07 12:45:18 +0200
  • 68f20c0e7a Modify init sfilippone 2024-10-07 12:44:45 +0200
  • 9601a837f5 Define --enable-cuda --with-cudadir for CUDA configry sfilippone 2024-10-04 11:18:39 +0200
  • 1c235f9281 Improve clean_zeros sfilippone 2024-10-04 10:28:52 +0200
  • 108d544fc1 Fix clean_zeros to always keep the diagonal sfilippone 2024-10-04 08:37:45 +0200
  • 8ab5cef448 OpenACC environment fixes sfilippone 2024-10-04 08:37:22 +0200
  • 174a8e7aef Merge branch 'oacc_loloum' into repackage sfilippone 2024-09-13 11:05:28 +0200
  • a8dcba2964 Merge branch 'development' into repackage sfilippone 2024-09-12 19:09:01 +0200
  • 736f3cc629 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-09-12 19:08:53 +0200
  • e5504ddddc Fix memory traffic in GTH/SCT sfilippone 2024-09-12 15:08:21 +0200
  • 096bce08c1 Merged changes from V4 OpenACC sfilippone 2024-09-11 12:43:46 +0200
  • bcbe0c89c7 Backporting fixes from version 4 sfilippone 2024-09-10 17:37:35 +0200
  • 6236f3489c Remove obsolete files sfilippone 2024-08-30 16:05:08 +0200
  • f783478df3 Merge updates from V4 sfilippone 2024-08-30 16:03:04 +0200
  • fbb974fb8b Change name sync|free space, unify allocate impl sfilippone 2024-08-30 10:20:58 +0200
  • 479135c62d Merge some changes from V4 sfilippone 2024-08-29 16:36:24 +0200
  • 95c546aadd Fix OpenACC version of ELL vect_mv sfilippone 2024-08-26 08:22:35 +0200
  • fa5e7ff945 Fixes for vector methods and sync() sfilippone 2024-08-20 19:38:34 +0200
  • 9314b2cf53 Fix missing method in oacc_ell sfilippone 2024-08-08 15:08:45 +0200
  • 8220140729 Merged recent changes from development sfilippone 2024-08-08 14:59:09 +0200
  • cf2cc6cab9 Precedence of oacc_vect modules sfilippone 2024-08-08 14:34:28 +0200
  • 3168b7e8f7 Merge branch 'development' into oacc_loloum sfilippone 2024-08-08 14:24:21 +0200
  • 7857015923 Cosmetic changes to vect_mod development sfilippone 2024-08-08 14:23:03 +0200
  • 2709aa9f16 Fix upd_xyz name sfilippone 2024-08-08 11:10:04 +0200
  • 2982aaee27 Implementation in OpenACC for ELL and HLL into templates. Merge from development sfilippone 2024-08-08 11:07:28 +0200
  • ff8513b4c6 Ignore .smod files in git sfilippone 2024-08-07 17:04:29 +0200
  • 03aaa090db New AX_OPENACC macro and supporting flags sfilippone 2024-08-07 17:04:04 +0200
  • 464baceb13 HLL loop nest cannot run with collapse sfilippone 2024-08-07 10:54:19 +0200
  • a28d3b048b Fix configure for OpenACC (added warning message) sfilippone 2024-08-07 10:30:11 +0200
  • 9c244964db Merge branch 'development' into oacc_merge merge development into oacc tloloum 2024-08-06 11:44:12 +0200
  • 55d1067ec2 collapse loop tloloum 2024-08-06 11:42:52 +0200
  • e6fa1d17a2 oacc hll tloloum 2024-07-30 14:25:01 +0200
  • 4461b44eda Change name abgdxyz into upd_xyz sfilippone 2024-07-29 16:59:27 +0200
  • 10ec5eafab ELL oacc impl tloloum 2024-07-26 09:56:32 +0200
  • 162b2fc78f beginning of oacc_ell tloloum 2024-07-22 15:27:33 +0200
  • 08a9744413 moving test tloloum 2024-07-19 13:01:07 +0200
  • b5a8c549dd psb_d_oacc_pde3d draft tloloum 2024-07-19 11:35:11 +0200
  • e8491380e2 Take out obsolete test targets from makefile sfilippone 2024-07-19 09:40:14 +0200
  • 9e18545151 Fix typos sfilippone 2024-07-17 13:04:07 +0200
  • 686bac4224 Account for S/D/C/Z variants sfilippone 2024-07-17 13:03:46 +0200
  • b6fe0f3344 New version for modules and methods sfilippone 2024-07-17 08:52:43 +0200
  • db558cace3 Merge branch 'development' of github.com:sfilippone/psblas3 into development Salvatore Filippone 2024-07-16 13:53:34 +0200
  • ab38a91d10 Fix metis interfacing Salvatore Filippone 2024-07-16 13:53:00 +0200
  • de27c8f616 Merge branch 'repackage' into development sfilippone 2024-07-16 13:23:16 +0200