Commit Graph

  • 14644548a3 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage repackage sfilippone 2025-02-15 12:37:56 +0100
  • 12440e577f Fix util Makefile to delete metis_int.h and all .mod files sfilippone 2025-02-15 12:36:12 +0100
  • 5f7e40b0f5 Merge branch 'cmake' into repackage sfilippone 2025-02-13 17:17:50 +0100
  • 654733b004 Use FLIBS in configry for specific Fortran libs sfilippone 2025-02-13 17:15:57 +0100
  • 12f8c31228 Add cmake building to test examples cmake Luca Pepè Sciarria 2025-02-10 16:13:27 +0100
  • f362b5878e Update files to be compiled: added .c files Luca Pepè Sciarria 2025-02-10 16:12:05 +0100
  • 094f229fb8 Changes to configry to accommodate LLVM sfilippone 2025-02-10 15:50:39 +0100
  • c9386c58d8 Remove default(none) from some OpenMP directives. sfilippone 2025-02-10 15:49:49 +0100
  • dd31854042 Add --with-extra-opt to configry sfilippone 2025-02-09 16:22:54 +0100
  • e2b00032b6 Fix compile warnings for printf and missing return() sfilippone 2025-02-09 16:21:50 +0100
  • 6772315d4d Fix use of psi_Xaxpby sfilippone 2025-02-09 16:20:39 +0100
  • a527493258 Fix mat_get_nnz to return correct value sfilippone 2025-02-09 14:06:32 +0100
  • 269477435b Fix usage of serial AXPBY sfilippone 2025-02-09 14:05:50 +0100
  • 30810f804e Additional OpenMP tweaks, to be redesigned in the future. sfilippone 2025-02-09 14:04:34 +0100
  • eb46c1a558 Hot fix: now the CMakeLists.txt uses the right prefix path Luca Pepè Sciarria 2025-01-30 14:51:28 +0100
  • 26fb1ae755 Add CMake export throught <name>Config.cmake and <name>TargetsCmake.cmake files Luca Pepè Sciarria 2025-01-27 16:37:23 +0100
  • 7278dde01a Add Cmake compilation for hello and pingpong through cmake building Luca Pepè Sciarria 2025-01-27 16:35:28 +0100
  • 772533b6f1 Avoid excessive OpenMP overhead in SPINS. Will have to be redesigned. sfilippone 2025-01-23 15:38:05 +0100
  • 928ae981f9 add metis building. It needs a check whether all the compile flags are present gnumlab 2025-01-13 13:30:45 +0100
  • 5e07d7aad5 Merge branch 'cmake' of github.com:sfilippone/psblas3 into cmake Luca Pepè Sciarria 2025-01-13 10:13:15 +0100
  • bf0149d4eb update CMakeLists.txt to cover Metis Luca Pepè Sciarria 2025-01-13 10:10:53 +0100
  • 4e71fa971c Modify clean_zeros and have it only at base% sfilippone 2025-01-08 17:18:54 +0100
  • 66a9d5e50a Interface psb_sizeof_X size computation Salvatore Filippone 2025-01-04 09:44:58 +0100
  • c83137cf78 Constants psb_sizeof_X should be 8 bytes Salvatore Filippone 2025-01-04 09:40:30 +0100
  • 2af0d56938 Malloc and trasnfers with CUDA should use (size_t) casts sfilippone 2025-01-02 11:49:08 +0100
  • 20958f654a Make CUDAKERN tests lighter sfilippone 2025-01-02 11:46:51 +0100
  • 488f5dfd86 CMAKE should look for util/psb_metis_int.c sfilippone 2025-01-01 12:32:12 +0100
  • 7669f2ee26 Hot fix Luca Pepè Sciarria 2024-12-12 10:51:06 +0100
  • ef71a32484 Merge branch 'cmake2' into cmake Luca Pepè Sciarria 2024-12-12 10:11:45 +0100
  • 61976812be Add cmake building for base, cbind, ext, linsolve, prec, util Luca Pepè Sciarria 2024-12-12 10:08:47 +0100
  • 5b81cbac12 Add CMake building for base, prec, ext, util, krylov, cbind Luca Pepè Sciarria 2024-12-09 15:09:25 +0100
  • f904764d0b Merge branch 'openacc' of github.com:sfilippone/psblas3 into openacc openacc sfilippone 2024-12-03 10:54:55 +0100
  • 288805671f New test in openacc sfilippone 2024-12-03 10:54:33 +0100
  • 63fb828528 Fix Makefile for I2 Salvatore Filippone 2024-12-03 10:36:52 +0100
  • 61456cd42a Fix dependencies for I2 Salvatore Filippone 2024-12-03 10:34:17 +0100
  • fa6b8d5e33 Enable I2 send/receive/collectives sfilippone 2024-11-25 12:35:39 +0100
  • 7e73900703 Fix clean_zeros' description sfilippone 2024-11-19 12:16:46 +0100
  • 999830f225 Update description of clean_zeros sfilippone 2024-11-19 12:11:26 +0100
  • 283e849c94 Cleanup use of RICHARDSON sfilippone 2024-11-18 16:24:01 +0100
  • 5936c05eb4 Fix UG sfilippone 2024-11-18 15:58:53 +0100
  • 931000cc68 Update README.md Fabio Durastante 2024-11-18 10:15:20 +0100
  • f825bf37a1 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-11-17 15:12:15 +0100
  • 8907a7600a Fix krylov-> linsolve in test/openacc sfilippone 2024-11-17 15:10:39 +0100
  • 7187575915 Doc fixes Salvatore Filippone 2024-11-16 16:20:55 +0100
  • 9c4f4c4d15 Doc fixes Salvatore Filippone 2024-11-16 12:03:28 +0100
  • eefc67bbdd Docs and krylov->linsolve changes Salvatore Filippone 2024-11-16 10:50:31 +0100
  • ef1cc4b321 Update CUDA description in README sfilippone 2024-11-15 17:48:11 +0100
  • 6743913690 Update README Fortran 2003 to 2008 sfilippone 2024-11-15 17:28:48 +0100
  • f35a9ec13c Update README.md Fabio Durastante 2024-11-15 16:42:43 +0100
  • 961a315936 Update README.md Fabio Durastante 2024-11-15 16:41:05 +0100
  • d7cb13a283 Final fix for log_conv in rgmres sfilippone 2024-11-13 18:12:34 +0100
  • e2605d5e48 Final fix for log_conv in rgmres sfilippone 2024-11-13 17:47:49 +0100
  • 0ed2c105ac Fix RGMRES log_conv sfilippone 2024-11-13 16:16:07 +0100
  • 29f72195ef Fix GMRES convergence logs sfilippone 2024-11-13 14:08:13 +0100
  • ecc59b8a4b Fix test makefile for linsolve sfilippone 2024-11-13 13:21:37 +0100
  • a02440afff Updatex docs for linsolve repack-newsolve sfilippone 2024-11-11 17:48:49 +0100
  • 4f4006cf6b Configure fixes sfilippone 2024-11-10 17:30:51 +0100
  • e9aa9a5237 Restructuring linsolve. sfilippone 2024-11-10 10:59:28 +0100
  • 98d5db7377 Krylov into linsolve, step 4 sfilippone 2024-11-10 10:47:05 +0100
  • 14dce3eefd Krylov into linsolve, step 3. sfilippone 2024-11-10 10:27:31 +0100
  • ea8c526bf2 Rename krylov into linsolve step 2. sfilippone 2024-11-10 10:12:31 +0100
  • ceac2faad0 Rename krylov into linsolve where needed, step 1. sfilippone 2024-11-10 10:08:50 +0100
  • 029903dbad New Richardson method. sfilippone 2024-11-08 18:22:43 +0100
  • f1d21b1c95 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage Salvatore Filippone 2024-11-02 12:29:08 +0100
  • f10c6c1822 Fix GEPRT Salvatore Filippone 2024-11-02 12:28:24 +0100
  • 5430ba0e22 Fix multivect constructor in CUDA Salvatore Filippone 2024-11-02 12:28:08 +0100
  • ade79bcc7e Fixes for compilation with CUDA sfilippone 2024-10-09 14:27:17 +0200
  • abdf7fc05a Fix constructor name for multivector sfilippone 2024-10-09 13:32:29 +0200
  • 6972c50542 Updated readme sfilippone 2024-10-09 10:03:37 +0200
  • 744f14d2f5 Merge branch 'oacc_loloum' into repackage sfilippone 2024-10-08 17:43:40 +0200
  • 5903c0b272 Fix DOT in OpenACC oacc_loloum sfilippone 2024-10-08 17:41:42 +0200
  • 4822861b73 Merge branch 'oacc_loloum' of github.com:sfilippone/psblas3 into oacc_loloum sfilippone 2024-10-08 17:37:28 +0200
  • 49469ce021 Various changes into openacc sfilippone 2024-10-08 17:36:44 +0200
  • 3d9fee2dd7 Fix DOT on CUDA vectors. sfilippone 2024-10-08 17:07:10 +0200
  • 949499265e Simplify clean_zeros sfilippone 2024-10-08 11:48:48 +0200
  • c74be820ea Rework configry for CUDA sfilippone 2024-10-08 11:48:15 +0200
  • ee56c6be3c Cosmetic changes to OpenACC vectors sfilippone 2024-10-08 11:47:37 +0200
  • 740609a4d8 Fix present() clauses sfilippone 2024-10-07 12:45:18 +0200
  • 68f20c0e7a Modify init sfilippone 2024-10-07 12:44:45 +0200
  • 9601a837f5 Define --enable-cuda --with-cudadir for CUDA configry sfilippone 2024-10-04 11:18:39 +0200
  • 1c235f9281 Improve clean_zeros sfilippone 2024-10-04 10:28:52 +0200
  • 108d544fc1 Fix clean_zeros to always keep the diagonal sfilippone 2024-10-04 08:37:45 +0200
  • 8ab5cef448 OpenACC environment fixes sfilippone 2024-10-04 08:37:22 +0200
  • 174a8e7aef Merge branch 'oacc_loloum' into repackage sfilippone 2024-09-13 11:05:28 +0200
  • a8dcba2964 Merge branch 'development' into repackage sfilippone 2024-09-12 19:09:01 +0200
  • 736f3cc629 Merge branch 'repackage' of github.com:sfilippone/psblas3 into repackage sfilippone 2024-09-12 19:08:53 +0200
  • e5504ddddc Fix memory traffic in GTH/SCT sfilippone 2024-09-12 15:08:21 +0200
  • 096bce08c1 Merged changes from V4 OpenACC sfilippone 2024-09-11 12:43:46 +0200
  • bcbe0c89c7 Backporting fixes from version 4 sfilippone 2024-09-10 17:37:35 +0200
  • 6236f3489c Remove obsolete files sfilippone 2024-08-30 16:05:08 +0200
  • f783478df3 Merge updates from V4 sfilippone 2024-08-30 16:03:04 +0200
  • fbb974fb8b Change name sync|free space, unify allocate impl sfilippone 2024-08-30 10:20:58 +0200
  • 479135c62d Merge some changes from V4 sfilippone 2024-08-29 16:36:24 +0200
  • 95c546aadd Fix OpenACC version of ELL vect_mv sfilippone 2024-08-26 08:22:35 +0200
  • fa5e7ff945 Fixes for vector methods and sync() sfilippone 2024-08-20 19:38:34 +0200
  • 9314b2cf53 Fix missing method in oacc_ell sfilippone 2024-08-08 15:08:45 +0200
  • 8220140729 Merged recent changes from development sfilippone 2024-08-08 14:59:09 +0200
  • cf2cc6cab9 Precedence of oacc_vect modules sfilippone 2024-08-08 14:34:28 +0200
  • 3168b7e8f7 Merge branch 'development' into oacc_loloum sfilippone 2024-08-08 14:24:21 +0200
  • 7857015923 Cosmetic changes to vect_mod development sfilippone 2024-08-08 14:23:03 +0200