Commit Graph

60 Commits (18ed3d20fff592d8a50d25cef2ae04f0dbdcfff8)

Author SHA1 Message Date
sfilippone 9e682111be Merge branch 'development' into cmake 12 months ago
sfilippone 07fa2323eb Fixes for IPK8 12 months ago
gnumlab 1b667c2c1e remove unused files from building 1 year ago
Luca Pepè Sciarria aea5e36b63 init cuda CMakeLists.txt 1 year ago
sfilippone 24e4b76241 Fix cuda subdir makefile 1 year ago
sfilippone c4ab4be45d Cleanup some cuda compile warnings. 1 year ago
sfilippone 243fe4e78f Split type definitions from psb_config 1 year ago
sfilippone ea6c4181f7 Changed all defines with prefix PSB_ 1 year ago
sfilippone 333f0e0fb7 Fix cuda sources to use psb_config.h 1 year ago
Salvatore Filippone 67876fec73 Use $(AR) in makefiles merged from EXT 1 year ago
sfilippone e2b00032b6 Fix compile warnings for printf and missing return() 1 year ago
sfilippone 2af0d56938 Malloc and trasnfers with CUDA should use (size_t) casts 1 year ago
sfilippone ade79bcc7e Fixes for compilation with CUDA 2 years ago
sfilippone abdf7fc05a Fix constructor name for multivector 2 years ago
sfilippone 3d9fee2dd7 Fix DOT on CUDA vectors. 2 years ago
sfilippone c74be820ea Rework configry for CUDA 2 years ago
sfilippone 4461b44eda Change name abgdxyz into upd_xyz 2 years ago
sfilippone 9f2b8a2623 Cleanup 2 years ago
sfilippone e3a55967a5 Modify CUDA code to compile with 12.4/12.5 2 years ago
sfilippone d71d355b68 Refactor cusparse includes.. 2 years ago
sfilippone 2e3f862e42 Start refactoring cusparse.h 2 years ago
sfilippone ee66db5efd Refactor interface to cusparse in preparation for CSR Adaptive 2 years ago
sfilippone c8cc2275d0 Fix cuda/makefile for make -j 2 years ago
sfilippone d01b8145c6 Fix cuda makefile dependencies 2 years ago
sfilippone e18de650f2 Take out debug print 2 years ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 2 years ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 2 years ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 2 years ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 2 years ago
sfilippone 48455190ec Add GPU version of XYZW 2 years ago
sfilippone a11f328e62 Added CUDA version of XYZW 2 years ago
sfilippone b5d5f97661 Improve cuda%zero() 2 years ago
sfilippone 0e269ed641 typo in Cabgdxyz 2 years ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 2 years ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 2 years ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 2 years ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 2 years ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 2 years ago
sfilippone 0568a83734 Fix ifdef and old code 2 years ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 2 years ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 2 years ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 2 years ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 2 years ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 2 years ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 2 years ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 2 years ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 2 years ago
Salvatore Filippone 14c4ff0f32 Added new methd for two combined axpbys 2 years ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 2 years ago
sfilippone 097d63147a Fix cuda dir makefile 2 years ago