Commit Graph

56 Commits (540946bc9311401dbfe448a8f04ee8a2ff78b108)

Author SHA1 Message Date
sfilippone 24e4b76241 Fix cuda subdir makefile 1 year ago
sfilippone c4ab4be45d Cleanup some cuda compile warnings. 1 year ago
sfilippone 243fe4e78f Split type definitions from psb_config 1 year ago
sfilippone ea6c4181f7 Changed all defines with prefix PSB_ 1 year ago
sfilippone 333f0e0fb7 Fix cuda sources to use psb_config.h 1 year ago
Salvatore Filippone 67876fec73 Use $(AR) in makefiles merged from EXT 1 year ago
sfilippone e2b00032b6 Fix compile warnings for printf and missing return() 1 year ago
sfilippone 2af0d56938 Malloc and trasnfers with CUDA should use (size_t) casts 1 year ago
sfilippone ade79bcc7e Fixes for compilation with CUDA 2 years ago
sfilippone abdf7fc05a Fix constructor name for multivector 2 years ago
sfilippone 3d9fee2dd7 Fix DOT on CUDA vectors. 2 years ago
sfilippone c74be820ea Rework configry for CUDA 2 years ago
sfilippone 4461b44eda Change name abgdxyz into upd_xyz 2 years ago
sfilippone 9f2b8a2623 Cleanup 2 years ago
sfilippone e3a55967a5 Modify CUDA code to compile with 12.4/12.5 2 years ago
sfilippone d71d355b68 Refactor cusparse includes.. 2 years ago
sfilippone 2e3f862e42 Start refactoring cusparse.h 2 years ago
sfilippone ee66db5efd Refactor interface to cusparse in preparation for CSR Adaptive 2 years ago
sfilippone c8cc2275d0 Fix cuda/makefile for make -j 2 years ago
sfilippone d01b8145c6 Fix cuda makefile dependencies 2 years ago
sfilippone e18de650f2 Take out debug print 2 years ago
sfilippone 0760e4d553 Fix C function declarations for compilation with LLVM/clang in CUDA 2 years ago
sfilippone 3a25d7b04a Fixes for LLVM compilation 2 years ago
sfilippone e0a4d362fa Define flag TRACK_CUDA_MALLOC 2 years ago
Salvatore Filippone b5f1442ac8 Merge branch 'nond-rep' into repackage 2 years ago
sfilippone 48455190ec Add GPU version of XYZW 2 years ago
sfilippone a11f328e62 Added CUDA version of XYZW 2 years ago
sfilippone b5d5f97661 Improve cuda%zero() 2 years ago
sfilippone 0e269ed641 typo in Cabgdxyz 2 years ago
Salvatore Filippone d95077ffd6 Fix typo in vectordev_mod 2 years ago
Salvatore Filippone 2d3773df98 CUDA kernels for ABGDXYZ 2 years ago
Salvatore Filippone 2a75d677d0 ABGDXYZ in vectordev_mod 2 years ago
sfilippone 2391f64df6 X_cuda_vect%abgdxyz 2 years ago
sfilippone 93c71c4316 Fix %ZERO() on cuda 2 years ago
sfilippone 0568a83734 Fix ifdef and old code 2 years ago
Salvatore Filippone 35d68aa4e3 Reuse calls to getDeviceProperties done at init time 2 years ago
Salvatore Filippone 1ba8dfc7b7 Switch FOR and IF in AXPBY 2 years ago
Salvatore Filippone f9677bc892 Enabled new CUDA version of ABGDXYZ 2 years ago
Salvatore Filippone 4681767ef8 New implementation for ABGDXYZ in CUDA 2 years ago
Salvatore Filippone 105aa3c570 Intermediate impl of ABGDXYZ 2 years ago
Salvatore Filippone 864872ecac Intermediate implementation of abgdxyz on cuda 2 years ago
Salvatore Filippone a41b209144 Better AXPBY implementation in CUDA. 2 years ago
Salvatore Filippone ebc7c6b3b4 Fix call to base%abgdxyz 2 years ago
Salvatore Filippone 14c4ff0f32 Added new methd for two combined axpbys 2 years ago
sfilippone 6433dc797e Fix CUDA implementation of %set_scal and %zero 2 years ago
sfilippone 097d63147a Fix cuda dir makefile 2 years ago
Salvatore Filippone 20a01d4d71 Attempt at fixing CSRG in CUDA 10.2. Not complete yet. 2 years ago
sfilippone 1bc2a884e2 Adjust conditional compilation on CUDA version 2 years ago
Salvatore Filippone 62db7c0449 Fix spsv with CSRG handling of descriptors. 2 years ago
Salvatore Filippone d28ea462d9 Modified CSRG to work with latest versions; cusparse docs are unclear 2 years ago