|
|
5c8d3f7e25
|
Compute mean and variance
|
2024-04-29 20:55:11 +02:00 |
|
|
|
0ab623fa23
|
Implement vector mean calculation
|
2024-04-28 22:04:15 +02:00 |
|
|
|
a32c737785
|
Allocate activation on heap
|
2024-04-22 18:59:16 +02:00 |
|
|
|
bdbb3f978e
|
Fix matmul and max reduce memcheck errors
|
2024-04-21 22:11:02 +02:00 |
|
|
|
9a6152469a
|
Update activation test
|
2024-04-21 14:00:43 +02:00 |
|
|
|
0170afaf3f
|
Improve cuda error handling
|
2024-04-21 12:19:19 +02:00 |
|
|
|
18522c2dea
|
Cleanup and refactor
|
2024-04-11 22:52:41 +02:00 |
|
|
|
4b9d123e94
|
Implement device vector utils
|
2024-04-11 22:22:33 +02:00 |
|
|
|
f4257afd5a
|
Remove cublas dependency
|
2024-03-05 18:41:35 +01:00 |
|
|
|
48ba09b28d
|
Format source code using clang-format
|
2024-02-27 18:52:12 +01:00 |
|
|
|
6e99525ad0
|
Rename hheader files to .cuh
|
2024-02-26 19:53:46 +01:00 |
|
|
|
035f3b053b
|
Rename files to .cu and fix IDX2C usage
|
2024-02-21 20:03:04 +01:00 |
|
|
|
ee1a8cc6e6
|
Set up basic tests with gtest
|
2024-02-17 23:07:26 +01:00 |
|
|
|
22e33a395b
|
Initialize and use cuBLAS properly
|
2024-02-10 17:17:07 +01:00 |
|
|
|
950021389c
|
Start dense layer implementation
|
2024-02-08 19:19:51 +01:00 |
|
|
|
b16ec69469
|
Initial CUDA test
|
2024-02-07 21:14:46 +01:00 |
|