Commit Graph

21 Commits

Author SHA1 Message Date
58af95eb25 Remove cudaDeviceReset calls from tests 2024-04-21 22:47:12 +02:00
bdbb3f978e Fix matmul and max reduce memcheck errors 2024-04-21 22:11:02 +02:00
18522c2dea Cleanup and refactor 2024-04-11 22:52:41 +02:00
4b9d123e94 Implement device vector utils 2024-04-11 22:22:33 +02:00
710a33bdde Move softmax partial kernels to matmul 2024-04-11 22:01:47 +02:00
bf7c961b9e Add cudaDeviceReset at the end of each test 2024-04-11 19:55:02 +02:00
b49dddf34a Improve softmax numerical stability 2024-04-08 23:25:46 +02:00
e419a93408 Fix softmax sum kernel 2024-04-08 22:09:18 +02:00
7bc329a043 Add more softmax tests 2024-03-22 22:32:08 +01:00
6cf604423a Combine padding and conv kernel 2024-03-18 19:53:40 +01:00
42d646750b Abstract activation and implement softmax 2024-03-17 18:37:15 +01:00
0c22fac64e Add toplevel CUDANet namespace 2024-03-17 16:08:53 +01:00
77004c16be Use shared memory for mat vec mul kernel 2024-03-13 22:13:11 +01:00
d2ab78fbc7 Add Kernels namespace 2024-03-11 21:04:23 +01:00
07f231a30b Switch padding kernel to row major 2024-03-05 21:04:11 +01:00
f4257afd5a Remove cublas dependency 2024-03-05 18:41:35 +01:00
7e4460cc5e Implement padding kernel 2024-03-03 15:24:39 +01:00
019ccc33d9 Start implementing padding kernel 2024-02-29 22:21:48 +01:00
045359cca2 Remove not needed code 2024-02-29 22:21:32 +01:00
e267f08a2f Format code 2024-02-27 21:49:05 +01:00
b1eb8b5806 Add activations test 2024-02-27 20:19:17 +01:00