|
|
f17debc244
|
Implement getOutputSize and getInputSize for seq layers
|
2024-04-22 20:31:58 +02:00 |
|
|
|
58af95eb25
|
Remove cudaDeviceReset calls from tests
|
2024-04-21 22:47:12 +02:00 |
|
|
|
bdbb3f978e
|
Fix matmul and max reduce memcheck errors
|
2024-04-21 22:11:02 +02:00 |
|
|
|
18522c2dea
|
Cleanup and refactor
|
2024-04-11 22:52:41 +02:00 |
|
|
|
4b9d123e94
|
Implement device vector utils
|
2024-04-11 22:22:33 +02:00 |
|
|
|
710a33bdde
|
Move softmax partial kernels to matmul
|
2024-04-11 22:01:47 +02:00 |
|
|
|
bf7c961b9e
|
Add cudaDeviceReset at the end of each test
|
2024-04-11 19:55:02 +02:00 |
|
|
|
b49dddf34a
|
Improve softmax numerical stability
|
2024-04-08 23:25:46 +02:00 |
|
|
|
e419a93408
|
Fix softmax sum kernel
|
2024-04-08 22:09:18 +02:00 |
|
|
|
7bc329a043
|
Add more softmax tests
|
2024-03-22 22:32:08 +01:00 |
|
|
|
6cf604423a
|
Combine padding and conv kernel
|
2024-03-18 19:53:40 +01:00 |
|
|
|
42d646750b
|
Abstract activation and implement softmax
|
2024-03-17 18:37:15 +01:00 |
|
|
|
0c22fac64e
|
Add toplevel CUDANet namespace
|
2024-03-17 16:08:53 +01:00 |
|
|
|
77004c16be
|
Use shared memory for mat vec mul kernel
|
2024-03-13 22:13:11 +01:00 |
|
|
|
d2ab78fbc7
|
Add Kernels namespace
|
2024-03-11 21:04:23 +01:00 |
|
|
|
07f231a30b
|
Switch padding kernel to row major
|
2024-03-05 21:04:11 +01:00 |
|
|
|
f4257afd5a
|
Remove cublas dependency
|
2024-03-05 18:41:35 +01:00 |
|
|
|
7e4460cc5e
|
Implement padding kernel
|
2024-03-03 15:24:39 +01:00 |
|
|
|
019ccc33d9
|
Start implementing padding kernel
|
2024-02-29 22:21:48 +01:00 |
|
|
|
045359cca2
|
Remove not needed code
|
2024-02-29 22:21:32 +01:00 |
|
|
|
e267f08a2f
|
Format code
|
2024-02-27 21:49:05 +01:00 |
|
|
|
b1eb8b5806
|
Add activations test
|
2024-02-27 20:19:17 +01:00 |
|