CUDANet

mirror of https://github.com/lordmathis/CUDANet.git synced 2025-12-23 14:54:28 +00:00

Author	SHA1	Message	Date
LordMathis	f17debc244	Implement getOutputSize and getInputSize for seq layers	2024-04-22 20:31:58 +02:00
LordMathis	58af95eb25	Remove cudaDeviceReset calls from tests	2024-04-21 22:47:12 +02:00
LordMathis	bdbb3f978e	Fix matmul and max reduce memcheck errors	2024-04-21 22:11:02 +02:00
LordMathis	18522c2dea	Cleanup and refactor	2024-04-11 22:52:41 +02:00
LordMathis	4b9d123e94	Implement device vector utils	2024-04-11 22:22:33 +02:00
LordMathis	710a33bdde	Move softmax partial kernels to matmul	2024-04-11 22:01:47 +02:00
LordMathis	bf7c961b9e	Add cudaDeviceReset at the end of each test	2024-04-11 19:55:02 +02:00
LordMathis	b49dddf34a	Improve softmax numerical stability	2024-04-08 23:25:46 +02:00
LordMathis	e419a93408	Fix softmax sum kernel	2024-04-08 22:09:18 +02:00
LordMathis	7bc329a043	Add more softmax tests	2024-03-22 22:32:08 +01:00
LordMathis	6cf604423a	Combine padding and conv kernel	2024-03-18 19:53:40 +01:00
LordMathis	42d646750b	Abstract activation and implement softmax	2024-03-17 18:37:15 +01:00
LordMathis	0c22fac64e	Add toplevel CUDANet namespace	2024-03-17 16:08:53 +01:00
LordMathis	77004c16be	Use shared memory for mat vec mul kernel	2024-03-13 22:13:11 +01:00
LordMathis	d2ab78fbc7	Add Kernels namespace	2024-03-11 21:04:23 +01:00
LordMathis	07f231a30b	Switch padding kernel to row major	2024-03-05 21:04:11 +01:00
LordMathis	f4257afd5a	Remove cublas dependency	2024-03-05 18:41:35 +01:00
LordMathis	7e4460cc5e	Implement padding kernel	2024-03-03 15:24:39 +01:00
LordMathis	019ccc33d9	Start implementing padding kernel	2024-02-29 22:21:48 +01:00
LordMathis	045359cca2	Remove not needed code	2024-02-29 22:21:32 +01:00
LordMathis	e267f08a2f	Format code	2024-02-27 21:49:05 +01:00
LordMathis	b1eb8b5806	Add activations test	2024-02-27 20:19:17 +01:00

22 Commits