Mathis/CUDANet
mirror of https://github.com/lordmathis/CUDANet.git synced 2025-11-06 17:54:27 +00:00
132 Commits 1 Branch 0 Tags
3320f610db89bc428c85428a9e1b5070d26699ef
Commit Graph

10 Commits

Author      SHA1        Message                                                  Date
LordMathis  f17debc244  Implement getOutputSize and getInputSize for seq layers  2024-04-22 20:31:58 +02:00
LordMathis  58af95eb25  Remove cudaDeviceReset calls from tests                  2024-04-21 22:47:12 +02:00
LordMathis  bdbb3f978e  Fix matmul and max reduce memcheck errors                2024-04-21 22:11:02 +02:00
LordMathis  18522c2dea  Cleanup and refactor                                     2024-04-11 22:52:41 +02:00
LordMathis  4b9d123e94  Implement device vector utils                            2024-04-11 22:22:33 +02:00
LordMathis  710a33bdde  Move softmax partial kernels to matmul                   2024-04-11 22:01:47 +02:00
LordMathis  bf7c961b9e  Add cudaDeviceReset at the end of each test              2024-04-11 19:55:02 +02:00
LordMathis  b49dddf34a  Improve softmax numerical stability                      2024-04-08 23:25:46 +02:00
LordMathis  0c22fac64e  Add toplevel CUDANet namespace                           2024-03-17 16:08:53 +01:00
LordMathis  77004c16be  Use shared memory for mat vec mul kernel                 2024-03-13 22:13:11 +01:00