|
|
dc86cddeb7
|
Use tiling shmem for mat vec mul kernel
|
2024-03-15 23:33:09 +01:00 |
|
|
|
77004c16be
|
Use shared memory for mat vec mul kernel
|
2024-03-13 22:13:11 +01:00 |
|
|
|
9d91896f13
|
Change forward function to return output pointer
|
2024-03-12 20:50:49 +01:00 |
|
|
|
d2ab78fbc7
|
Add Kernels namespace
|
2024-03-11 21:04:23 +01:00 |
|
|
|
e0178e2d5c
|
Cleanup and refactor
|
2024-03-11 20:39:44 +01:00 |
|
|
|
fceef07a9b
|
Remove linear activation kernel
|
2024-03-09 22:54:23 +01:00 |
|
|
|
f4257afd5a
|
Remove cublas dependency
|
2024-03-05 18:41:35 +01:00 |
|
|
|
f37320594a
|
Add activations enum
|
2024-03-03 15:24:54 +01:00 |
|
|
|
045359cca2
|
Remove not needed code
|
2024-02-29 22:21:32 +01:00 |
|
|
|
9747abe53e
|
Fix dense layer forward prop
|
2024-02-27 21:47:46 +01:00 |
|
|
|
48ba09b28d
|
Format source code using clang-format
|
2024-02-27 18:52:12 +01:00 |
|
|
|
5e1e0ed1d1
|
Initial activations implementation
|
2024-02-27 00:24:57 +01:00 |
|
|
|
6e99525ad0
|
Rename hheader files to .cuh
|
2024-02-26 19:53:46 +01:00 |
|
|
|
035f3b053b
|
Rename files to .cu and fix IDX2C usage
|
2024-02-21 20:03:04 +01:00 |
|