Commit Graph

203 Commits

Author SHA1 Message Date
432adf57bd Test model weights loading 2024-04-16 21:07:06 +02:00
9fb9d7e8e1 Implement getting layer, weights and biases 2024-04-16 19:09:41 +02:00
f4ae45f867 Start implementing weights import 2024-04-15 22:17:48 +02:00
d8c50116e8 Implement pytorch weights export 2024-04-15 22:17:14 +02:00
13b455e4ba Implement test multiple inference 2024-04-14 00:19:29 +02:00
b20ade27d8 Implement model destructor 2024-04-14 00:05:57 +02:00
53c976733b Refactor model test 2024-04-14 00:05:32 +02:00
457e0f669f Implement model test 2024-04-13 13:31:29 +02:00
18522c2dea Cleanup and refactor 2024-04-11 22:52:41 +02:00
4b9d123e94 Implement device vector utils 2024-04-11 22:22:33 +02:00
710a33bdde Move softmax partial kernels to matmul 2024-04-11 22:01:47 +02:00
bf7c961b9e Add cudaDeviceReset at the end of each test 2024-04-11 19:55:02 +02:00
bc86ed1782 Add activation to pooling layers 2024-04-11 19:50:54 +02:00
e86e04f6d6 Add clearing kernel 2024-04-11 19:49:09 +02:00
b49dddf34a Improve softmax numerical stability 2024-04-08 23:25:46 +02:00
e419a93408 Fix softmax sum kernel 2024-04-08 22:09:18 +02:00
7bc329a043 Add more softmax tests 2024-03-22 22:32:08 +01:00
9482d7bc43 Add model predict test 2024-03-22 22:31:32 +01:00
9ecd51c63a Split python scripts 2024-03-22 20:03:22 +01:00
87db47089e Add output layer to model predict 2024-03-21 23:22:12 +01:00
90fb104dae Implement output layer 2024-03-21 23:07:46 +01:00
e46d5d3f76 Update README 2024-03-21 00:26:11 +01:00
a9d0a0832a Change model input layer creation 2024-03-21 00:24:49 +01:00
af6838e8ae Initial model implementation 2024-03-20 22:31:39 +01:00
6f4cdf3792 Implement avg pool test 2024-03-20 21:57:22 +01:00
dfff0360d9 Implement max pooling test 2024-03-20 21:44:04 +01:00
c062e89972 Use 3d memory layout for pooling 2024-03-20 19:21:30 +01:00
5860faf85e Use 3d memory layout for convolution 2024-03-20 19:15:27 +01:00
ef63cbd9f1 Implement avg pooling 2024-03-19 22:33:43 +01:00
a0fc1b00ae Implement max pooling layer 2024-03-19 22:04:58 +01:00
364715ff70 Refactor kernels 2024-03-19 21:37:25 +01:00
b6c4b7d2ae Refactor layers 2024-03-19 21:35:05 +01:00
8d14b74f66 Implement Add layer 2024-03-18 20:37:13 +01:00
d9c6c663c8 Rename ILayer to WeightedLayer 2024-03-18 20:36:52 +01:00
6cf604423a Combine padding and conv kernel 2024-03-18 19:53:40 +01:00
e6d3757312 Change unsigned int to int 2024-03-18 19:40:00 +01:00
aac0c3a826 Implement concat layer 2024-03-17 21:38:29 +01:00
9aab7cdb10 Update README 2024-03-17 19:12:39 +01:00
48060dc38d Rename python tool script 2024-03-17 19:11:41 +01:00
cbdb4e7707 Test softmax 2024-03-17 19:08:16 +01:00
42d646750b Abstract activation and implement softmax 2024-03-17 18:37:15 +01:00
b1621819ca Update README 2024-03-17 16:18:42 +01:00
0c22fac64e Add toplevel CUDANet namespace 2024-03-17 16:08:53 +01:00
dc86cddeb7 Use tiling shmem for mat vec mul kernel 2024-03-15 23:33:09 +01:00
88f7fff217 Add prefix to guards 2024-03-13 22:23:23 +01:00
77004c16be Use shared memory for mat vec mul kernel 2024-03-13 22:13:11 +01:00
09480e42e5 Add doxygen config 2024-03-12 22:09:37 +01:00
7157a27e56 Add documentation comments 2024-03-12 21:50:06 +01:00
708164e4d0 Implement simple input layer 2024-03-12 21:16:46 +01:00
9d91896f13 Change forward function to return output pointer 2024-03-12 20:50:49 +01:00