223 Commits

SHA1 Message Date
c2acad151b Implement simple model validation 2024-04-22 20:57:40 +02:00
f17debc244 Implement getOutputSize and getInputSize for seq layers 2024-04-22 20:31:58 +02:00
fbf6c44bdd Remove extra cudaFree 2024-04-22 19:05:02 +02:00
a32c737785 Allocate activation on heap 2024-04-22 18:59:16 +02:00
26cea9b12c Fix more cuda memcheck errors 2024-04-21 22:56:55 +02:00
58af95eb25 Remove cudaDeviceReset calls from tests 2024-04-21 22:47:12 +02:00
bdbb3f978e Fix matmul and max reduce memcheck errors 2024-04-21 22:11:02 +02:00
9a6152469a Update activation test 2024-04-21 14:00:43 +02:00
942ee6a32b Add layer name to vector 2024-04-21 12:20:02 +02:00
0170afaf3f Improve cuda error handling 2024-04-21 12:19:19 +02:00
d64a28bc9c Fix model weights export 2024-04-21 00:05:56 +02:00
9c5d853b75 Fix bin file seek offset 2024-04-20 21:30:01 +02:00
5e663b9029 Fix bias in conv layer 2024-04-20 19:09:00 +02:00
d08567a563 Fix weight and bias parsing and improve error logging 2024-04-20 18:36:53 +02:00
ecf7416f8e Rework padding size setting 2024-04-20 16:31:28 +02:00
dbaab5652e Ignore .bin files 2024-04-20 15:51:02 +02:00
0807a0f2b8 Export pretrained alexnet 2024-04-20 15:50:12 +02:00
ab10959f35 Start implementing AlexNet 2024-04-18 22:31:14 +02:00
ef3a62f399 Remove hardcoded cuda architecture 2024-04-18 20:44:32 +02:00
c1d93bd008 Update README 2024-04-16 21:41:10 +02:00
432adf57bd Test model weights loading 2024-04-16 21:07:06 +02:00
9fb9d7e8e1 Implement getting layer, weights and biases 2024-04-16 19:09:41 +02:00
f4ae45f867 Start implementing weights import 2024-04-15 22:17:48 +02:00
d8c50116e8 Implement pytorch weights export 2024-04-15 22:17:14 +02:00
13b455e4ba Implement test multiple inference 2024-04-14 00:19:29 +02:00
b20ade27d8 Implement model destructor 2024-04-14 00:05:57 +02:00
53c976733b Refactor model test 2024-04-14 00:05:32 +02:00
457e0f669f Implement model test 2024-04-13 13:31:29 +02:00
18522c2dea Cleanup and refactor 2024-04-11 22:52:41 +02:00
4b9d123e94 Implement device vector utils 2024-04-11 22:22:33 +02:00
710a33bdde Move softmax partial kernels to matmul 2024-04-11 22:01:47 +02:00
bf7c961b9e Add cudaDeviceReset at the end of each test 2024-04-11 19:55:02 +02:00
bc86ed1782 Add activation to pooling layers 2024-04-11 19:50:54 +02:00
e86e04f6d6 Add clearing kernel 2024-04-11 19:49:09 +02:00
b49dddf34a Improve softmax numerical stability 2024-04-08 23:25:46 +02:00
e419a93408 Fix softmax sum kernel 2024-04-08 22:09:18 +02:00
7bc329a043 Add more softmax tests 2024-03-22 22:32:08 +01:00
9482d7bc43 Add model predict test 2024-03-22 22:31:32 +01:00
9ecd51c63a Split python scripts 2024-03-22 20:03:22 +01:00
87db47089e Add output layer to model predict 2024-03-21 23:22:12 +01:00
90fb104dae Implement output layer 2024-03-21 23:07:46 +01:00
e46d5d3f76 Update README 2024-03-21 00:26:11 +01:00
a9d0a0832a Change model input layer creation 2024-03-21 00:24:49 +01:00
af6838e8ae Initial model implementation 2024-03-20 22:31:39 +01:00
6f4cdf3792 Implement avg pool test 2024-03-20 21:57:22 +01:00
dfff0360d9 Implement max pooling test 2024-03-20 21:44:04 +01:00
c062e89972 Use 3d memory layout for pooling 2024-03-20 19:21:30 +01:00
5860faf85e Use 3d memory layout for convolution 2024-03-20 19:15:27 +01:00
ef63cbd9f1 Implement avg pooling 2024-03-19 22:33:43 +01:00
a0fc1b00ae Implement max pooling layer 2024-03-19 22:04:58 +01:00