| Commit | Message | Date |
|---|---|---|
| 33a3c6afcb | Fix batch norm layer | 2024-05-14 21:42:01 +02:00 |
| da8f3167cb | Add utils vector mean function | 2024-05-14 21:41:18 +02:00 |
| 5c8d3f7e25 | Compute mean and variance | 2024-04-29 20:55:11 +02:00 |
| 0ab623fa23 | Implement vector mean calculation | 2024-04-28 22:04:15 +02:00 |
| 26c12dafc0 | Update README | 2024-04-28 21:47:12 +02:00 |
| 96536a44fd | Implement batch norm test | 2024-04-28 21:46:42 +02:00 |
| f60d62f6bd | Implement batch norm layer | 2024-04-28 19:58:00 +02:00 |
| 3320f610db | Add imagenet class map | 2024-04-28 19:03:51 +02:00 |
| 1f708f3ea3 | Change vector scalar math kernels | 2024-04-28 19:03:06 +02:00 |
| 69111f6cb1 | Add version to model bin format | 2024-04-23 19:53:48 +02:00 |
| 1592f06121 | Add opencv to alexnet usage | 2024-04-22 22:56:29 +02:00 |
| 88e852a6e5 | Fix missing predict argument | 2024-04-22 22:04:46 +02:00 |
| 157d4c736b | Add Alexnet README | 2024-04-22 21:59:08 +02:00 |
| 9a9b034ce5 | Add torch predict function | 2024-04-22 21:58:32 +02:00 |
| f3ba519229 | Fix alexnet normalization | 2024-04-22 21:48:53 +02:00 |
| 74262309ad | Fix max pooling output size calc | 2024-04-22 20:58:02 +02:00 |
| c2acad151b | Implement simple model validation | 2024-04-22 20:57:40 +02:00 |
| f17debc244 | Implement getOutputSize and getInputSize for seq layers | 2024-04-22 20:31:58 +02:00 |
| fbf6c44bdd | Remove extra cudaFree | 2024-04-22 19:05:02 +02:00 |
| a32c737785 | Allocate activation on heap | 2024-04-22 18:59:16 +02:00 |
| 26cea9b12c | Fix more cuda memcheck errors | 2024-04-21 22:56:55 +02:00 |
| 58af95eb25 | Remove cudaDeviceReset calls from tests | 2024-04-21 22:47:12 +02:00 |
| bdbb3f978e | Fix matmul and max reduce memcheck errors | 2024-04-21 22:11:02 +02:00 |
| 9a6152469a | Update activation test | 2024-04-21 14:00:43 +02:00 |
| 942ee6a32b | Add layer name to vector | 2024-04-21 12:20:02 +02:00 |
| 0170afaf3f | Improve cuda error handling | 2024-04-21 12:19:19 +02:00 |
| d64a28bc9c | Fix model weights export | 2024-04-21 00:05:56 +02:00 |
| 9c5d853b75 | Fix bin file seek offset | 2024-04-20 21:30:01 +02:00 |
| 5e663b9029 | Fix bias in conv layer | 2024-04-20 19:09:00 +02:00 |
| d08567a563 | Fix weigh bias parsing and better error logging | 2024-04-20 18:36:53 +02:00 |
| ecf7416f8e | Rework padding size setting | 2024-04-20 16:31:28 +02:00 |
| dbaab5652e | Ignore .bin files | 2024-04-20 15:51:02 +02:00 |
| 0807a0f2b8 | Export pretrained alexnet | 2024-04-20 15:50:12 +02:00 |
| ab10959f35 | Start implementing AlexNet | 2024-04-18 22:31:14 +02:00 |
| ef3a62f399 | Remove hardcoded cuda architecture | 2024-04-18 20:44:32 +02:00 |
| c1d93bd008 | Update README | 2024-04-16 21:41:10 +02:00 |
| 432adf57bd | Test model weights loading | 2024-04-16 21:07:06 +02:00 |
| 9fb9d7e8e1 | Implement getting layer, weights and biases | 2024-04-16 19:09:41 +02:00 |
| f4ae45f867 | Start implementing weights import | 2024-04-15 22:17:48 +02:00 |
| d8c50116e8 | Implement pytorch weights export | 2024-04-15 22:17:14 +02:00 |
| 13b455e4ba | Implement test multiple inference | 2024-04-14 00:19:29 +02:00 |
| b20ade27d8 | Implement model destructor | 2024-04-14 00:05:57 +02:00 |
| 53c976733b | Refactor model test | 2024-04-14 00:05:32 +02:00 |
| 457e0f669f | Implement model test | 2024-04-13 13:31:29 +02:00 |
| 18522c2dea | Cleanup and refactor | 2024-04-11 22:52:41 +02:00 |
| 4b9d123e94 | Implement device vector utils | 2024-04-11 22:22:33 +02:00 |
| 710a33bdde | Move softmax partial kernels to matmul | 2024-04-11 22:01:47 +02:00 |
| bf7c961b9e | Add cudaDeviceReset at the end of each test | 2024-04-11 19:55:02 +02:00 |
| bc86ed1782 | Add activation to pooling layers | 2024-04-11 19:50:54 +02:00 |
| e86e04f6d6 | Add clearing kernel | 2024-04-11 19:49:09 +02:00 |