Your mission
- Optimize deep neural networks for real-time inference on embedded devices (Jetson, FPGA, custom boards)
- Apply quantization, pruning, distillation, and compression techniques for efficient deployment
- Use TensorRT, CUDA, C++...