Facebook -- TVM AWS Meetup Talk
Autoregressive sampling net running faster than real time - Compute split between GRU units and FC layers - 24kHz sampling frequency requires 40us sampling net runtime - First PyTorch model used
0 points | 11 pages | 3.08 MB | 5 months ago | 3
XDNN TVM - Nov 2019
with FPGA - TVM pipeline needed. CPU/FPGA partitions ideally run in parallel - Post-Process (fc/softmax/nms) - FPGA Acceleration - Pre-Process (resize) - © Copyright 2018 Xilinx - FPGA Pipeline report
0 points | 16 pages | 3.35 MB | 5 months ago | 3
2 results in total