Lowest latency machine learning inference accelerator. Outperforms GPUs where latencies of tens of microseconds or less are required. SDK enables ML developers to compile their models and run them on VOLLO directly from PyTorch or TensorFlow and without requiring any FPGA expertise or tools. For those who do have FPGA expertise and tools, an FPGA netlist version is also available.