Hardware Mapping Limitations of Quantized ViT-B/32  #198

@billangel

Description

The model selected for FPGA deployment is ViT-B/32 (Vision Transformer Base), adapted for classification on the 200 classes of the Tiny-ImageNet dataset. Quantization was performed with Xilinx's Brevitas library, using Quantization-Aware Training (QAT) with 8-bit weights and 8-bit activations (a8w8) and per-tensor symmetric quantization. The quantized model was exported in QONNX (Quantized ONNX) format using Brevitas's export_qonnx function. For deployment on the FPGA, the Xilinx FINN framework was used in combination with finn-plus.
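For reference, the per-tensor symmetric scheme named above can be sketched in plain Python. This is an illustrative toy, not Brevitas's actual implementation; the function names are made up for the example:

```python
def quantize_per_tensor_symmetric(x, num_bits=8):
    """Toy per-tensor symmetric quantizer (illustrative, not Brevitas code).

    One scale covers the whole tensor; the zero-point is fixed at 0,
    so the integer range is symmetric around zero.
    """
    qmax = 2 ** (num_bits - 1) - 1          # 127 for 8 bits
    scale = max(abs(v) for v in x) / qmax   # single per-tensor scale
    q = [max(-qmax, min(qmax, round(v / scale))) for v in x]
    return q, scale

def dequantize(q, scale):
    """Map integer codes back to the real axis."""
    return [v * scale for v in q]

# Example: the largest magnitude (1.27) pins the scale at 1.27 / 127.
weights = [0.5, -1.27, 0.03, 1.0]
q, scale = quantize_per_tensor_symmetric(weights)
```

Because the scale is shared across the tensor, values much smaller than the maximum get few effective levels; that trade-off is what distinguishes per-tensor from per-channel quantization.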
The results of the completed build are shown below; the screenshots are taken from the final step of the build process. As can be seen, many nodes could not be absorbed into hardware operators: colored nodes represent hardware operators, while uncolored nodes represent software nodes executing on the CPU. Out of a total of 959 nodes, only 51 were successfully mapped to hardware operators; the remaining 908 remain as software nodes executing on the CPU.
As a first proposed solution, re-exporting the model from Brevitas with ONNX opset version 11 is recommended. This should address the root cause of the hardware mapping limitation, as opset 11 does not support the fused LayerNormalization node introduced in opset 17, so the export would emit the equivalent chain of primitive ops instead.
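For context, a fused LayerNormalization node computes the same math that an opset-11 export expresses as a chain of primitive ONNX ops (ReduceMean, Sub, Pow, ReduceMean, Add, Sqrt, Div, Mul, Add), which the compiler can then handle node by node. A minimal illustrative sketch of that decomposition in plain Python (not FINN or ONNX code):

```python
import math

def layernorm_decomposed(x, gamma, beta, eps=1e-5):
    """LayerNorm over a single vector, written as the primitive steps
    an opset-11 ONNX export emits instead of one fused node.
    Comments name the corresponding ONNX ops (illustrative only).
    """
    n = len(x)
    mean = sum(x) / n                           # ReduceMean
    centered = [v - mean for v in x]            # Sub
    var = sum(v * v for v in centered) / n      # Pow + ReduceMean
    denom = math.sqrt(var + eps)                # Add + Sqrt
    normed = [v / denom for v in centered]      # Div
    return [g * v + b                           # Mul + Add (affine params)
            for g, v, b in zip(gamma, normed, beta)]

# Example: normalize one 4-element vector with identity affine params.
out = layernorm_decomposed([1.0, 2.0, 3.0, 4.0], [1.0] * 4, [0.0] * 4)
```

Each step above is an op that existed well before opset 17, which is why an opset-11 export sidesteps the fused node entirely.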

[Screenshots: dataflow graph from the final build step, showing colored (hardware) and uncolored (CPU) nodes]

build_finn_nohw.py
@fpjentzsch

Metadata


Assignees: none
Labels: bug (Something isn't working)
Type: none
Projects: none
Milestone: none
Relationships: none
Development: no branches or pull requests