KWS:
The new Thresholding node (input quantization) comes before the flatten (realized implicitly by the DWC), so its maximum PE is 1, leading to an initiation interval of 490 cycles, which becomes the new bottleneck:
"Thresholding_rtl_0": 490,
"MVAU_hls_0": 392,
"MVAU_hls_1": 256,
"MVAU_hls_2": 256,
"MVAU_hls_3": 384,
"LabelSelect_hls_0": 13
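The bottleneck claim above can be checked directly from the reported per-node intervals. A minimal sketch (the dict literal just mirrors the numbers pasted above; it is not produced by any FINN API):

```python
# Per-node initiation intervals (cycles) for the KWS model, copied from above.
intervals = {
    "Thresholding_rtl_0": 490,
    "MVAU_hls_0": 392,
    "MVAU_hls_1": 256,
    "MVAU_hls_2": 256,
    "MVAU_hls_3": 384,
    "LabelSelect_hls_0": 13,
}

# The slowest node sets the steady-state throughput of the whole pipeline.
bottleneck = max(intervals, key=intervals.get)
print(bottleneck, intervals[bottleneck])  # → Thresholding_rtl_0 490
```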
bnn-pynq TFC:
The Reshape (a no-op) performs the flattening of the 28x28 input. Its PE is limited to 28, so its lowest achievable interval is 28 cycles. With the current folding this is not a problem (the bottleneck is 64 cycles), but I can imagine cases where it could be.
"Reshape_rtl_0": 28,
"Thresholding_rtl_0": 28,
"MVAU_hls_0": 64,
"MVAU_hls_1": 64,
"MVAU_hls_2": 64,
"MVAU_hls_3": 8,
"LabelSelect_hls_0": 11
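The "lowest possible interval" figure follows from the usual folding arithmetic: with each PE handling one element per cycle, a node needs ceil(elements / PE) cycles per inference. A hedged sketch (the helper name `min_interval` is mine, not a FINN function):

```python
import math

def min_interval(num_elems: int, pe: int) -> int:
    """Lowest achievable initiation interval in cycles, assuming each PE
    consumes one element per cycle: ceil(num_elems / PE)."""
    return math.ceil(num_elems / pe)

# Reshape flattening the 28x28 input, with PE capped at 28:
print(min_interval(28 * 28, 28))  # → 28
```

So even at its maximum PE of 28, the Reshape cannot go below 28 cycles, which is why it would cap throughput if the MVAUs were folded below that.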