Qualcomm AI Engine Direct - Add fp16a8w quantization config#19537
Conversation
This PR needs a
|
|
Hi @psiddh, Bug Fix PR: Op Enablement PR:
Claude Skill PR: LLM Related PR:
Debugging Related PR:
Others: |
Thanks for listing all the PRs I will start looking at them |
Summary: - Add a pass `insert_cast_for_fp_act_quantized_weight.py` to cast fp32 -> fp16 due to constraint in QNN HTP - Add a test case to run conv2d and linear with fp16a8w
d69272f to
a7fca32
Compare
|
@claude review this pr |
### Summary:
- Add fp16a8w quantization config
- Note that fp16a8w is only supported with Conv2d (kernel size = 1) and
Linear by QNN HTP
- Add a pass `insert_cast_for_fp_act_quantized_weight.py` to cast fp32
-> fp16 due to constraint in QNN HTP
- Add a test case to run conv2d and linear with fp16a8w
### Test plan
```
python3 backends/qualcomm/tests/test_qnn_delegate.py TestQNNFloatingPointOperator.test_qnn_backend_fp16a8w_simple_model -b build-android -H ${HOST} -s ${DEVICE} -m SM8750 -r /path/to/executorch -a /path/to/artifacts
```

Summary:
insert_cast_for_fp_act_quantized_weight.pyto cast fp32 -> fp16 due to constraint in QNN HTPTest plan