qconfig

horizon_plugin_pytorch.quantization.get_qconfig (observer: 
~typing.Type[~horizon_plugin_pytorch.quantization.observer_v2.ObserverBase] = 
<class 'horizon_plugin_pytorch.quantization.observer_v2.MinMaxObserver'>, 
in_dtype: ~torch.dtype | ~horizon_plugin_pytorch.dtype.QuantDType | None = None, 
weight_dtype: ~torch.dtype | ~horizon_plugin_pytorch.dtype.QuantDType | None = 'qint8', 
out_dtype: ~torch.dtype | ~horizon_plugin_pytorch.dtype.QuantDType | None = 'qint8', 
fix_scale: bool = False)

Get qconfig.

Parameters:

observer (Type[ObserverBase]) – observer type for input and output. Support MinMaxObserver and MSEObserver.

in_dtype (Union[dtype, QuantDType, None]) – input dtype.

weight_dtype (Union[dtype, QuantDType, None]) – weight dtype.

out_dtype (Union[dtype, QuantDType, None]) – output dtype.

fix_scale (bool) – Whether fix input/output scale.

PTQ转换工具

hb_compile工具

PTQ转换步骤

PTQ转换示例

常见问题及故障处理

附录

开发指南

深入探索

API参考

QAT

模型导出

Horizon算子

常见问题及常见故障

模型推理开发

模型推理API手册

数据结构

功能接口

模型推理工具介绍

hrt_model_exec工具介绍

hbm_infer工具介绍

UCP通用API介绍

数据结构

功能接口

UCP性能分析工具

常见问题及错误码

模型部署原理及流程

模型部署实践指导实例

HMCT API Reference

工具链算子支持约束列表

算子支持列表

算子BPU约束列表

社区优质文章

qconfig

hb_compile工具

QAT

模型导出

Horizon算子

模型推理API手册

数据结构

功能接口

模型推理工具介绍

hrt_model_exec工具介绍

hbm_infer工具介绍

数据结构

功能接口

算子支持列表

算子BPU约束列表

#qconfig

qconfig