quantization.prepare

horizon_plugin_pytorch.quantization.prepare (model: Module, 
example_inputs: Any | None = None, qconfig_setter: Tuple[QconfigSetterBase, ...] | 
QconfigSetterBase | QconfigSetter | None = None, method: PrepareMethod = PrepareMethod.JIT_STRIP, 
example_kw_inputs: Any | None = None, check_result_dir: str | None = None, *, 
fuse_mode: FuseMode | None = None)

Prepare model.

Prepare and check a copy of the model for QAT.

Parameters:

model (Module) – Model to be prepared.

example_inputs (Optional[Any]) – Model inputs. Used to trace and check model.

qconfig_setter (Union[Tuple[QconfigSetterBase, ...], QconfigSetterBase, QconfigSetter, None]) – Qconfig setter. Used to set qconfig.

method (PrepareMethod) –

Method used to trace model, availiable options are:

PrepareMethod.EAGER: Don’t trace.

PrepareMethod.JIT_STRIP: Use jit trace and strip the graph outside QuantStub and Dequantstub.

example_kw_inputs (Optional[Any]) – Model keyword inputs. Used to trace and check model.

check_result_dir (Optional[str]) – Directory to save qat check result txt.

fuse_mode (Optional[FuseMode]) –

Control op fusion on compute graph, availiable options are:

FuseMode.OnlyBN: Only fuse conv + bn, add and relu can be handled by qconfig template.

None: Automatically choose from FuseMode.OnlyBN and FuseMode.BNAddReLU according to current qconfig template.

FuseMode.BNAddReLU: Fuse conv + bn + add + relu.

FuseMode.NoFuse: Not do any fusion.

Return type: Module

PTQ转换工具

hb_compile工具

PTQ转换步骤

PTQ转换示例

常见问题及故障处理

附录

开发指南

深入探索

API参考

QAT

模型导出

Horizon算子

常见问题及常见故障

模型推理开发

模型推理API手册

数据结构

功能接口

模型推理工具介绍

hrt_model_exec工具介绍

hbm_infer工具介绍

UCP通用API介绍

数据结构

功能接口

UCP性能分析工具

常见问题及错误码

模型部署原理及流程

模型部署实践指导实例

HMCT API Reference

工具链算子支持约束列表

算子支持列表

算子BPU约束列表

社区优质文章

quantization.prepare

hb_compile工具

QAT

模型导出

Horizon算子

模型推理API手册

数据结构

功能接口

模型推理工具介绍

hrt_model_exec工具介绍

hbm_infer工具介绍

数据结构

功能接口

算子支持列表

算子BPU约束列表

#quantization.prepare

quantization.prepare