
TFLite Interpreter


tflite/guide/inference.md

The model is created by reinterpreting raw memory directly: the flatbuffer is accessed in place rather than deserialized into a separate structure.
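
A minimal sketch of this in-place mapping through the public API, assuming `buffer_data`/`buffer_size` hold the serialized `.tflite` bytes:

```cpp
#include <memory>

#include "tensorflow/lite/model.h"

// BuildFromBuffer does not copy or deserialize the bytes; it keeps pointing
// into the caller-owned buffer and reinterprets it as the tflite::Model
// flatbuffer.
std::unique_ptr<tflite::FlatBufferModel> model =
    tflite::FlatBufferModel::BuildFromBuffer(buffer_data, buffer_size);
```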

call graph

Interpreter::Invoke
Subgraph::Invoke
Subgraph::OpInvoke
TfLiteRegistration::invoke
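
At the bottom of the chain, `invoke` is just the function pointer each kernel registers; a minimal custom-op sketch (the `MyOp*` names are hypothetical):

```cpp
#include "tensorflow/lite/c/common.h"

// Each kernel fills a TfLiteRegistration with function pointers;
// Subgraph::OpInvoke ends up calling registration->invoke(context, node)
// for every node.
TfLiteStatus MyOpPrepare(TfLiteContext* context, TfLiteNode* node) {
  return kTfLiteOk;  // resize/allocate output tensors here
}

TfLiteStatus MyOpEval(TfLiteContext* context, TfLiteNode* node) {
  return kTfLiteOk;  // the actual computation happens here
}

TfLiteRegistration* Register_MY_OP() {
  static TfLiteRegistration r = {/*init=*/nullptr, /*free=*/nullptr,
                                 /*prepare=*/MyOpPrepare, /*invoke=*/MyOpEval};
  return &r;
}
```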

Op Registration

In TFLite, ops are registered with the interpreter.

  1. lite/python/optimize/calibrator_wrapper.cc:CreateWrapperCPPFromBuffer > lite/kernels/register.cc:BuiltinOpResolver
    1. BuiltinOpResolver creates the default registrations
  2. the resolver is used by BuildLocalIndexToRegistrationMapping
    1. which creates flatbuffer_op_index_to_registration_
  3. the mapping is used by model.cc:InterpreterBuilder::ParseNodes
    1. which retrieves the registration from the op index
    2. the registration and node are added by Subgraph::AddNodeWithParameters
  4. the node and registration are used in Invoke (see the sketch below)
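
A hedged end-to-end sketch of that flow through the public API (the model path is illustrative), following tflite/guide/inference.md: the resolver supplies the registrations that the builder attaches to each parsed node, and `Invoke` then walks them.

```cpp
#include <memory>

#include "tensorflow/lite/interpreter.h"
#include "tensorflow/lite/kernels/register.h"
#include "tensorflow/lite/model.h"

int main() {
  // Load the flatbuffer model (path is illustrative).
  auto model = tflite::FlatBufferModel::BuildFromFile("model.tflite");

  // BuiltinOpResolver holds the default TfLiteRegistration for each builtin op.
  tflite::ops::builtin::BuiltinOpResolver resolver;

  // InterpreterBuilder parses the nodes and attaches the matching registrations
  // (BuildLocalIndexToRegistrationMapping + ParseNodes under the hood).
  std::unique_ptr<tflite::Interpreter> interpreter;
  tflite::InterpreterBuilder(*model, resolver)(&interpreter);

  interpreter->AllocateTensors();
  interpreter->Invoke();  // walks the nodes, calling each registration's invoke
  return 0;
}
```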

when is min/max updated?

  1. the info is stored in lite/schema:TensorT::quantization
  2. it is moved there by lite/tools/optimize/calibration/calibration_reader.cc:AddCalibrationToModel
    1. from logger_->GetCalibrationValues
  3. the calibrator is created in lite/tools/optimize/calibration/calibrator.cc:BuildLoggingInterpreter
    1. in GetCalibratorRegistry()->CreateCalibrator
  4. the Calibrator is based on the results stored in the LoggingOpResolver
  5. the LoggingOpResolver overwrites TfLiteRegistration::invoke
    1. with calibrator.cc:LoggingEval
    2. the calibrator is retrieved from the context (singleton)
  6. LoggingEval calls LogTensorValue
    1. which updates the tensor stats map (calibration_logger.h:Update)
    2. with the observed min and max values (see the sketch below)
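
A rough sketch of the kind of per-tensor stat update that step 6 describes (class and member names here are illustrative, not the actual calibration_logger.h code):

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <limits>

// Running min/max over every value seen for one tensor across Invoke calls.
class TensorStats {
 public:
  void Update(const float* values, size_t length) {
    for (size_t i = 0; i < length; ++i) {
      if (std::isnan(values[i])) continue;  // ignore NaNs in the stats
      min_ = std::min(min_, values[i]);
      max_ = std::max(max_, values[i]);
    }
  }

  float min() const { return min_; }
  float max() const { return max_; }

 private:
  float min_ = std::numeric_limits<float>::max();
  float max_ = std::numeric_limits<float>::lowest();
};
```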

how does tensor get indexed by int?

in TensorFlow, tensors are referenced by string names

todo: how does this work in tflite
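
For reference, the public C++ API already exposes tensors by integer index; a small sketch, assuming an `interpreter` built as above:

```cpp
#include <cstdio>
#include <vector>

#include "tensorflow/lite/interpreter.h"

// The input/output lists are vectors of tensor indices into the subgraph's
// tensor table; string names are only carried along as metadata on TfLiteTensor.
void PrintInputTensors(tflite::Interpreter* interpreter) {
  const std::vector<int>& inputs = interpreter->inputs();
  for (int index : inputs) {
    const TfLiteTensor* tensor = interpreter->tensor(index);
    std::printf("input tensor %d: %s\n", index, tensor->name);
  }
}
```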

special ops

add

scale and zero point

Reference

calculation

Reference
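
A hedged sketch of the calculation in the float domain, derived from the affine relation real = scale * (q - zero_point). The real kernel uses fixed-point multipliers and clamps to the quantized range; all names here are illustrative.

```cpp
#include <cmath>
#include <cstdint>

// Float-domain version of quantized addition:
//   q_out = z_out + (s1*(q1 - z1) + s2*(q2 - z2)) / s_out
// (clamping to the quantized range omitted for brevity).
int32_t QuantizedAdd(int32_t q1, float s1, int32_t z1,
                     int32_t q2, float s2, int32_t z2,
                     float s_out, int32_t z_out) {
  const float real_sum = s1 * (q1 - z1) + s2 * (q2 - z2);
  return z_out + static_cast<int32_t>(std::round(real_sum / s_out));
}
```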

concat

scale and zero point are taken as arrays, one entry per input

Reference

if the input and output scales are the same, the data is copied with memcpy; if they differ, the values are requantized to the output scale (see the sketch below)

Reference
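
A sketch of that branch for one uint8 concat input (illustrative names; the actual kernel lives in lite/kernels/concatenation.cc):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstring>

// Copy one concatenated input into the output buffer: raw memcpy when the
// quantization parameters already match, otherwise requantize per value.
void CopyConcatInput(const uint8_t* in, size_t n, float in_scale, int32_t in_zp,
                     uint8_t* out, float out_scale, int32_t out_zp) {
  if (in_scale == out_scale && in_zp == out_zp) {
    std::memcpy(out, in, n);  // same scale/zero point: bytes carry over directly
    return;
  }
  for (size_t i = 0; i < n; ++i) {
    const float real = in_scale * (static_cast<int32_t>(in[i]) - in_zp);
    int32_t q = out_zp + static_cast<int32_t>(std::round(real / out_scale));
    out[i] = static_cast<uint8_t>(std::min<int32_t>(255, std::max<int32_t>(0, q)));
  }
}
```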