Description: The inference model is a mathematical representation used to make predictions based on input data. This type of model is grounded in statistics and machine learning, where the goal is to establish relationships between variables to infer outcomes. Inference models can be linear or nonlinear, and their complexity varies depending on the amount of data and the nature of the relationships being modeled. Their main characteristic is the ability to generalize patterns from previous examples, allowing predictions to be made on unseen data. In the context of edge inference, these models are particularly relevant as they enable data processing on local devices, minimizing latency and bandwidth usage. This is crucial in applications where response speed is essential, such as surveillance systems, IoT devices, and mobile applications. The implementation of inference models in various technological domains also poses challenges, such as the need to optimize performance and energy efficiency, which in turn drives innovation in algorithms and hardware.