Describes the fields where AI systems can be applied, including but not limited to computer vision, machine translation, speech processing, etc.
This layer includes algorithmic models for specific application scenarios, such as:
Provides underlying computational operations such as matrix multiplication and convolution, which are fundamental for building deep learning models.
Used to optimize the computational graph of models to improve execution efficiency and performance.
Involves optimization measures for deploying trained models into a production environment.
Optimization aimed at the model inference process to accelerate the model's prediction speed.
Converts the model's weights and activations from floating-point numbers to integers, reducing model size and improving inference speed.
Transfers knowledge from large, complex models to smaller models through knowledge distillation techniques.
Removes unimportant weights or neurons in the model to simplify its structure without significantly affecting performance.
Provides a variety of deep learning frameworks and engines, such as Tensorflow, Pytorch, ONNX, etc., for model development and training.
Automated machine learning processes for the automatic design and optimization of machine learning models.
Describes the hardware platforms on which AI models can run, including: