AxGazeEstimation : A Machine Learning Model for Estimating Gaze

This is an introduction to AxGazeEstimation, a machine learning model that can be used with ailia SDK. You can easily use this model, along with many other ready-to-use ailia MODELS, to create AI applications with ailia SDK.


AxGazeEstimation is a machine learning model developed by ax Inc. that estimates the direction of a person's gaze from an input image.



AxGazeEstimation uses BlazeFace to detect faces in an image, then estimates the gaze using each detected face as input. Two methods of gaze estimation are available: direct estimation from the face image, and estimation from the face image combined with the face orientation.
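The two-stage flow described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual AxGazeEstimation code: `detect_faces`, `gaze_model`, and `head_pose_model` are hypothetical callables standing in for BlazeFace and for the gaze and face orientation networks, and the 20% crop margin is an assumed value.

```python
import numpy as np


def crop_face(image, box, margin=0.2):
    """Crop a face from an HxWxC image.

    `box` is (x1, y1, x2, y2) in pixels. The box is enlarged by `margin`
    (an assumed value) on each side and clipped to the image bounds.
    """
    h, w = image.shape[:2]
    x1, y1, x2, y2 = box
    mx = (x2 - x1) * margin
    my = (y2 - y1) * margin
    x1 = max(0, int(round(x1 - mx)))
    y1 = max(0, int(round(y1 - my)))
    x2 = min(w, int(round(x2 + mx)))
    y2 = min(h, int(round(y2 + my)))
    return image[y1:y2, x1:x2]


def estimate_gaze(image, detect_faces, gaze_model, head_pose_model=None):
    """Run face detection, then gaze estimation on each detected face.

    All three callables are hypothetical placeholders:
      detect_faces(image)    -> iterable of (x1, y1, x2, y2) boxes
      gaze_model(face)       -> gaze estimate (direct method)
      gaze_model(face, pose) -> gaze estimate (face orientation method)
      head_pose_model(face)  -> face orientation estimate
    """
    results = []
    for box in detect_faces(image):
        face = crop_face(image, box)
        if head_pose_model is None:
            # Method 1: estimate the gaze directly from the face image.
            gaze = gaze_model(face)
        else:
            # Method 2: combine the face image with the face orientation.
            gaze = gaze_model(face, head_pose_model(face))
        results.append((box, gaze))
    return results
```

Passing `head_pose_model=None` selects the direct method, while supplying a face orientation estimator selects the combined method, which mirrors the two options described above.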

The network backbone uses a reduced version of ResNet50 (stage 3).

The model was trained on our in-house dataset of 97,059 training images and 11,775 validation images.


Use the following command to run gaze estimation on a webcam video stream.

$ python3 -v 0

The following command estimates the gaze from the detected face combined with the face orientation (head pose).

$ python3 -v 0 --include-head-pose
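For readers curious how options like these are typically handled, here is a minimal sketch using Python's standard `argparse` module. It is not taken from the actual script: only the `-v` and `--include-head-pose` flags appear in the commands above, while the destination name `video`, the help texts, and the `parse_video_source` helper are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser for the two options shown in the commands above."""
    parser = argparse.ArgumentParser(
        description="Gaze estimation demo (illustrative sketch)"
    )
    # Assumed meaning: a webcam device id such as 0, or a video file path.
    parser.add_argument("-v", dest="video", default=None,
                        help="webcam device id or video file path")
    # Assumed meaning: switch to the method that also uses face orientation.
    parser.add_argument("--include-head-pose", action="store_true",
                        help="combine the face image with the face orientation")
    return parser


def parse_video_source(value):
    """Hypothetical helper: digits become a camera index, anything else stays a path."""
    if value is not None and value.isdigit():
        return int(value)
    return value


# Parse the same arguments as the second command above.
args = build_parser().parse_args(["-v", "0", "--include-head-pose"])
source = parse_video_source(args.video)
```

With these arguments, `source` is the integer camera index and `args.include_head_pose` is set, so a caller could pick between the two estimation methods with a single flag.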

Here is an example of AxGazeEstimation in action.

Related topics

ax Inc. has developed ailia SDK, which enables cross-platform, GPU-based rapid inference.

ax Inc. provides a wide range of services from consulting and model creation, to the development of AI-based applications and SDKs. Feel free to contact us for any inquiry.



