Object Segmentation Technology: Smoother Background Processing Effects

Janette Zhou
Published in ZEGOCLOUD-DEV
2 min read · Jul 17, 2023

The rise of online communication has led to increasing demands for higher-quality real-time interaction. Along with fundamental technical capabilities such as sound and video quality, more personalized and refined demands have emerged, including the need to process the background of interactive users. This is where object segmentation technology comes in.

Object segmentation is an add-on capability provided by the ZEGO Express SDK, which uses AI algorithms to identify the content in a video frame and assigns transparency information to each pixel. The pixels in the object area are “opaque,” while the pixels outside are set as “transparent,” creating a visual effect of separating the object from the original video.
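To make the idea of per-pixel transparency concrete, here is a minimal sketch in plain Python. It is not the ZEGO Express SDK API: the function `apply_mask` and the data layout (rows of RGB tuples plus a boolean mask) are hypothetical, and the mask is assumed to come from whatever model identifies the object.

```python
# Hypothetical helper, not part of any SDK: turn an object mask into RGBA pixels.
# `frame` is a list of rows of (r, g, b) tuples; `mask` has the same shape and
# holds True where the pixel belongs to the object.

def apply_mask(frame, mask):
    """Return RGBA rows: alpha 255 (opaque) inside the object, 0 (transparent) outside."""
    return [
        [(r, g, b, 255 if inside else 0)
         for (r, g, b), inside in zip(row, mask_row)]
        for row, mask_row in zip(frame, mask)
    ]

frame = [[(200, 150, 120), (30, 90, 40)],
         [(210, 160, 130), (25, 85, 35)]]
mask = [[True, False],
        [True, False]]

rgba = apply_mask(frame, mask)
```

The color values are left untouched; only the fourth channel records whether a pixel is part of the object.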

ZEGOCLOUD provides two segmentation capabilities: “green-screen background segmentation” and “arbitrary background segmentation.” Arbitrary segmentation refers to separating the object from the actual scene, while green-screen segmentation separates the object from a background with a green screen installed. Different functionalities can be achieved by processing the transparency information of the segmented pixels in different ways.

To improve the accuracy of video segmentation, ZEGOCLOUD used a large amount of video data as background data, and the algorithm takes into account the relationship between each frame and the frames before and after it. This helps keep the segmentation result consistent from frame to frame and improves the overall visual experience.
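Why do neighbouring frames help? The sketch below is only an illustration of the general idea, not ZEGOCLOUD's algorithm: it blends each pixel's alpha with the same pixel in the previous and next frames, so a one-frame flicker in the mask is softened. The function name `smooth_alpha` and the weights are assumptions chosen for the example.

```python
# Illustrative temporal smoothing of an alpha mask (assumed weights, not a real model).
# Each argument is a list of rows of alpha values in the range 0..255.

def smooth_alpha(prev, cur, nxt, weights=(0.25, 0.5, 0.25)):
    """Blend each pixel's alpha with the same pixel in the neighbouring frames."""
    wp, wc, wn = weights
    return [
        [round(wp * p + wc * c + wn * n) for p, c, n in zip(pr, cr, nr)]
        for pr, cr, nr in zip(prev, cur, nxt)
    ]

prev = [[255, 255, 0]]
cur = [[255, 0, 0]]   # the middle pixel drops out of the mask for one frame
nxt = [[255, 255, 0]]

smoothed = smooth_alpha(prev, cur, nxt)
```

In `smoothed`, the middle pixel ends up partly opaque instead of vanishing, while pixels that agree across all three frames are unchanged. Note that using the next frame in a filter like this means buffering one frame before output.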

For green-screen segmentation, ZEGOCLOUD also developed a highly lightweight AI-based cutout algorithm. The model is only 1 KB in size, and on the Snapdragon 855 platform the CPU time is 2 ms and the GPU time is 1 ms. To solve the green spill problem, where green light from the screen tints the edges of the object, a dedicated green suppression module was built into the algorithm.
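For readers unfamiliar with green-screen cutout and green spill, here is a classical, non-AI chroma key for a single pixel. It is a swapped-in textbook technique for illustration only, not ZEGOCLOUD's model; the function `key_pixel` and the threshold value are assumptions.

```python
# Classical chroma key with simple spill suppression (illustrative only).

def key_pixel(r, g, b, threshold=40):
    """Return (r, g, b, alpha) for one pixel shot in front of a green screen."""
    greenness = g - max(r, b)                   # how strongly green dominates
    alpha = 0 if greenness > threshold else 255
    # Spill suppression: cap green at the stronger of red and blue, so a
    # green tint reflected onto the subject is neutralised.
    g = min(g, max(r, b))
    return (r, g, b, alpha)

screen_pixel = key_pixel(20, 230, 30)      # pure green-screen background
skin_pixel = key_pixel(200, 210, 150)      # subject pixel with a slight green cast
```

The background pixel comes back fully transparent, while the subject pixel stays opaque with its green channel pulled down to match red.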

ZEGOCLOUD uses algorithms to separate the main object from the background, enabling different background processing methods, including blurred background, virtual background, and transparent background. Virtual and transparent backgrounds offer greater playability than blurring alone, and the effects can be applied in almost all scenarios.
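The three background modes differ only in what is placed behind the transparent pixels. The following sketch works on a single row of pixels to stay short; `blend`, `box_blur_row`, `process_row`, and the mode names are hypothetical, and the tiny 1-D box blur stands in for a real 2-D blur.

```python
# Illustrative background processing driven by per-pixel alpha (0..255).

def blend(fg, bg, alpha):
    """Alpha-blend two RGB pixels; alpha 255 keeps the foreground, 0 keeps the background."""
    return tuple((alpha * f + (255 - alpha) * b) // 255 for f, b in zip(fg, bg))

def box_blur_row(row, radius=1):
    """Average each pixel with its neighbours within `radius` (1-D stand-in for a blur)."""
    out = []
    for i in range(len(row)):
        window = row[max(0, i - radius): i + radius + 1]
        out.append(tuple(sum(p[c] for p in window) // len(window) for c in range(3)))
    return out

def process_row(row, alphas, mode, virtual_row=None):
    if mode == "transparent":
        # Keep alpha so the renderer can show whatever lies underneath.
        return [px + (a,) for px, a in zip(row, alphas)]
    background = box_blur_row(row) if mode == "blur" else virtual_row
    return [blend(px, bg, a) for px, bg, a in zip(row, background, alphas)]

row = [(200, 150, 120), (30, 90, 40), (40, 100, 50)]
alphas = [255, 0, 0]                  # first pixel is the object
virtual = [(0, 0, 255)] * 3           # a solid blue virtual background

blurred = process_row(row, alphas, "blur")
replaced = process_row(row, alphas, "virtual", virtual)
cutout = process_row(row, alphas, "transparent")
```

In all three cases the object pixel is preserved; only the treatment of the pixels with zero alpha changes.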

Combined with Alpha data transmission and rendering, object segmentation extends to more multi-person interactive scenarios. Because the transparency information travels with the video, the receiving side can place several users in the same frame for real-time interaction, or overlay a presenter on presentation content without obstructing it.
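How can transparency travel with a video stream? One common approach, shown here purely as an assumption and not as ZEGOCLOUD's documented wire format, is to pack the alpha mask as a grayscale image next to the color image, so a pipeline that only handles opaque frames can still carry it. The function names below are hypothetical.

```python
# Illustrative side-by-side alpha packing for one row of pixels.

def pack_side_by_side(rgba_row):
    """Encode an RGBA row as an opaque RGB row twice as wide: color first, then alpha as gray."""
    color = [(r, g, b) for r, g, b, _ in rgba_row]
    alpha = [(a, a, a) for _, _, _, a in rgba_row]
    return color + alpha

def unpack_side_by_side(rgb_row):
    """Recover the RGBA row on the receiving side."""
    half = len(rgb_row) // 2
    return [(r, g, b, gray[0])
            for (r, g, b), gray in zip(rgb_row[:half], rgb_row[half:])]

sent = [(200, 150, 120, 255), (30, 90, 40, 0)]
packed = pack_side_by_side(sent)
received = unpack_side_by_side(packed)
```

This round trip is lossless only because nothing compresses the data in between; a real video codec would alter the values slightly, so a receiver would typically treat the recovered alpha as approximate.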

In conclusion, object segmentation technology gives businesses personalized background processing capabilities, a need that has become increasingly pressing as real-time interaction grows. ZEGOCLOUD’s technology offers a range of solutions that can help businesses enhance their real-time interaction experiences and meet the demands of users in various scenarios.

Learn More about ZEGOCLOUD Object Segmentation: https://www.zegocloud.com/blog/object-segmentation
