Artificial Intelligence in Real-Time Video Encoding from Theoretical Promises to Operational Gains (2022)

By Jan De Cock, Synamedia

Many books and articles have been written about artificial intelligence (AI) and machine learning (ML), in a variety of applications. ML is far from new, has an established theoretical foundation, and lots of different types of ML techniques have been introduced over the past decades. These techniques can be classified in different ways, but a full taxonomy is outside of the scope of this article. In this paper, we focus mostly on ML algorithms, as a subset of AI. Excellent introductions and overviews have been provided in e.g. [Goodfellow16, Bishop95].

Lots of successes have been claimed based on ML, and reports of AI intelligence are already the subject of ethical discussions [Google22]. Still, the powers of machine learning are not always a solution, and in many applications, even though they make for an interesting marketing statement, they do not lead to net gains or operational savings.

Machine learning has powerful applications in computer vision, image and video processing, and approaches using deep neural networks have become the center of academic and industry research. For example, residual neural networks have shown impressive results for image classification and recognition [Simonyan14, He16]. Still in most of these cases, very complex algorithms are needed, requiring e.g. deep neural networks containing dozens or hundreds of layers. While it’s acceptable to have a very complex training stage (which needs to be executed once), it’s primarily the complexity of the inference network (which needs to be repeated many times) that determines the feasibility of ML approaches. An important unit of expressing the complexity of ML inference networks is the number of multiply-accumulate operations (MACs). Some of the best-performing image recognition networks use millions of MACs per image.

Often, new approaches are deemed feasible when they can be run on state-of-the-art GPUs inside a server. In certain cases, this is acceptable, and the cost of a dedicated CPU or GPU is warranted. For real-time, cost-sensitive applications, however, this is not an option. In typical video encoding/transcoding set-ups, dozens or even hundreds of channels need to be processed on a single server, and the cost per channel is a crucial criterion. Furthermore, the latency of offloading decisions to accelerators (if they would be cost effective, which is not the case), would be prohibitive.

In this paper, we discuss the applicability of machine learning approaches in different areas of real-time video compression. We successively cover encoder complexity reduction, rate control, video quality improvements and video quality measurement. In each of these areas, we have studied ways to reduce the complexity of ML inference, to end up with algorithms that are applicable in real-time, cost-sensitive applications.

By clicking the "Download Paper" button, you are agreeing to our terms and conditions.

Similar Papers

Real-Time Analytics for IP Video Multicast
By Dr. Claudio Righetti, Emilia Gibellini, Florencia De Arca, Mariela Fiorenzo & Gabriel Carro, Telecom Argentina
2018
Low Complexity Real-time Video Encoding for Soft Set-Top Box Platforms
By Krasimir Kolarov, Feei Chung, William Lynch, Interval Research Corporation
2000
Can Future Networks Survive Without Artificial Intelligence?
By Dr. Claudio Righetti, Emilia Gibellini, Carlos Germán Carreño Romano & Gabriel Carro; Telecom Argentina S.A.
2019
Artificial Intelligence In Cable TV Applications
By Louis P. Slothouber and Aaron Ye, BIAP Systems, Inc.
2003
Augmented Intelligence: Next Level Network and Services Intelligence
By Dr. Claudio Righetti, Mariela Fiorenzo, Omar Hurtado & Gabriel Carro, Telecom Argentina S.A.
2020
Augmented Reality and Artificial Intelligence Approaches for Inventory Synchronization
By Salvatore (Sam) Torrente, Petar Djukic, Dmitri Fedorov, Mehran Bagheri & Marco Naveda, Ciena
2021
Predicting Service Impairments from Set-top Box Errors in Near Real-Time and What to Do About It
By Justin Watson, Comcast; Roger Brooks, Andrew Colby, Pankaj Kumar, Anant Malhotra & Mudit Jain, Guavus, Inc.
2018
Improving Operational Intelligence for Maintaining Cable Networks
By Mike Spaulding, Comcast Corporation; Larry Wolcott, Comcast Corporation; Jason Rupe, CableLabs
2022
A PNM System Using Artificial Intelligence, HFC Network Impairment, Atmospheric and Weather Data to Predict HFC Network Degradation and Avert Customer Impact
By Larry Wolcott, Michael O'Dell, Peter Kuykendall, Vishnu Gopal, Jason Woodrich & Nick Pinckernell, Comcast
2018
Artificial Intelligence (AI)-Based Cross-Platform Node Name Mapping (Cmap-NN)
By Jordan Kupersmith, Cox Communications; Wei Cai, Cox Communications; Jeonpaolo Barvez, Cox Communications; Pavan Chandrashekar, Cox Communications; Richard Brown, Cox Communications
2023
More Results >>