Artificial Intelligence in Real-Time Video Encoding from Theoretical Promises to Operational Gains (2022)

By Jan De Cock, Synamedia

Many books and articles have been written about artificial intelligence (AI) and machine learning (ML) across a variety of applications. ML is far from new: it has an established theoretical foundation, and many types of ML techniques have been introduced over the past decades. These techniques can be classified in different ways, but a full taxonomy is outside the scope of this paper. Here, we focus mostly on ML algorithms, as a subset of AI. Excellent introductions and overviews are provided in, e.g., [Goodfellow16, Bishop95].

Many successes have been claimed on the basis of ML, and reports of AI exhibiting intelligence are already the subject of ethical discussions [Google22]. Still, machine learning is not always the right solution: in many applications, ML approaches make for an interesting marketing statement but do not lead to net gains or operational savings.

Machine learning has powerful applications in computer vision and in image and video processing, and approaches using deep neural networks have become the center of academic and industry research. For example, very deep convolutional and residual neural networks have shown impressive results for image classification and recognition [Simonyan14, He16]. Still, in most of these cases, very complex algorithms are needed, requiring e.g. deep neural networks containing dozens or hundreds of layers. While it is acceptable to have a very complex training stage (which needs to be executed only once), it is primarily the complexity of the inference network (which is executed many times) that determines the feasibility of ML approaches. An important unit for expressing the complexity of ML inference networks is the number of multiply-accumulate operations (MACs). Some of the best-performing image recognition networks use billions of MACs per image.
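To make the MAC measure concrete, the following minimal sketch counts the multiply-accumulate operations of a single convolutional or fully connected layer. The function names and the example layer shape (a 7x7, stride-2 convolution on a 224x224 RGB image) are illustrative assumptions, not taken from the paper; bias additions and activation functions are ignored.

```python
def conv2d_output_size(in_size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution along one dimension."""
    return (in_size + 2 * padding - kernel) // stride + 1


def conv2d_macs(in_h, in_w, in_ch, out_ch, kernel, stride=1, padding=0):
    """MACs of a standard (non-grouped) 2D convolution with a square kernel.

    Each output sample needs in_ch * kernel * kernel multiply-accumulates.
    """
    out_h = conv2d_output_size(in_h, kernel, stride, padding)
    out_w = conv2d_output_size(in_w, kernel, stride, padding)
    return out_h * out_w * out_ch * in_ch * kernel * kernel


def dense_macs(in_features, out_features):
    """MACs of a fully connected layer: one per weight."""
    return in_features * out_features


# Hypothetical example layer: 7x7 kernel, 3 -> 64 channels, stride 2, padding 3.
stem_macs = conv2d_macs(224, 224, 3, 64, kernel=7, stride=2, padding=3)
print(stem_macs)  # 118013952
```

Under these assumptions, one early layer alone already costs on the order of 10^8 MACs per image. Summing such counts over all layers gives the per-inference complexity of a network.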

Often, new approaches are deemed feasible when they can run on state-of-the-art GPUs inside a server. In certain cases this is acceptable, and the cost of a dedicated CPU or GPU is warranted. For real-time, cost-sensitive applications, however, this is not an option. In typical video encoding/transcoding setups, dozens or even hundreds of channels need to be processed on a single server, and the cost per channel is a crucial criterion. Furthermore, the latency of offloading decisions to accelerators would be prohibitive, even if such accelerators were cost-effective, which they are not.
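The per-channel cost argument can be illustrated with simple arithmetic. The sketch below estimates the sustained MAC throughput a server would need if an inference network ran once per frame on every channel. All numbers (MACs per frame, frame rate, channel count) are hypothetical placeholders chosen for illustration, not figures from the paper.

```python
def required_macs_per_second(macs_per_frame, frames_per_second, channels):
    """Sustained MAC throughput needed to run one inference per frame
    on every channel hosted by a single server."""
    return macs_per_frame * frames_per_second * channels


# Hypothetical workload: 1e9 MACs per frame, 50 frames/s, 100 channels.
load = required_macs_per_second(
    macs_per_frame=1_000_000_000,
    frames_per_second=50,
    channels=100,
)
print(load)  # 5000000000000
```

Because the required throughput scales linearly with both frame rate and channel count, reducing the MACs per inference translates directly into more channels per server.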

In this paper, we discuss the applicability of machine learning approaches in different areas of real-time video compression. We successively cover encoder complexity reduction, rate control, video quality improvements and video quality measurement. In each of these areas, we have studied ways to reduce the complexity of ML inference, to end up with algorithms that are applicable in real-time, cost-sensitive applications.
