Description
Summary
In this internship at the London AI Video Lab, the objective is to study fixed-point arithmetic solutions for ensuring bit-exact execution of AI-based video codecs. Current AI-based video compression models outperform conventional codecs such as HEVC, VVC, and AV1. However, these models are trained using floating-point arithmetic, which does not guarantee bit-exact execution across devices. Bit-exact execution is required so that encoded bitstreams are decodable on any device. Fixed-point arithmetic is a potential solution to this problem. The goal of the internship is to determine a fixed-point arithmetic setup that ensures bit-exactness while maintaining model performance.
This work is one step toward the deployment of end-to-end trained AI-based video compression models.
The goal will be to study various fixed-point arithmetic setups for the layers and components of AI-based video compression models. Quantization bit-width, scaling, and bias will be studied on a per-component basis. The setup will be integrated into the London AI Video Lab's end-to-end trained video compression model, and the performance of the proposed solution will be evaluated and compared against existing models.
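To make the per-component setup concrete, here is a minimal sketch (not the lab's actual scheme; bit-width and fractional-bit choices are illustrative assumptions) of fixed-point quantization: a float tensor is mapped to integers using a power-of-two scale, so that all subsequent arithmetic can be integer-only and therefore bit-exact across platforms.

```python
import numpy as np

def quantize(x: np.ndarray, bit_width: int, frac_bits: int) -> np.ndarray:
    """Map floats to signed fixed-point integers with `frac_bits`
    fractional bits, saturating to the representable range."""
    scale = 1 << frac_bits                      # power-of-two scale
    q = np.round(x * scale).astype(np.int64)    # round to nearest integer
    lo, hi = -(1 << (bit_width - 1)), (1 << (bit_width - 1)) - 1
    return np.clip(q, lo, hi)                   # saturate to bit_width

def dequantize(q: np.ndarray, frac_bits: int) -> np.ndarray:
    """Recover the approximate float value from its fixed-point code."""
    return q.astype(np.float64) / (1 << frac_bits)

x = np.array([0.5, -1.25, 3.14159])
q = quantize(x, bit_width=16, frac_bits=8)      # Q7.8 format: [128, -320, 804]
x_hat = dequantize(q, 8)                        # error bounded by 2**-9
```

The per-component study described above would then vary `bit_width` and `frac_bits` (and a possible integer bias/zero-point) independently for each layer, trading representable range against precision.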
The internship will take place in the London AI Video Lab. The intern will be mentored by scientists and will be part of a research project developing end-to-end trained AI-based video compression models.
Duration: 5-6 months, starting January-April 2026
Responsibilities
State-of-the-art and analysis of existing solutions
Implementation of deterministic fixed-point Deep Learning layers with varying bit-depths
Evaluation and reporting of results
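As an illustration of the second responsibility, the following is a hypothetical sketch of a deterministic fixed-point fully-connected layer: activations, weights, and bias are integers, accumulation is exact in int64, and the output is rescaled by an arithmetic right shift. Because every operation is integer-only, the result is bit-exact on any machine; the function name, formats, and shift-based rescaling are illustrative assumptions, not the lab's implementation.

```python
import numpy as np

def fixed_point_linear(x_q: np.ndarray, w_q: np.ndarray, b_q: np.ndarray,
                       shift: int, bit_width: int = 16) -> np.ndarray:
    """Compute y_q = saturate((x_q @ w_q.T + b_q) >> shift) with
    integer-only, deterministic arithmetic."""
    # Exact integer accumulation: int64 avoids overflow for moderate sizes.
    acc = x_q.astype(np.int64) @ w_q.astype(np.int64).T + b_q
    # Rescale by an arithmetic right shift (floor division by 2**shift).
    y = acc >> shift
    # Saturate to the target bit-width.
    lo, hi = -(1 << (bit_width - 1)), (1 << (bit_width - 1)) - 1
    return np.clip(y, lo, hi)

# Example in Q7.8: x = [1.0, -0.5], w = [1.0, 1.0] -> y = 0.5 (i.e., 128).
y_q = fixed_point_linear(np.array([[256, -128]]), np.array([[256, 256]]),
                         np.int64(0), shift=8)
```

Unlike floating-point matrix multiplication, whose result can depend on summation order and hardware, this integer pipeline produces the same bits regardless of platform, which is the property the internship targets.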
Related work
Nagel, Markus, et al. "A white paper on neural network quantization." arXiv preprint arXiv:21 (2021).
Jia, Zhaoyang, et al. "Towards practical real-time neural video compression." Proceedings of the Computer Vision and Pattern Recognition Conference. 2025.
Li, Zhikai, and Qingyi Gu. "I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference." Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 2023.
Keywords: scientific computing, computer vision, video compression, machine learning (deep learning), real-time video processing
Expected Outcomes:
Beyond the main deliverable, the bit-exact model and its evaluation, this internship is expected to generate patents and publications.
Start date
Dec 15, 2025
Profile
Qualifications
MSc in Computer Science, Machine Learning, Mathematics, Physics or a related field
Fluency in C++ and Python; experience with video processing, computer vision, and PyTorch
Working time
Full time
Duration (months)
6
Education
RJ/Qualif/Ingenieur_B5
Sector
Ind_hightech_telecom