Multimedia Communication

Vignesh V Menon

Title: INCEPT: INTRA CU Depth Prediction for HEVC

IEEE 23rd International Workshop on Multimedia Signal Processing

October 06–08, 2021, Tampere, Finland

Authors: Vignesh V Menon (Alpen-Adria-Universitat Klagenfurt); Hadi Amirpour (Alpen-Adria-Universität Klagenfurt); Christian Timmerer (Alpen-Adria-Universität Klagenfurt, Austria); Mohammad Ghanbari (University of Essex, UK).

Abstract: High Efficiency Video Coding (HEVC) improves the encoding efficiency by utilizing sophisticated tools such as flexible Coding Tree Unit (CTU) partitioning. The Coding Unit (CU) can be split recursively into four equally sized CUs ranging from 64×64 to 8×8 pixels. At each depth level (or CU size), intra prediction via exhaustive mode search was exploited in HEVC to improve the encoding efficiency and result in a very high encoding time complexity. This paper proposes an Intra CU Depth Prediction (INCEPT) algorithm, which limits Rate-Distortion Optimization (RDO) for each CTU in HEVC by utilizing the spatial correlation with the neighboring CTUs, which is computed using a DCT energy-based feature. Thus, INCEPT reduces the number of candidate CU sizes required to be considered for each CTU in HEVC intra coding. Experimental results show that the INCEPT algorithm achieves a better trade-off between the encoding efficiency and encoding time saving (i.e., BDR/∆T) than the benchmark algorithms. While BDR/∆T is 12.35% and 9.03% for the benchmark algorithms, it is 5.49% for the proposed algorithm. As a result, INCEPT achieves a 23.34% reduction in encoding time on average while incurring only a 1.67% increase in bit rate than the original coding in the x265 HEVC open-source encoder.

Keywords: HEVC, Intra coding, CTU, CU, depth decision

ViSNext’21: 1st ACM CoNEXT Workshop on Design, Deployment, and Evaluation of Network-assisted Video Streaming

In recent years, we have witnessed phenomenal growth in live video traffic over the Internet, accelerated by the rise of novel video streaming technologies, advancements in networking paradigms, and our ability to generate, process, and display videos on heterogeneous devices. Regarding the existing constraints and limitations in different components on the video delivery path from the origin server to clients, the network plays an essential role in boosting the perceived Quality of Experience (QoE) by clients. The ViSNext workshop aims to bring together researchers and developers working on all aspects of video streaming, in particular network-assisted concepts backed up by experimental evidence. Read more about the workshop, call for papers at ViSNext2021 and registration here.

Conference info: IEEE LCN

Authors: Jesús Aguilar Armijo (Alpen-Adria-Universität Klagenfurt), Christian Timmerer (Alpen-Adria-Universität Klagenfurt) and Hermann Hellwagner (Alpen-Adria-Universität Klagenfurt)

Abstract: Mobile networks equipped with edge computing nodes enable access to information that can be leveraged to assist client-based adaptive bitrate (ABR) algorithms in making better adaptation decisions to improve both Quality of Experience (QoE) and fairness. For this purpose, we propose a novel on-the-fly edge mechanism, named EADAS (Edge Assisted Adaptation Scheme for HTTP Adaptive Streaming), located at the edge node that assists and improves the ABR decisions on-the-fly. EADAS proposes (i) an edge ABR algorithm to improve QoE and fairness for clients and (ii) a segment prefetching scheme. The results show a QoE increase of 4.6%, 23.5%, and 24.4% and a fairness increase of 11%, 3.4%, and 5.8% when using a buffer-based, a throughput-based, and a hybrid ABR algorithm, respectively, at the client compared with client-based algorithms without EADAS. Moreover, QoE and fairness among clients can be prioritized using parameters of the EADAS algorithm according to service providers’ requirements.

Keywords: Dynamic Adaptive Streaming over HTTP (DASH), Edge Computing, Network-Assisted Video Streaming, Quality of Experience (QoE).

A Special Session on ‘Video Coding for Large Scale HTTP Adaptive Streaming Deployments‘ was organized by Christian Timmerer (Alpen-Adria-Universität Klagenfurt, Austria), Mohammad Ghanbari (University of Essex, UK), and Alex Giladi (Comcast, USA) on July 2 at the 35th Picture Coding Symposium (PCS) 2021. Read more about it here.

Conference info: The 46th IEEE Conference on Local Computer Networks (LCN) October 4-7, 2021

Authors: Farzad Tashtarian (Alpen-Adria-Universität Klagenfurt), Abdelhak Bentaleb (National University of Singapore), Reza Farahani (Alpen-Adria-Universität Klagenfurt), Minh Nguyen (Alpen-Adria-Universität Klagenfurt), Christian Timmerer (Alpen-Adria-Universität Klagenfurt), Hermann Hellwagner (Alpen-Adria-Universität Klagenfurt), and Roger Zimmermann (National University of Singapore)

Abstract: Live User Generated Content (UGC) has become very popular in today’s video streaming applications, in particular with gaming and e-sport. However, streaming UGC presents unique challenges for video delivery. When dealing with the technical complexity of managing hundreds or thousands of concurrent streams that are geographically distributed, UGCsystems are forces to made difficult trade-offs with video quality and latency. Read more

Conference info: The 46th IEEE Conference on Local Computer Networks (LCN) October 4-7, 2021

Authors: Reza Farahani (Alpen-Adria-Universität Klagenfurt), Farzad Tashtarian (Alpen-Adria-Universität Klagenfurt), Hadi Amirpour (Alpen-Adria-Universität Klagenfurt), Christian Timmerer (Alpen-Adria-Universität Klagenfurt), Mohammad Ghanbari (School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK) and Hermann Hellwagner (Alpen-Adria-Universität Klagenfurt)

Abstract: With the increasing demand for video streaming applications, HTTP Adaptive Streaming (HAS) technology has become the dominant video delivery technique over the Internet. Current HAS solutions only consider either client- or server-side optimization, which causes many problems in achieving high-quality video, leading to sub-optimal users’ experience and network resource utilization. Recent studies have revealed that network-assisted HAS techniques, by providing a comprehensive view of the network, can lead to more significant gains in HAS system performance. Read more

Successful review of the first research phase: Christian Doppler ‘pilot’ laboratory ATHENA to transition to a regular CD laboratory two years after launch. Read more about it at ATHENA website and the press release at AAU website.

 

Hadi

Hadi Amirpour has been appointed co-chair of Task Force 7 (TF7)
Immersive Media Experience (IMEx) at the 15th Qualinet meeting

Co-chairs:

 

TF7: Immersive Media Experiences (IMEx)

Immersive media applications are entering our daily lives starting from VR/AR/360° video applications to multi-sensory/multimedia experiences potentially addressing all human senses rather than focusing on hearing and seeing. The overall goal of providing Immersive Media Experiences (IMEx) to end-users is giving them the sensation of being part of the particular media which shall result in a worthwhile, informative user and quality of experience.

The actual objectives of this task force are as follows:

  • disseminating the white paper
  • working towards submission of the extended version
  • liaison with other communities (UX, sensory sciences) and standards developing organizations (JPEG, MPEG, EBU)
  • Identification of different QoE aspects of immersive experiences
  • QoE models and QoE assessment approaches for immersive experiences, addressing various audiovisual modalities; e.g. HDR, omnidirectional video, light fields, point clouds, and spatial audio.

HTTP Adaptive Streaming – Quo Vadis?

Christian Timmerer, Tuesday, June 29, 2021

35th Picture Coding Symposium (PCS) 2021

Abstract: Video traffic on the Internet is constantly growing; networked multimedia applications consume a predominant share of the available Internet bandwidth. A major technical breakthrough and enabler in multimedia systems research and of industrial networked multimedia services certainly was the HTTP Adaptive Streaming (HAS) technique. This resulted in the standardization of MPEG Dynamic Adaptive Streaming over HTTP (MPEG-DASH) which, together with HTTP Live Streaming (HLS), is widely used for multimedia delivery in today’s networks. Existing challenges in multimedia systems research deal with the trade-off between (i) the ever-increasing content complexity, (ii) various requirements with respect to time (most importantly, latency), and (iii) quality of experience (QoE). Optimizing towards one aspect usually negatively impacts at least one of the other two aspects if not both.

This situation sets the stage for our research work in the ATHENA Christian Doppler (CD) Laboratory (Adaptive Streaming over HTTP and Emerging Networked Multimedia Services; https://athena.itec.aau.at/), jointly funded by public sources and industry.

In this talk, we will present selected novel approaches and research results of the first year of the ATHENA CD Lab’s operation. We will highlight HAS-related research on (i) multimedia content provisioning (machine learning for video encoding); (ii) multimedia content delivery (support of edge processing and virtualized network functions for video networking); (iii) multimedia content consumption and end-to-end aspects (player-triggered segment retransmissions to improve video playout quality); and (iv) novel QoE investigations (adaptive point cloud streaming). We will also put the work into the context of the international multimedia systems research.

Vignesh V Menon

At IEEE International Conference on Image Processing (ICIP) on September 19-22, 2021, Alaska, USA.

Authors: Vignesh V Menon (Alpen-Adria-Universität Klagenfurt),  Hadi Amirpour (Alpen-Adria-Universität Klagenfurt), Mohammad Ghanbari (School of Computer Science and Electronic Engineering, University of Essex, Colchester, UK), and Christian Timmerer (Alpen-Adria-Universität Klagenfurt).

Abstract: Video delivery over the Internet has been becoming a commodity in recent years, owing to the widespread use of DASH. The DASH specification defines a hierarchical data model for Media Presentation Descriptions (MPDs) in terms of segments. This paper focuses on segmenting video into multiple shots for encoding in  VoD HAS applications.
This paper proposes a novel DCT feature-based shot detection and successive elimination algorithm for shot detection algorithm and benchmark the algorithm against the default shot detection algorithm of the x265 implementation of the HEVC standard. Our experimental results demonstrate that the proposed feature-based pre-processor has a recall rate of 25% and an F-measure of 20% greater than the benchmark algorithm for shot detection.

Keywords: HTTP Adaptive Streaming, Video-on-Demand, Shot detection, multi-shot encoding.

Link: https://2021.ieeeicip.org