Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control

Jun Hao Hu, Wen-Hsiao Peng, Chia Hua Chung

研究成果: Conference contribution同行評審

28 引文 斯高帕斯(Scopus)

摘要

Reinforcement learning has proven effective for solving decision making problems. However, its application to modern video codecs has yet to be seen. This paper presents an early attempt to introduce reinforcement learning to HEVC/H.265 intra-frame rate control. The task is to determine a quantization parameter value for every coding tree unit in a frame, with the objective being to minimize the frame-level distortion subject to a rate constraint. We draw an analogy between the rate control problem and the reinforcement learning problem, by considering the texture complexity of coding tree units and bit balance as the environment state, the quantization parameter value as an action that an agent needs to take, and the negative distortion of the coding tree unit as an immediate reward. We train a neural network based on Q-learning to be our agent, which observes the state to evaluate the reward for each possible action. When trained on only limited sequences, the proposed model can already perform comparably with the rate control algorithm in HM-16.15.

原文English
主出版物標題2018 IEEE International Symposium on Circuits and Systems, ISCAS 2018 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁數5
ISBN(電子)9781538648810
DOIs
出版狀態Published - 27 5月 2018
事件2018 IEEE International Symposium on Circuits and Systems, ISCAS 2018 - Florence, Italy
持續時間: 27 5月 201830 5月 2018

出版系列

名字Proceedings - IEEE International Symposium on Circuits and Systems
2018-May
ISSN(列印)0271-4310

Conference

Conference2018 IEEE International Symposium on Circuits and Systems, ISCAS 2018
國家/地區Italy
城市Florence
期間27/05/1830/05/18

指紋

深入研究「Reinforcement Learning for HEVC/H.265 Intra-Frame Rate Control」主題。共同形成了獨特的指紋。

引用此