TY - GEN
T1 - OMRA
T2 - 31st IEEE International Conference on Image Processing, ICIP 2024
AU - Gao, Zong Lin
AU - NguyenQuang, Sang
AU - Peng, Wen Hsiao
AU - HoangVan, Xiem
N1 - Publisher Copyright:
© 2024 IEEE
PY - 2024
Y1 - 2024
N2 - Learned hierarchical B-frame coding aims to leverage bidirectional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GOPs, is unable to handle large motion at test time, incurring a negative impact on compression performance. To mitigate the domain shift, we present an online motion resolution adaptation (OMRA) method. It adapts the spatial resolution of video frames on a per-frame basis to suit the capability of the motion estimation network in a pre-trained B-frame codec. Our OMRA is an online, inference technique. It need not re-train the codec and is readily applicable to existing B-frame codecs that adopt hierarchical bi-directional prediction. Experimental results show that OMRA significantly enhances the compression performance of two state-of-the-art learned B-frame codecs on commonly used datasets.
AB - Learned hierarchical B-frame coding aims to leverage bidirectional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GOPs, is unable to handle large motion at test time, incurring a negative impact on compression performance. To mitigate the domain shift, we present an online motion resolution adaptation (OMRA) method. It adapts the spatial resolution of video frames on a per-frame basis to suit the capability of the motion estimation network in a pre-trained B-frame codec. Our OMRA is an online, inference technique. It need not re-train the codec and is readily applicable to existing B-frame codecs that adopt hierarchical bi-directional prediction. Experimental results show that OMRA significantly enhances the compression performance of two state-of-the-art learned B-frame codecs on commonly used datasets.
KW - B-frame Coding
KW - Domain Shift
KW - Learned Video Coding
UR - http://www.scopus.com/inward/record.url?scp=85216858554&partnerID=8YFLogxK
U2 - 10.1109/ICIP51287.2024.10647370
DO - 10.1109/ICIP51287.2024.10647370
M3 - Conference contribution
AN - SCOPUS:85216858554
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 1960
EP - 1966
BT - 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PB - IEEE Computer Society
Y2 - 27 October 2024 through 30 October 2024
ER -