Presentation is loading. Please wait.

Presentation is loading. Please wait.

Bhavana Prabhakar Electrical Engineering Graduate Student The University of Texas at Arlington Advisor Dr. K. R. Rao, EE Dept, UTA Committee Members Dr.

Similar presentations


Presentation on theme: "Bhavana Prabhakar Electrical Engineering Graduate Student The University of Texas at Arlington Advisor Dr. K. R. Rao, EE Dept, UTA Committee Members Dr."— Presentation transcript:

1 Bhavana Prabhakar Electrical Engineering Graduate Student The University of Texas at Arlington Advisor Dr. K. R. Rao, EE Dept, UTA Committee Members Dr. W. Alan Davis, EE Dept, UTA Dr. Ioannis D. Schizas, EE Dept, UTA

2 Introduction- Need for compression Overview of H.264/AVC Adaptive prediction block filter Quality assessment metrics Experimental results Conclusions Future research References

3

4 * Large storage requirements of multimedia data * Relatively slow storage devices which do not allow playing multimedia data in real-time * The present network’s bandwidth does not allow real-time video data transmission

5 TextImageAudioVideo Object type -ASCII -EBCDIS -Bitmapped graphics -Still photos -Faxes Non coded stream of digitized audio or voice TV analog or digital image with synched streams at 24- 30 frames/s Size and bandwidth 2KB per page -Simple 64KB/image -Detailed(color) 7.5MB/image Voice/Phone 8 KHz/8 bits (mono) 6-44 KB/s AUDIO CD 44.1 KHz/ 16 bit/stereo 176 KB/s 27.7 MB/s for 640 × 480 × 24 pixels per frame (24-bit color) 30 frames/s

6 * Lossy * Lossless

7

8 * Transform for reduction of spatial correlation. * Quantization for controlling the bitrate. * Motion compensated prediction for reduction of temporal correlation. * Entropy coding for reduction in statistical correlation.

9 Recent block-oriented motion-compensation-based codec. Good video quality at substantially lower bit rates. Better rate-distortion performance and compression efficiency than MPEG-2. Simple syntax specifications, very flexible. Network friendly. Wide variety of applications such as video broadcasting, video streaming, video conferencing, D-Cinema, HDTV, VOD.

10

11

12

13 R = Correlation matrix r = correlation vector w = weights

14 The following filters have been introduced in H.264 to provide a more precise reference picture. * Adaptive interpolation filter * Adaptive loop Filter

15 Integer pixels (shaded blocks with upper-case letters) and fractional pixel positions (non-shaded blocks with lower-case letters).

16 * 6- tap Wiener filter with filter coefficients (1,-5,20,20,-5,1)/32. * Half pel positions are calculated using horizontal/vertical 6-tap wiener filter. * Bilinear filter is applied at already calculated half pel and full pel positions to obtain quarter pel position.

17 * Minimum mean squared error (MMSE) estimator to calculate the coefficients and achieve minimum motion compensated prediction (MCP) error. * The filter coefficients are optimized on a frame basis, such that for each frame the energy of the MCP error is minimized. * The optimal coefficients are quantized, coded, and transmitted as the side information of the associated frame.

18 * Block-based adaptive loop filter * Quadtree-based adaptive loop filter

19

20 * Although wiener filter can restore the reconstructed picture to the original picture globally, there are degraded pixels locally. * Since the degraded area reduces the filtering efficiency, if these areas are not filtered, the capabilities of picture restoration is improved. * Block-based ALF signals a flag for each block to indicate whether the block is filtered or not

21

22

23

24

25

26 Two aspects need evaluation – the type and amount of degradation induced in the reconstructed image Measures Objective quality measure- PSNR Bitrate PSNR for a NxM pixel image are defined as where x is the original image and y is the reconstructed image. M and N are the width and height of an image and ‘L’ is the maximum pixel value in the NxM pixel image. dB

27

28

29 SequenceSizeFrame rate (fps)Selection ratio (%) Kimono 1920 × 1080 (1080p) 249 ParkScene243.36 Cactus507.06 RaceHorses 832 × 480 (WVGA) 3012.33 BasketballDrill506.88 BQMall602.91 RaceHorses 416 × 240 (WQVGA) 305.07 BasketballPass500.44 BlowingBubbles508.76 Selection ratio of intra-modes in P frame. Frames for 1 second are coded by the original H.264/AVC standard [16].

30 SequenceSizeH.264/AVC + 3 × 3 APBF BD-PSNR (dB)BD- bitrate (%)∆T Enc (%)∆T Dec (%) Kimono1920 × 1080 (1080p)0.21-4.7565.7884.65 ParkScene0.16-3.5866.8460.12 Cactus0.19-6.6465.53111.64 RaceHorses832 × 480 (WVGA)0.15-3.1263.5197.23 BasketballDrill0.38-9.8565.9115.96 BQMall0.29-6.4463.2694.57 RaceHorses416 × 240 (WQVGA)0.13-2.3762.4384.96 BasketballPass0.22-4.6869.7572.53 BlowingBubbles0.18-3.9663.8194.94 Average on 1080p seq.0.187-4.9966.0585.47 Average on WVGA seq.0.273-6.4764.22102.58 Average on WQVGA seq.0.177-3.6765.3384.14 Average on overall0.212-5.04365.2090.73

31

32

33 SequenceSizeH.264/AVC + 5 × 5 APBF BD-PSNR (dB)BD- bitrate (%)∆T Enc (%)∆T Dec (%) Kimono1920 × 1080 (1080p)0.18-4.5695.48197.47 ParkScene0.15-3.7196.06166.19 Cactus0.17-5.7295.3278.44 RaceHorses832 × 480 (WVGA)0.13-2.6789.29242.35 BasketballDrill0.38-8.7995.58299.5 BQMall0.28-5.9293.48241.4 RaceHorses416 × 240 (WQVGA)0.1-1.7384.79208.78 BasketballPass0.21-3.9194.94188.08 BlowingBubbles0.17-3.8786.32249.7 Average on 1080p seq.0.167-4.6695.61214.03 Average on WVGA seq.0.263-5.7992.78261.08 Average on WQVGA seq.0.16-3.1788.68215.52 Average on overall0.197-4.5492.36230.21

34

35

36 SequenceSizeH.264/AVC + 7 × 7 APBF BD-PSNR (dB)BD- bitrate (%)∆T Enc (%)∆T Dec (%) Kimono1920 × 1080 (1080p)0.16-3.47157.06413.67 ParkScene0.13-2.83153.44406.93 Cactus0.16-5.85155.91556.88 RaceHorses832 × 480 (WVGA)0.09-1.87143.21484.7 BasketballDrill0.33-7.96154.04599 BQMall0.25-5.66147482.8 RaceHorses416 × 240 (WQVGA)0.09-1.42142.42417.56 BasketballPass0.17-3.83154.51376.16 BlowingBubbles0.16-3.48138.66499.4 Average on 1080p seq.0.15-4.05155.47459.16 Average on WVGA seq.0.223-5.163148.083522.167 Average on WQVGA seq.0.14-2.91145.197431.04 Average on overall0.171-4.041149.58470.789

37

38

39

40

41

42 The best compression performance is obtained when the 3×3 APBF is utilized, and the average BD-bitrate gain is about 5.04% for overall sequences. The complexities of the encoder and the decoder are unavoidably increased because of the filter coefficient computation and filtering process. The increasing rate of the encoder complexity is about 1.6 times and decoder complexity is about 1.9 times on an average when 3×3 APBF is used.

43

44 1.Coding efficiency and computational complexity can be improved if other filter shapes are exploited - a cross-shaped filter [16] adaptively adjusted depending on the target applications. 2.If the adaptive filter is applied to both decoded and predicted signals, additional coding gain is achieved at the expense of increase in the computational complexity. 3.The computational complexity can be reduced to a large extent if QALF is implemented along with 3x3 APBF. 4.Instead of LMS scheme, RLS (Recursive least squares), LRLS (Lattice recursive least squares) or NLRLS (Normalized lattice recursive least squares) schemes can be used to obtain better APBFs. 5.APBF on SBs can also be implemented in HEVC standard keeping in view of the increase in encoder and decoder complexity.

45

46

47 [1] T. Wiegand et al., “Overview of the H.264/AVC Video Coding Standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 560-576, July 2003. [2] ITU-T VCEG KTA Reference software http://iphome.hhi.de/suehring/tml/download/KTA/ http://iphome.hhi.de/suehring/tml/download/KTA/ [3] Y. Vatis and J. Ostermann, “Adaptive Interpolation Filter for H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 2, pp. 179-192, Feb. 2009. [4] T. Chujoh, N. Wada, and G. Yasuda, “Quadtree-Based Adaptive Loop Filter,” ITU-T SG16/Q.6 Doc. COM16-C181-E, Geneva, Switzerland, Jan. 2009. [5] Y. Ye, P. Chen, and M. Karczewicz, “High Precision Interpolation and Prediction,” ITU-T SG16/Q.6 Doc. VCEGAI33, Berlin, Germany, July 2008. [6] J. Jung and G. Laroche, “Competition-Based Scheme for Motion Vector Selection and Coding,” ITU-T SG16/Q.6 Doc. VCEGAC06, Klagenfurt, Austria, July 2006. [7] P. Chen, Y. Ye, and M. Karczewicz, “Video Coding Using Extended Block Sizes,” ITU-T SG16/Q.6 Doc. VCEG-AJ23, San Diego, USA, Oct. 2008. [8] Y. Liu, “Unified Loop Filter for Video Compression,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 10, pp. 1378-1382, Oct. 2010.

48 [9] G.J. Sullivan and T. Wiegand, “Rate-Distortion Optimization for Video Compression,” IEEE Signal Process. Mag., vol. 15, no. 6, pp. 74-90, Nov. 1998. [10] H. Lee et al., “Enhanced Block-Based Adaptive Loop Filter with Multiple Symmetric Structures for Video Coding,” ETRI J., vol. 32, no. 4, pp. 626-629, Aug. 2010. [11] ISO/IEC JTC1/SC29/WG11 and ITU-T SG16/Q.6, “Joint Call for Proposals on Video Compression Technology,” WG11 Doc.N11113 and ITU-T SG16/Q.6 Doc. VCEG-AM91, Kyoto, Japan, Jan. 2010. [12] G. Bjøntegaard, “Calculation of Average PSNR Differences between RD- Curves,” ITU-T SG16/Q.6 Doc. VCEG-M33, Austin, USA, Apr. 2001. [13] D. Marpe, T. Wiegand and G. J. Sullivan, “The H.264/MPEG-4 AVC standard and its applications”, IEEE Communications Magazine, vol. 44, pp. 134-143, Aug. 2006. [14] T. Wiegand and G. J. Sullivan, “The picturephone is here: Really”, IEEE Spectrum, vol.48, pp. 50-54, Sep. 2011. [15] I. E. Richardson, “The H.264 Advanced Video Compression Standard”, 2nd Edition, Wiley 2010.

49 [16] Yeo-Jin Yoon et al., “Adaptive Prediction Block Filter for Video Coding”, ETRI J., vol.34, no. 1, pp 106-109, Feb. 2012. [17] Ke-Ying Liao et al., “Rate-Distortion Cost Estimation for H.264/AVC”, IEEE Transactions on circuits and systems for video technology, vol. 20, no. 1, pp. 38-49, Jan. 2010. [18] http://www.h265.net/2010/07/adaptive-interpolation-filter-for-video- coding.htmlhttp://www.h265.net/2010/07/adaptive-interpolation-filter-for-video- coding.html [19] http://www.h265.net/2009/08/adaptive-post-loop-filters-in-jmkta-part- 2.htmlhttp://www.h265.net/2009/08/adaptive-post-loop-filters-in-jmkta-part- 2.html [20] E. A. Fox, “Advances in interactive digital multimedia systems”, IEEE Computer, vol. 24, pp. 9-21, Oct. 1991. [21] B. Furht, “Survey of multimedia compression techniques and standards. Part 1: JPEG standard”, Real time imaging, vol. 1, pp.49-67, 1995. [22] B. Furht, “Multimedia systems : an overview”, IEEE Multimedia, vol. 1, pp. 47-59, 1994. [23] C. E. Manning “Why do we need compression?”, http://www.newmediarepublic.com/dvideo/compression/adv03.html, 1996. http://www.newmediarepublic.com/dvideo/compression/adv03.html [24] S. Kwon, A. Tamhankar and K.R. Rao, ”Overview of H.264 / MPEG-4 Part 10”, J. Visual Communication and Image Representation, vol. 17, pp.186-216, April 2006.

50 [25] Open source article, “Intra frame coding” : http://www.cs.cf.ac.uk/Dave/Multimedia/node248.html http://www.cs.cf.ac.uk/Dave/Multimedia/node248.html [26] Open source article, “H.264/MPEG-4 AVC,” Wikipedia Foundation, http://en.wikipedia.org/wiki/H.264/MPEG-4_AVC http://en.wikipedia.org/wiki/H.264/MPEG-4_AVC [27] A. K. Kulkarni,” Implementation of fast inter-prediction mode decision in H.264/AVC video encoder”, M.S. Thesis, E.E Dept, UTA, 2012. [28] S. S. Vaidyanath, “ Low complexity H.264 encoder using machine learning for streaming applications” M.S Thesis, E.E Dept, UTA, 2011. [29] “http://www.vcodex.com/_les/," working of H.264 codec. [30] I. E.G. Richardson, “H.264 and MPEG-4 video compression: video coding for next-generation multimedia”, Wiley, 2003. [31] S. Subbarayappa, “ Implementation and analysis of directional discrete cosine transform in H.264 for baseline profile ” M.S. Thesis, E.E Dept, UTA, 2012. [32] Y. Vatis and J. Ostermann, ITU-T SG16/Q [15] (VCEG) VCEG-AE16, Marrakech, Morocco, Jan. 2007. [33] JVT of ISO/IEC & ITU-T, Draft ITU-T Recommendation H.264 and Draft ISO/IEC 14496-10 AVC, Doc JVT-Go50. Pattaya, Thailand, 2003.

51 [34] T. Wedi, “Adaptive interpolation filter for motion and aliasing compensated prediction”, in Proc VCIP, San Jose, CA, USA, pp. 415–422, Jan. 2002. [35] JM11 reference software JM11KTA2.3 http://www.h265.net/2009/04/kta-software-jm11kta23.htmlJM11KTA2.3 http://www.h265.net/2009/04/kta-software-jm11kta23.html [36] Open source article, “Bit rate”, http://en.wikipedia.org/wiki/Bit_ratehttp://en.wikipedia.org/wiki/Bit_rate [37] Open source article, “Wiener filter” http://en.wikipedia.org/wiki/Wiener_filter http://en.wikipedia.org/wiki/Wiener_filter [38] S. V. Vaseghi, “Advanced digital signal processing and noise reduction”, Wiley, 2 nd edition, 2000.


Download ppt "Bhavana Prabhakar Electrical Engineering Graduate Student The University of Texas at Arlington Advisor Dr. K. R. Rao, EE Dept, UTA Committee Members Dr."

Similar presentations


Ads by Google