Scalable light field coding using weighted binary images

Slides:



Advertisements
Similar presentations
VSMC MIMO: A Spectral Efficient Scheme for Cooperative Relay in Cognitive Radio Networks 1.
Advertisements

INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Outline Introduction Introduction Fast Inter Prediction Mode Decision for H.264 – –Pre-encoding An Efficient Inter Mode Decision Approach for H.264 Video.
Binary Image Compression Using Efficient Partitioning into Rectangular Regions IEEE Transactions on Communications Sherif A.Mohamed and Moustafa M. Fahmy.
Video Coding with Linear Compensation (VCLC) Arif Mahmood, Zartash Afzal Uzmi, Sohaib A Khan Department of Computer.
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Video Compression Bee Fong. Lossy Compression  Inter Frame Compression Compression among frames Compression among frames  Intra Frame Compression Compression.
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,
4/24/2002SCL UCSB1 Optimal End-to-end Distortion Estimation for Drift Management in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression.
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
Image processing Lecture 4.
Automatic Camera Calibration
Coding techniques for digital cinema Andreja Samčović University of Belgrade Faculty of Transport and Traffic Engineering.
GODIAN MABINDAH RUTHERFORD UNUSI RICHARD MWANGI.  Differential coding operates by making numbers small. This is a major goal in compression technology:
Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Image Representation. Objectives  Bitmaps: resolution, colour depth and simple bitmap file calculations.  Vector graphics: drawing list – objects and.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
Directional DCT Presented by, -Shreyanka Subbarayappa, Sadaf Ahamed, Tejas Sathe, Priyadarshini Anjanappa K. R. RAO 1.
Adaptive Rate Control for HEVC Visual Communications and Image Processing (VCIP), 2012 IEEE Junjun Si, Siwei Ma, Xinfeng Zhang, Wen Gao 1.
Chapter 9 DTW and VQ Algorithm  9.1 Basic idea of DTW  9.2 DTW algorithm  9.3 Basic idea of VQ  9.4 LBG algorithm  9.5 Improvement of VQ.
A New Operating Tool for Coding in Lossless Image Compression Radu Rădescu University POLITEHNICA of Bucharest, Faculty of Electronics, Telecommunications.
Wonjun Kim and Changick Kim, Member, IEEE
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 5 ECEC 453 Image Processing Architecture Lecture 5, 1/22/2004 Rate-Distortion Theory,
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
DAC50, Designer Track, 156-VB543 Parallel Design Methodology for Video Codec LSI with High-level Synthesis and FPGA-based Platform Kazuya YOKOHARI, Koyo.
Perceptually-Driven Video Coding with the Daala Video Codec
3.3 Fundamentals of data representation
Computer vision: models, learning and inference
Cost Estimation & Cost Behaviour
JPEG Compression What is JPEG? Motivation
Quality Evaluation and Comparison of SVC Encoders
Adaptive Block Coding Order for Intra Prediction in HEVC
Vocabulary byte - The technical term for 8 bits of data.
ECE 3301 General Electrical Engineering
Image Processing Digital image Fundamentals. Introduction to the course Grading – Project: 30% – Midterm Exam: 30% – Final Exam : 40% – Total: 100% –
Image Compression using Vector Quantization
Vocabulary byte - The technical term for 8 bits of data.
Chapter 3 向量量化編碼法.
Huffman Coding, Arithmetic Coding, and JBIG2
Rate 7/8 (1344,1176) LDPC code Date: Authors:
Enhanced-alignment Measure for Binary Foreground Map Evaluation
Histogram Histogram is a graph that shows frequency of anything. Histograms usually have bars that represent frequency of occuring of data. Histogram has.
Coding Approaches for End-to-End 3D TV Systems
Transformations of Functions
A Color Image Hiding Scheme Based on SMVQ and Modulo Operator
Linear Predictive Coding Methods
/ Fast block partitioning method in HEVC Intra coding for UHD video /
Scalable Speech Coding for IP Networks: Beyond iLBC
Predicting Fault-Prone Modules Based on Metrics Transitions
Sangeun Han and Yi Liang
MPEG4 Natural Video Coding
MPEG-Immersive 3DoF+ Standard Work:
Hiding Data in a Color Palette Image with Hybrid Strategies
Image Coding and Compression
○ Hisashi Shimosaka (Doshisha University)
第 四 章 VQ 加速運算與編碼表壓縮 4-.
Longest increasing subsequence (LIS) Matrix chain multiplication
第 九 章 影像邊緣偵測 9-.
A Color Image Hiding Scheme Based on SMVQ and Modulo Operator
Authors: Chin-Chen Chang, Yi-Hui Chen, and Chia-Chen Lin
Author :Ji-Hwei Horng (洪集輝) Professor National Quemoy University
Linear Equations in Linear Algebra
Linear Equations in Linear Algebra
Li Li, Zhu Li, Vladyslav Zakharchenko, Jianle Chen, Houqiang Li
Presentation transcript:

Scalable light field coding using weighted binary images Koji Komatsu, Keita Takahashi, Toshiaki Fujii Graduate School of Engineering, Nagoya University, Japan Thank you for your introduction. I’m Koji Komatsu from Nagoya University in Japan. Today, I’d like to talk about “Scalable light field coding using weighted binary images”. 2018/10/07-10 IEEE International Conference on Image Processing

Multi-view image Depth estimation Multi-view image shoot Camera array Depth estimation shoot Light Field Camera (Lytro ILLUM) Multi-view image So first of all, I mention multi-view images for my research. Multi-view image is a lot of images taken from multiple different view-points, and used for several purposes. For example, depth-estimation, free viewpoint video, and so on. To obtain this multi-view image, camera array is often used previously. As the Characteristic of multi-view image obtained from this camera , interval between view and view is large. And, shooting cost is very high. Free viewpoint video ・ Interval between view and view is large. ・ Shooting cost is very high. 2018/10/07-10 IEEE International Conference on Image Processing

Multi-view image Depth estimation Multi-view image shoot Camera array Depth estimation shoot Light Field Camera (Lytro ILLUM) Multi-view image On the other hand, recently Light Field Camera has been developed. Thanks to this camera, it is very easy to obtain multi-view images, because we can obtain them with just only one shooting. The images obtained from this camera, interval between view and view is very small. And shooting cost is very low. Free viewpoint video ・ Interval between view and view is very small. ・ Shooting cost is very low. 2018/10/07-10 IEEE International Conference on Image Processing

Purpose of our research shoot Camera array Depth estimation shoot Light Field Camera (Lytro ILLUM) Multi-view image You know that multi-view images have a lot of amount of data, so they need to be compressed. So, in our research, we proposed a new scheme which efficiently compresses such dense multi-view images. Free viewpoint video We propose a new scheme which efficiently compresses such dense multi-view images. 2018/10/07-10 IEEE International Conference on Image Processing

inverse quantization, IDCT, predictive coding, etc… Modern video codecs Modern video codec Decode figure inverse quantization, IDCT, predictive coding, etc… To compress multi-view image, most people use modern video codecs, such as HEVC. These codecs are composed of inverse quantization, IDCT, predictive coding, and so on. And you can see the decode figure at top right. As shown in this figure, you know that decoding process is very complex. 2018/10/07-10 IEEE International Conference on Image Processing

Modern video codec and our proposed method Decode figure inverse quantization, IDCT, predictive coding, etc… Our proposed method Decode figure common binary images On the other hand, our proposed method is composed of some binary images and corresponding weights. You can see the decode figure at bottom right. Comparing modern video codecs and our proposed method, the principle of decoding is completely different and our proposed method is much simpler than modern video codecs. weight per viewpoint … 2018/10/07-10 IEEE International Conference on Image Processing

Achievements of our proposed method Modern video codec Decode figure ・ Decoding process is extremely simple. ・ Rate-distortion performances are comparable to that of modern video codecs. inverse quantization, IDCT, predictive coding, etc… Our proposed method Decode figure common binary images As main achievements of our proposed method, we can mainly raise two points as follows. First, decoding process is extremely simple. Second, rate-distortion performances are comparable to that of modern video codecs. So, I will explain our proposed method. weight per viewpoint … 2018/10/07-10 IEEE International Conference on Image Processing

Proposed method (gray scale) × T S h w 𝐵 1 (𝑥,𝑦) 𝑟 1 (𝑠,𝑡) corresponding × 𝐵 2 (𝑥,𝑦) 𝑟 2 (𝑠,𝑡) ⋯ ⋯ × 𝐵 𝑛 (𝑥,𝑦) 𝑟 𝑛 (𝑠,𝑡) ⋯ ⋯ 𝐿(𝑠,𝑡,𝑥,𝑦) × 𝐵 𝑁 (𝑥,𝑦) For simplicity, we treat gray-scale multi-view image. In order to reconstruct multi-view images, we only use common N binary images and weights. The color graphs indicate weights corresponding to the view points. 𝑟 𝑁 (𝑠,𝑡) Binary images weights 2018/10/07-10 IEEE International Conference on Image Processing

Proposed method (gray scale) × 𝐵 1 (𝑥,𝑦) 𝐿(1,1,𝑥,𝑦) 𝑟 1 (𝑠,𝑡) × 𝐵 2 (𝑥,𝑦) 𝑟 2 (𝑠,𝑡) ⋯ ⋯ × 𝐵 𝑛 (𝑥,𝑦) 𝑟 𝑛 (𝑠,𝑡) ⋯ ⋯ 𝐿(𝑠,𝑡,𝑥,𝑦) × 𝐵 𝑁 (𝑥,𝑦) Firstly, when we reconstruct the top left view, (1, 1) view, we multiply the weights corresponding to the viewpoint to N binary images. 𝑟 𝑁 (𝑠,𝑡) (example) -22× +28× ⋯ +7× +18× = × ⋯ 𝑟 1 (1,1) + × + × 𝐵 1 + 𝑟 2 (1,1) × 𝐵 2 𝑟 𝑛 (1,1) 𝐵 𝑛 𝑟 𝑁 (1,1) 𝐵 𝑁 = 𝐿(1,1,𝑥,𝑦) 2018/10/07-10 IEEE International Conference on Image Processing

Proposed method (gray scale) × 𝐵 1 (𝑥,𝑦) 𝑟 1 (𝑠,𝑡) × 𝐵 2 (𝑥,𝑦) 𝑟 2 (𝑠,𝑡) ⋯ ⋯ 𝐿(17,17,𝑥,𝑦) × 𝐵 𝑛 (𝑥,𝑦) 𝑟 𝑛 (𝑠,𝑡) ⋯ ⋯ 𝐿(𝑠,𝑡,𝑥,𝑦) Change only weight values × 𝐵 𝑁 (𝑥,𝑦) Next, when we reconstruct bottom right view, (17, 17) view, , we multiply the different weights corresponding to the viewpoint to the same N binary images. Here, please note that we use the same binary images and only change weight values. 𝑟 𝑁 (𝑠,𝑡) (example) 35× -37× ⋯ +45× +7× = 𝑟 1 (17,17) × + ⋯ 𝑟 2 17,17 × 𝐵 2 + 𝑟 𝑛 (17,17) × 𝐵 𝑛 + × 𝐵 1 𝑟 𝑁 (17,17) 𝐵 𝑁 = 𝐿(17,17,𝑥,𝑦) 2018/10/07-10 IEEE International Conference on Image Processing

Proposed method (gray scale) × 𝐵 1 (𝑥,𝑦) 𝑟 1 (𝑠,𝑡) × 𝐵 2 (𝑥,𝑦) 𝑟 2 (𝑠,𝑡) ⋯ ⋯ × 𝐵 𝑛 (𝑥,𝑦) 𝑟 𝑛 (𝑠,𝑡) ⋯ ⋯ 𝐿(𝑠,𝑡,𝑥,𝑦) × 𝐵 𝑁 (𝑥,𝑦) So, we can reconstruct whole multi view images using only common binary images and corresponding weights, just by changing the weight values. This is our core proposal. 𝑟 𝑁 (𝑠,𝑡) We can reconstruct multi-view images using only common binary images and corresponding weights. 2018/10/07-10 IEEE International Conference on Image Processing

Optimization problem Multi-view images Binary images weights 𝑟 𝑛 (𝑠,𝑡) 𝑟 𝑛 (𝑠,𝑡) Next, I explain how binary images and corresponding weights are generated from input multi-view images. 2018/10/07-10 IEEE International Conference on Image Processing

(computational cost is low) Optimization problem Least squares method (computational cost is low) Multi-view images Binary images weights 𝑟 𝑛 (𝑠,𝑡) To obtain these optimum solution, we need to solve the following least square problem. This equation include two unknown pairs, binary images and weights. So, first of all, I appropriately initialize binary images and fix them, and then calculate the weights using least square method. This computational cost is not so high. fix 2018/10/07-10 IEEE International Conference on Image Processing

Optimization problem Multi-view images Binary images weights 𝑟 𝑛 (𝑠,𝑡) 𝑟 𝑛 (𝑠,𝑡) And then, fix the weights and obtain the binary images by solving 01 combinatorial optimization problem. This computational cost is very high, so this is a bottleneck of our proposed method. So I will mention this problem in detail later. fix 01 combinatorial optimization problem (computational cost is very high) 2018/10/07-10 IEEE International Conference on Image Processing

Optimization problem Alternate optimization Least squares method (computational cost is low) Multi-view images Binary images weights 𝑟 𝑛 (𝑠,𝑡) Alternate optimization And we alternately optimize these problems until convergence, we can obtain the optimum solutions, binary images and weights. 01 combinatorial optimization problem 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 𝑁 w h ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) Next, I explain the difficulty of solving 01 combinatorial optimization problem. We change the binary images’ pixel value 0 or 1, and search for the optimum combination when the following cost function is minimum. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 w h ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) Like this, we apply 0 or 1, from 00000 to 11111. And we will find the optimum solution. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 w h ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) Like this, we apply 0 or 1, from 00000 to 11111. And we will find the optimum solution. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 w h ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) Like this, we apply 0 or 1, from 00000 to 11111. And we will find the optimum solution. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 w h ≅ optimum Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) Like this, we apply 0 or 1, from 00000 to 11111. And we will find the optimum solution. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 1 1 ≅ h optimum w Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) And we repeat this process for all pixel. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 1 𝑁 w h ≅ optimum Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) And we repeat this process for all pixel. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 𝑁 ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) And we can obtain the optimum N binary images. Column vector of multi-view images’ pixel values cost function 𝐸= 𝑳−𝑹𝑩 2 Search for the combination when the cost function is minimum Column vector of binary images’ pixel values matrix of weights 2018/10/07-10 IEEE International Conference on Image Processing

01 combinatorial optimization problem ST 𝑁 ≅ Multi-view images weights Binary images 𝑟 𝑛 (𝑠,𝑡) From this process, the amount of calculation is like this. So you know that as the number of binary images N increases, the computational cost increases explosively. This is the bottleneck of our proposed method. So, in order to solve this problem, we applied scalable coding to our method. calculation = 2 𝑁 ×𝑤×ℎ As the number of binary images (N) increases, computational cost explosively increases. Implement scalable coding 2018/10/07-10 IEEE International Conference on Image Processing

Reconstruct multi-view images with 𝑁 1 (1~3) binary images. Scalable coding — = input difference Reconstruct multi-view images with 𝑁 1 (1~3) binary images. Next, I’d like to explain the scalable coding. Firstly, we express the multi-view images using very few binary images, for example 1 to 3. Of course, the reconstructed multi-view images are not expressed well. And, we obtain the difference between the original multi-view images and the reconstructed images. 2018/10/07-10 IEEE International Conference on Image Processing

Reconstruct multi-view images with 𝑁 2 (1~3) binary images. Scalable coding — After that, in the same way we express the difference of multi-view images using very few binary images. And we obtain the difference between the previous difference and current reconstructed images. difference Reconstruct multi-view images with 𝑁 2 (1~3) binary images. 2018/10/07-10 IEEE International Conference on Image Processing

Scalable coding ⋮ ⋮ ⋮ M layer Calculation cost is very low for each layer because the number of binary images 𝑁 𝑚 is small ⋮ And repeat this process for M times. The residuals obviously decrease. Here, because we just optimize very few binary images for each layer, we can drastically reduce the computational cost. ⋮ ⋮ M layer 2018/10/07-10 IEEE International Conference on Image Processing

Add from top layer to any layer. 1~M’ (M’<M) Scalable coding Add from top layer to any layer. 1~M’ (M’<M) ⋮ And by adding from the top layer to any layer M’, we can control the bit rate. As the number of M’ increases, multi-view images can be expressed clearly. ⋮ ⋮ 2018/10/07-10 IEEE International Conference on Image Processing

Comparison of computational cost Non-scalable 2×𝑀 2 𝑁 Scalable :number of binary images for m-th layer suppose that And this is the comparison of computational cost between the non-scalable coding and scalable coding. As mentioned in the previous slide, the calculation amount is like this, and N_m denotes the number of binary images for m-th layer. When we suppose that N = N_m times M, N_m = 3, these amount of calculations in each condition can be reduced like this. When N=15, about 820 times When N=30, about 13.4million times 2018/10/07-10 IEEE International Conference on Image Processing

Summary so far Multi-view images can be expressed using common binary images and corresponding weights. It has high computational cost in optimization. Here, I summarize our proposed method so far. We proposed an efficient scheme which compress dense light fields using common binary images and corresponding weights. However, It needs high computational cost as we use many binary images. So we applied scalable coding to our method. By dividing into multiple hierarchies, we can drastically reduce the computational cost. By dividing into multiple hierarchies, the calculation cost is drastically reduced. 2018/10/07-10 IEEE International Conference on Image Processing

Experiments Performance comparisons between non-scalable method and scalable method. - PSNR - encoding time Performance comparisons between modern video codecs and our proposed method. Next, I will show you some experimental results. We have mainly conducted following experiments. 2018/10/07-10 IEEE International Conference on Image Processing

Experiments Performance comparisons between non-scalable method and scalable method. - PSNR - encoding time Performance comparisons between modern video codecs and our proposed method. Firstly, I will show you the performance comparisons between non-scalable coding and scalable coding. We evaluated PSNR and encoding time. 2018/10/07-10 IEEE International Conference on Image Processing

Experiments Multi-view image 17×17=289 views, 160×120 pixels, gray-scale Non-scalable Scalable This is the experimental condition. We compressed this multi-view images, which have 17 times 17 views and 160 times 120 pixels. Both methods use from 1 to 15 binary images and in scalable coding, the number of binary images for each layer N_m is set from 1 to 3. binary image : = 1 ~ 3 𝑀 layer ( ) binary image : N = 1 ~ 15 Experiment Ⅰ: comparison of PSNR. Experiment Ⅱ: comparison of encoding time. 2018/10/07-10 IEEE International Conference on Image Processing

Experimental result(PSNR) [dB] Total number of binary images Non-scalable = 3 (scalable) Compare encoding time = 1 (scalable) This is the experimental results of PSNR. The horizontal axis indicates the total number of binary images and the vertical axis indicates PSNR. The black graph shows the result of non-scalable coding and red graphs show the results of scalable coding. We confirm that the best performance can be achieved when N_m is 3. The performance of scalable coding is inferior to that of non-scalable coding, because it is not whole optimal solutions. But the performances are not declined so much. Next, I’d like to show you the encoding time when we use 12 binary images. = 2 (scalable) 2018/10/07-10 IEEE International Conference on Image Processing

Experimental results(encoding time) (scalable) (non-scalable) 8000 10000 ≈ about 1/100 This is the results of encoding time. Non-scalable coding takes over 8000 seconds, but each scalable coding takes only about 80 seconds. 2018/10/07-10 IEEE International Conference on Image Processing

Experimental results(encoding time) (scalable) (non-scalable) 8000 10000 ≈ High speed encoding can be achieved without declining in rate-distortion performances so much. about 1/100 From these results, we have achieved high speed encoding without declining in rate-distortion performances so much. 2018/10/07-10 IEEE International Conference on Image Processing

Experiments Performance comparisons between non-scalable method and scalable method. - PSNR - encoding time Performance comparisons between modern video codecs and our proposed method. Finally, I will show you the performance comparisons between modern video codecs and our proposed method. 2018/10/07-10 IEEE International Conference on Image Processing

Experiments Multi-view images: 17×17=289 views, gray-scale Proposed method Modern video codecs ⋮ Video file output Video frame ・binary image: 6 ~ 15 ・weight   : short int (16bit) ・initialize  : threshold left top image (average pixel value) ・implement with gzip ・non-scalable This is the experimental conditions. In proposed method, we use from 6 to 15 binary images and with non-scalable coding. And I also evaluated with gzip. Modern video codecs are implemented with ffmpeg, and the video codecs are HEVC, H.264, and with all intra frame mode. ・implemented with ffmpeg ・video codec: HEVC, H.264(w/ all_intra) ・bit rate : 6 ~ 30kbps ・frame rate : 30fps 2018/10/07-10 IEEE International Conference on Image Processing

Experimental datasets Amethyst (96×128) Bunny (128×128) This is the datasets for experiments. Knight (128×128) Truck (160×120) 2018/10/07-10 IEEE International Conference on Image Processing

Experimental results This is the experimental results. The horizontal axis indicates the bit per pixel and the vertical axis indicates PSNR. Red graph shows the result of our method and each black graph shows the results of modern video codecs. It is better when graph positions on the top left corner because it means that it achieves high PSNR with small data. 2018/10/07-10 IEEE International Conference on Image Processing

Experimental results Our proposed method with gzip can achieve comparable rate-distortion performances to that of modern video codecs. As you can see, the rate-distortion performance of our proposed method with gzip is comparable to that of modern video codecs. 2018/10/07-10 IEEE International Conference on Image Processing

Conclusion Efficient coding scheme for dense light fields. (using only common binary images and weights) Decoding process is extremely simple. Rate-distortion performances are comparable to that of modern video codecs. Scalable coding Reduce computational costs drastically without declining in rate-distortion performances so much. Finally, I’d like to summarize my presentation. We proposed an efficient coding scheme for dense light fields, which use only common binary images and corresponding weights. This is completely different from modern video codecs and the decoding process of our method is extremely simple. And, rate-distortion performances of our method are comparable to that of modern video codecs. Moreover I applied scalable coding to our proposed method because it needs high computational cost when using many binary images. Thanks to this approach, we can drastically reduce the computational cost without declining in rate-distortion performances so much. Thank you for listening to my presentation. 2018/10/07-10 IEEE International Conference on Image Processing