Improvement of Schema-Informed XML Binary Encoding Using Schema Optimization Method BumSuk Jang and Young-guk Ha' Konkuk University, Department of Computer.


Improvement of Schema-Informed XML Binary Encoding Using Schema Optimization Method

BumSuk Jang and Young-guk Ha¹
Konkuk University, Department of Computer Science and Engineering, Seoul, Republic of Korea

Abstract. As XML documents are widely used for data communication on the Internet, many efforts have been made to increase the efficiency of XML data transmission. One such effort is to reduce the size of XML documents with binary formats such as BiM and EXI, which transform an XML document into binary data. The basic principle of binary encoding is to assign each XML event, such as an Element, Attribute, or Content event, a small code based on the schema. The size of each code is determined by the number of XML events that can occur next, so the size of the schema affects the size of the binary-encoded data. Generally, the whole schema is used to encode an XML document even though the document conforms to only a part of it. To reduce the size of binary-encoded data further, this paper proposes an encoding method that applies a novel optimization technique to the XML schema used for encoding.

1 Introduction

As XML documents are widely used for data communication on the Internet, many efforts have been made to increase the efficiency of XML data transmission. One such effort is to reduce the size of XML documents with binary formats such as BiM and EXI. Binary MPEG-7 (BiM) [1] is the binary encoding format of the MPEG-7 specification, and the Efficient XML Interchange (EXI) format [2] is the W3C's XML binary encoding standard. Both transform an XML document into binary data using automata derived from the XML schema. Each state of an automaton represents an XML event and is connected to the XML events that can occur next. The basic principle of XML binary encoding is to assign a code to each state transition. The code for a transition depends only on the number of transitions leaving the same state.
Thus, the length of a code is log n, where n is the number of transitions from the same state. The size of the schema affects the size of the binary-encoded data because the number of automaton states grows as the schema grows. Most XML documents are encoded with more schema information than they need, because each document conforms to only a part of the schema. Therefore, to reduce the size of binary-encoded data further, this paper proposes an EXI-based encoding method that uses a novel optimization technique to extract the part of the schema to which the XML document conforms.

The rest of this paper is organized as follows. Section 2 explains how the schema optimization scheme reduces the size of EXI-encoded data. Section 3 describes the architecture of the implementation. Section 4 presents the experimental results. Section 5 gives the conclusion and future work.

¹ Corresponding author
Computers, Networks, Systems, and Industrial Applications

2 Schema Optimization Method

Binary encoding is based on automata derived from the schema. Each state of an automaton is derived from an XML event, and each transition to a next state is assigned a code. The size of a code is log n, where n is the number of transitions from the same state. Thus, the size of the binary-encoded data grows with the number of automaton transitions. Most XML documents carry more schema information than they need because they conform to only a part of the schema. Therefore, a schema optimization technique that extracts the part of the schema to which the XML document conforms is an efficient way to reduce data size. In addition, this method takes less encoding time because it has fewer automaton transitions than the original method.

The proposed encoding method is divided into two parts: XML structure analysis and optimized schema generation. For communication between these functions, we propose the Fingerprint, which represents the structure of an XML document. A Fingerprint consists of bit flags that indicate the usage of the elements defined in the schema. If the XML document uses an element, its bit flag is set to 1; otherwise it is set to 0. That is, the Fingerprint represents the part of the original schema that matches the XML document.

There are two methods of schema optimization: the Structure optimization method and the Definition optimization method. The Structure optimization method removes unused elements from the definition of their parent node. The Definition optimization method, on the other hand, removes the definitions of the unused elements themselves.
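The Fingerprint and the two optimization methods can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy schema, the element names, and the function names are hypothetical and do not come from the paper, and the code length is taken as log n rounded up to whole bits with base 2. A real EXI grammar also has transitions for events other than child elements, so the bit counts only show the trend.

```python
import math

# Hypothetical toy schema: each element maps to the child elements that its
# definition allows. Not taken from the paper or from any real schema.
SCHEMA = {
    "Program": ["Title", "Synopsis", "Genre", "Credits"],
    "Credits": ["Actor", "Director"],
    "Title": [], "Synopsis": [], "Genre": [], "Actor": [], "Director": [],
}

def code_bits(n):
    """Bits needed to tell n transitions apart (assumes ceil(log2 n))."""
    return math.ceil(math.log2(n)) if n > 1 else 0

def fingerprint(schema, used):
    """One bit flag per schema element: 1 if the document uses it, else 0."""
    return [1 if name in used else 0 for name in schema]

def structure_optimize(schema, fp):
    """Remove unused elements from each parent's definition."""
    used = {name for name, bit in zip(schema, fp) if bit}
    return {parent: [c for c in kids if c in used]
            for parent, kids in schema.items()}

def definition_optimize(schema, fp):
    """Remove the definitions of the unused elements themselves."""
    return {name: kids for (name, kids), bit in zip(schema.items(), fp) if bit}

# A document that only uses Program, Title and Genre.
used = {"Program", "Title", "Genre"}
fp = fingerprint(SCHEMA, used)
pruned = definition_optimize(structure_optimize(SCHEMA, fp), fp)

print(fp)      # bit flags in schema declaration order
print(pruned)  # the part of the schema the document actually uses
print(code_bits(len(SCHEMA["Program"])), code_bits(len(pruned["Program"])))
```

In this toy case the Program state drops from four possible child transitions to two, so under the stated assumption each choice made in that state needs one bit instead of two.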
In Section 4, we measure the improvement in encoding performance when both methods are used.

3 Implementation of Schema Optimization Method

The original EXI encoding process consists of two modules: the EXI grammar generator and the EXI encoder. The inputs of the EXI encoder are the XML document and the grammar that the EXI grammar generator produces from the schema to which the XML document conforms. The schema optimization technique adds two modules to the original process: the XML structure analyzer (XSA) and the Schema generator. The XSA analyzes the XML structure and generates the Fingerprint. The Schema generator generates the optimized schema from the received Fingerprint. The optimized schema is then passed to the EXI grammar generator.
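The hand-off between the two added modules can be sketched as follows. This is a rough illustration under stated assumptions, not the authors' implementation: the function names, the element list, and the choice to carry the Fingerprint as a string of '0' and '1' characters are all hypothetical, and the EXI grammar generator and EXI encoder stages are left out. Only Python's standard `xml.etree.ElementTree` parser is used.

```python
import xml.etree.ElementTree as ET

# Hypothetical ordered list of the element names declared by the full schema.
SCHEMA_ELEMENTS = ["Program", "Title", "Synopsis", "Genre", "Credits", "Actor"]

def xml_structure_analyzer(xml_text, schema_elements):
    """XSA stand-in: return the Fingerprint as a string of '0'/'1' flags."""
    # iter() walks the root element and all of its descendants.
    used = {node.tag for node in ET.fromstring(xml_text).iter()}
    return "".join("1" if name in used else "0" for name in schema_elements)

def schema_generator(schema_elements, fp):
    """Schema generator stand-in: keep only the elements flagged in fp."""
    return [name for name, flag in zip(schema_elements, fp) if flag == "1"]

doc = "<Program><Title>News</Title><Genre>Factual</Genre></Program>"
fp = xml_structure_analyzer(doc, SCHEMA_ELEMENTS)
optimized = schema_generator(SCHEMA_ELEMENTS, fp)

print(fp)         # Fingerprint passed from the XSA to the Schema generator
print(optimized)  # element declarations kept for the EXI grammar generator
```

Because the Schema generator needs only the original schema and the Fingerprint, any party holding both can rebuild the same optimized schema.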

Fig. 1. Module structure of the encoding process. (a) Module structure of the original EXI encoding process. (b) Module structure of the EXI encoding process with the schema optimization method added.

4 Experiments

The TV-Anytime Forum metadata specification [3], made by the BBC, was used for the experiment. EXIficient, developed by Siemens as an open-source project, was used as the EXI encoding engine. Fig. 2 shows the results of the experiments. Graph (a) shows that the schema-optimization-based encoding method produces smaller encoded data than the original encoding method. Graph (b) shows that it also requires less encoding time than the original encoding method.

Fig. 2. Comparison of the schema-optimization-based encoding method (legend: Schema Optimization Method) with the original method (legend: Original Method). (a) Comparison of encoded size. (b) Comparison of encoding speed.

5 Conclusion and Future Work

This paper analyzed XML binary encoding methods and proposed a method that improves schema-informed XML binary encoding through a schema optimization technique. The technique is divided into two modules, the XML analyzer and the Schema generator. We also proposed the Fingerprint format for communication between these modules. A Fingerprint represents which elements of the XML Schema are used by the XML document that conforms to it. As the experimental results show, the schema-optimization-based encoding method is more efficient than the original encoding method. We conclude that the schema optimization technique is useful for reducing the size and time cost of XML binary encoding and could improve the utilization of Internet bandwidth.

One concern in this work is that the Schema generator is also required on the client side to decode the binary data. As mobile devices become widely used, client-side issues are becoming an important part of binary-encoded XML data transmission. In that respect, this technique requires less computational cost because the optimized schema has fewer automaton transitions.

A further concern is that the motivation of the schema optimization technique is to reduce the number of automaton transitions derived from the schema. Because the deep levels of a schema contain only a few elements, the efficiency of the technique depends on the schema optimization level. In the future, we will define optimization levels and discuss the efficiency of each level.

6 Acknowledgement

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (grant number: ).

References

1. J. Heuer, C. Thienot, and M. Wollborn, MPEG-7 Binary Format. MPEG-7, 2002, ch. 2.3, Introduction to MPEG-7: Multimedia Content Description Interface.
2. J. Schneider and T. Kamiya, "Efficient XML Interchange (EXI) Format 1.0," Candidate Recommendation 08 December
3. "The TV-Anytime Metadata specification", anytime.org/workinggroups/wgmd.html

Session 4B