Published by Brice Hyder. Modified about 1 year ago.

1
IAEA International Atomic Energy Agency
Unicode Compliance and Future of WinFIBRE
36th Consultative Meeting of INIS Liaison Officers
4-5 October 2012, Vienna, Austria
Lubomir ILIEV

2
IAEA Unicode Compliance
What is Unicode?
Why do we need Unicode?
How did we implement Unicode?
What is the impact?

3
IAEA Unicode Compliance ASCII Table

4
IAEA Character encoding standards
ASCII (7 bit)
Extended ASCII (8 bit)
Codepage (Windows-1252)
Unicode
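The difference between these standards can be illustrated with a minimal Python sketch. The helper name `try_encode` and the sample characters are illustrative assumptions, not part of the presentation; the codec names `ascii`, `cp1252` and `utf-8` are standard Python codecs, with UTF-8 standing in here for one possible Unicode encoding.

```python
def try_encode(ch, codec):
    """Return the bytes for ch in the given codec, or None if it cannot be encoded."""
    try:
        return ch.encode(codec)
    except UnicodeEncodeError:
        return None


if __name__ == "__main__":
    # Sample characters (chosen for illustration): A, e-acute, euro sign, Greek omega
    for ch in ["A", "\u00e9", "\u20ac", "\u03c9"]:
        for codec in ("ascii", "cp1252", "utf-8"):
            print(f"U+{ord(ch):04X} {codec:7} {try_encode(ch, codec)}")
```

Only the Unicode encoding can represent all four characters; 7-bit ASCII covers the first one only, and Windows-1252 has no slot for the Greek letter.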

5
IAEA Unicode Compliance
0000 C0 Controls and Basic Latin 007F
    0123456789ABCDEF
20   !"#$%&'()*+,-./
30  0123456789:;<=>?
40  @ABCDEFGHIJKLMNO
50  PQRSTUVWXYZ[\]^_
60  `abcdefghijklmno
70  pqrstuvwxyz{|}~

6
IAEA Unicode Compliance
2100 Letterlike Symbols 214F
2100 ℀℁ℂ℃℄℅℆ℇ℈℉ℊℋℌℍℎℏ
2110 ℐℑℒℓ℔ℕ№℗℘ℙℚℛℜℝ℞℟
2120 ℠℡™℣ℤ℥Ω℧ℨ℩KÅℬℭ℮ℯ
2130 ℰℱℲℳℴℵℶℷℸ
2140 ⅍ⅎ

7
IAEA Unicode Compliance
Why do we need Unicode?
All others use Unicode
Statistics for Elsevier Batch 28:
Total records
Different Unicode characters
Total Unicode characters
Average ~ 9 chars/record
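Statistics of this kind could be gathered with a short script. The sketch below is an assumption about how such counts might be computed, not the procedure actually used for the Elsevier batch: it treats every character above U+007F as a "Unicode character", and the function name `unicode_stats` and the sample records are hypothetical.

```python
from collections import Counter


def unicode_stats(records):
    """Count non-ASCII characters over a list of record strings."""
    counts = Counter(ch for rec in records for ch in rec if ord(ch) > 0x7F)
    total = sum(counts.values())
    return {
        "records": len(records),       # total records
        "distinct": len(counts),       # different non-ASCII characters
        "total": total,                # total non-ASCII characters
        "average": total / len(records) if records else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical sample records, for illustration only
    sample = ["\u03c9 and \u03c9", "plain", "\u03c0\u03c9"]
    print(unicode_stats(sample))
```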

8
IAEA Unicode Compliance Why do we need UNICODE Expressions (Formulas) Letters Bullets and other characters How we process data without using UNICODE

9
IAEA Unicode Compliance How we implemented Unicode Restrictions/Limitations (Fibre IRPS) Different representation: TTF/XML Extending the standard for “Logical Characters” (INIS-1 Guide to bibliographic description)

10
IAEA Unicode Compliance Current “Logical Characters” (72 total)

11
IAEA Unicode Compliance
The “new” set contains 2155 symbols
0080 C1 Controls and Latin-1 Supplement
0100 Latin Extended-A
0370 Greek and Coptic
2000 General Punctuation
20A0 Currency Symbols
2100 Letterlike Symbols
2150 Number Forms
2190 Arrows
2200 Mathematical Operators
2300 Miscellaneous Technical
25A0 Geometric Shapes
2600 Miscellaneous Symbols
27F0 Supplemental Arrows-A
2900 Supplemental Arrows-B
2980 Miscellaneous Mathematical Symbols-B
2A00 Supplemental Mathematical Operators
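A check of whether a character falls into one of the listed blocks might look like the sketch below. The block start points come from the slide; the end points are the block ranges of the Unicode standard and are an addition here. The exact 2155-symbol subset is not given in the presentation, so the sketch tests block membership only, and the names `BLOCKS`, `block_of` and `outside_blocks` are hypothetical.

```python
# (start, end, name); start points as on the slide, end points per the Unicode block ranges
BLOCKS = [
    (0x0080, 0x00FF, "C1 Controls and Latin-1 Supplement"),
    (0x0100, 0x017F, "Latin Extended-A"),
    (0x0370, 0x03FF, "Greek and Coptic"),
    (0x2000, 0x206F, "General Punctuation"),
    (0x20A0, 0x20CF, "Currency Symbols"),
    (0x2100, 0x214F, "Letterlike Symbols"),
    (0x2150, 0x218F, "Number Forms"),
    (0x2190, 0x21FF, "Arrows"),
    (0x2200, 0x22FF, "Mathematical Operators"),
    (0x2300, 0x23FF, "Miscellaneous Technical"),
    (0x25A0, 0x25FF, "Geometric Shapes"),
    (0x2600, 0x26FF, "Miscellaneous Symbols"),
    (0x27F0, 0x27FF, "Supplemental Arrows-A"),
    (0x2900, 0x297F, "Supplemental Arrows-B"),
    (0x2980, 0x29FF, "Miscellaneous Mathematical Symbols-B"),
    (0x2A00, 0x2AFF, "Supplemental Mathematical Operators"),
]


def block_of(ch):
    """Return the name of the listed block containing ch, or None."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return None


def outside_blocks(text):
    """List the non-ASCII characters of text that fall outside every listed block."""
    return [ch for ch in text if ord(ch) > 0x7F and block_of(ch) is None]
```

For example, `block_of("\u210f")` returns `"Letterlike Symbols"`, while a CJK ideograph would be reported by `outside_blocks`.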

12
IAEA Unicode Compliance
Compatibility
Upward compatibility: “old” records will remain valid
Downward compatibility, 2 levels:
0 - Unicode
1 - TTF-style encoding + text

13
IAEA Unicode Compliance
Original text
In the WKB regime, qQ ≫ ℏ, we obtain a simple expression for the fundamental quasinormal resonances: ω = qQ/r_+ − i2πT_{BH}(n+1/2), ...
TTF format
In the WKB regime, qQ#Much Greater-Than##Planck Constant Over Two Pi#, we obtain a simple expression for the fundamental quasinormal resonances: #Greek Small Letter Omega#=qQ/r_+#Minus Sign#i2#Greek Small Letter Pi#T_B_H(n+1/2 ),...
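The `#...#` tokens in the TTF format match the official Unicode character names written in title case, so a conversion in both directions can be sketched with Python's standard `unicodedata` module. This is an illustration under that assumption, not the WinFIBRE implementation; the function names `to_ttf` and `from_ttf` are hypothetical, and a literal `#` in the input text is not handled.

```python
import re
import unicodedata


def to_ttf(text):
    """Replace each non-ASCII character with #Unicode Name In Title Case#."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            out.append(ch)
        else:
            name = unicodedata.name(ch, "")
            # Characters without a Unicode name are left unchanged
            out.append("#" + name.title() + "#" if name else ch)
    return "".join(out)


_TOKEN = re.compile(r"#([^#]+)#")


def from_ttf(text):
    """Replace each #Name# token with the named Unicode character, if one exists."""
    def repl(match):
        try:
            return unicodedata.lookup(match.group(1).upper())
        except KeyError:
            return match.group(0)  # unknown name: keep the token as written
    return _TOKEN.sub(repl, text)
```

With these definitions, `to_ttf("qQ \u226b\u210f")` gives `qQ #Much Greater-Than##Planck Constant Over Two Pi#`, and `from_ttf` restores the original string.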

14
IAEA Unicode Compliance
Level 0: Unicode
In the WKB regime, qQ ≫ℏ, we obtain a simple expression for the fundamental quasinormal resonances: ω=qQ/r_+−i2πT_B_H(n+1/2 ),…
Level 1: TTF-style encoding + text
In the WKB regime, qQ Much-Greater-Than Planck-Constant-Over-Two-Pi, we obtain a simple expression for the fundamental quasinormal resonances: #omega#=qQ/r_+-i2#pi#T_B_H(n+1/2 ),…
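The Level 1 example suggests a fallback rule: characters that already have a legacy "logical character" keep their short `#name#` token, the minus sign becomes an ASCII hyphen, and everything else is spelled out as hyphenated text. The sketch below reproduces that behaviour for the example only. The two-entry `LEGACY_LOGICAL` table is a hypothetical excerpt (the real table has 72 entries), and the spacing rule between consecutive spelled-out names is inferred from the slide, not documented.

```python
import unicodedata

# Hypothetical excerpt of the legacy "logical characters" table
LEGACY_LOGICAL = {"\u03c9": "#omega#", "\u03c0": "#pi#"}
# Assumed plain-ASCII substitutions (only the minus sign appears in the example)
ASCII_FALLBACK = {"\u2212": "-"}


def to_level1(text):
    """Downgrade Unicode text to legacy tokens plus spelled-out names."""
    out = []
    prev_was_name = False
    for ch in text:
        is_name = False
        if ord(ch) < 128:
            piece = ch
        elif ch in LEGACY_LOGICAL:
            piece = LEGACY_LOGICAL[ch]
        elif ch in ASCII_FALLBACK:
            piece = ASCII_FALLBACK[ch]
        else:
            name = unicodedata.name(ch, "")
            piece = name.title().replace(" ", "-") if name else "?"
            is_name = True
            if prev_was_name:
                piece = " " + piece  # keep adjacent spelled-out names apart
        out.append(piece)
        prev_was_name = is_name
    return "".join(out)
```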

15
IAEA Unicode Compliance
Is using Unicode enough?
Glyphs not in Unicode
Expressions and Formulas
Future possible directions: Using TeX and/or MathML

16
IAEA Future of WinFIBRE
What is FIBRE?
1. Definition
Specialised text editor with checking capabilities
2. Functionalities
Existing: create and edit records; verify records; field-specific editing
Planned: edit existing records (“change record”); manage record relations

17
IAEA What is FIBRE?
Current limitations for input preparation
1. Checking rules
Error messages are correct but not always helpful
Bibliographic and indexing parts cannot be checked separately
2. Editing
Difficult to create and read mathematical and other expressions
Cannot manage relations between records

18
IAEA Future of WinFIBRE
Decision factors
Performance vs. user friendliness
Typing vs. dialogs
Record editing vs. records editing

19
IAEA Future of WinFIBRE
Recently implemented functionalities
Expert system
Under development
Change record (under internal tests)
Re-design WinFIBRE checking rules
Unicode compliance for input preparation
Improve records editing functionalities
Manage record dependencies

20
IAEA Unicode Compliance for input preparation
Added functionality in WinFIBRE
Edit/Paste
Edit/Insert Special Character…
Preview (Visualisation)
2299 ⊙ Circled Dot Operator
= direct product
= vector pointing out of page
→ 0298 ʘ latin letter bilabial click
→ 2609 ☉ sun
→ 2A00 ⨀ n-ary circled dot operator
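The four look-alike characters above differ only in code point and name, which is why an "Insert Special Character" dialog needs to show both. A minimal sketch of such a listing, using Python's standard `unicodedata` module (the helper name `describe` is hypothetical and unrelated to WinFIBRE's own code):

```python
import unicodedata

# Code points taken from the slide: circled dot operator and its look-alikes
LOOKALIKES = ["\u2299", "\u0298", "\u2609", "\u2a00"]


def describe(ch):
    """Return 'CODEPOINT CHAR NAME' for a single character."""
    return f"{ord(ch):04X} {ch} {unicodedata.name(ch, 'UNNAMED')}"


if __name__ == "__main__":
    for ch in LOOKALIKES:
        print(describe(ch))
```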

21
IAEA Unicode Compliance for input preparation Edit/copy-paste #integral#_#alpha#"#beta#x"#gamma#"-"1 dx
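Reading `_` as a marker placed before each subscript character and `"` as a marker placed before each superscript character (an assumption based on the coded and TeX forms shown side by side in the Preview example), the pasted string would correspond to:

```latex
\int_{\alpha}^{\beta} x^{\gamma-1}\,dx
```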

22
IAEA Unicode Compliance for input preparation
Edit/copy-paste
E(f)=defp(D#Divides#f,#alpha#,#beta#)=#Integral#p(D#Divides#w,#beta#,f)p(w#Divides##alpha#,f)dw.

23
IAEA Unicode Compliance for input preparation
Preview
P(G_p_t_,_ _m_-_v_e_c_t_o_r_ _,q) =#Greek Capital Letter Sigma#_i__1_=_1"3#Greek Capital Letter Sigma#_i__2_=_1"3 c_G__p__t_,_i__1_ _i__2#lambda#_i__1"m"_1#lambda#_i__2"m"_2
P(G_{{\rm pt},\vec{m}},q) = \sum _{i_1=1}^3 \sum _{i_2=1}^3 c_{{G_{{\rm pt}}},i_1 i_2} \lambda _{i_1}^{m_1}\lambda _{i_2}^{m_2}

24
IAEA Improve records editing functionalities

25
IAEA Manage record dependencies

26
IAEA Manage record dependencies

27
IAEA Manage record dependencies

28
IAEA Unicode Compliance and Future of WinFIBRE Thank you for your attention!
