Assembling, Repurposing And Manipulating Document Content Using The New Office File Format Brian Jones OFF 304 Program Manager Microsoft Corporation.


1 Assembling, Repurposing And Manipulating Document Content Using The New Office File Format Brian Jones OFF 304 Program Manager Microsoft Corporation

2 Agenda Overview of the new formats Role of XML in documents Evolution of MS Office file formats Microsoft Office Open XML format architecture Components of the new formats Reference schemas and custom defined schemas Developing against the formats Visual Studio Tools for Office (VSTO) support Sample solution scenarios Demos throughout

3 Microsoft Office Open XML Formats New Default Formats: New XML file formats for Word, Excel and PowerPoint New file type extensions Interoperable: Open, transparent format improves interoperability Published file format specification with royalty-free license Transparent, XML format enables new integration scenarios for documents and LOB systems Added Benefits: compact and robust ZIP container allows for standard compression on all files without user effort (Dramatic file size improvements) Significantly more robust files to help minimize data loss Backward Compatible: Office 2000, Office XP, Office 2003 will all support the new formats Patches for compatibility available by launch Open, edit and save new formats Legacy support: Current Office 97-2003 binary file formats supported Support for XML formats from Office 2003, Office XP continued Developers: Endless potential for developers Build solutions to read, write, and modify Office files (without the need to run Office APIs)

4 Office Open XML Formats Brian Jones Program Manager Microsoft Word Demo 1 & 2

5 The Role Of XML With Documents
Scenario: Document Assembly. Server-based or user-assisted construction of documents from archived content or database content. Example: Create sales reports from financial and forecast data stored in a CRM system.
Scenario: Content Reuse. Much easier to move content between documents, including different document types. Example: Apply content stored in Word documents to Web pages quickly and efficiently.
Scenario: Content Tagging. Add domain-specific metadata to document content to enable custom solutions. Example: Tag presentations using a specific taxonomy to improve knowledge management efficiency.
Scenario: Document Interrogation. Query document repositories based on custom data, content types or document metadata. Example: Search for all documents containing a specific company name or sales contact.
Scenario: Document Sanitization. Remove unwanted content like comments or embedded code from your document when appropriate. Example: Remove all tracked changes and comments from a Word document before it is published.
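
The document interrogation scenario can be sketched in a few lines once documents are ZIP packages of XML parts. This is a minimal sketch, not the approach shown in the session: it assumes the main text lives in a part named `word/document.xml` (the name used by the format as later published; the slide names no parts), and `make_package`, `find_documents_mentioning` and the sample repository are hypothetical.

```python
import io
import re
import zipfile


def make_package(body_text: str) -> bytes:
    """Build a stand-in package in memory (hypothetical sample, not a full document)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("word/document.xml", f"<document><p>{body_text}</p></document>")
    return buf.getvalue()


def find_documents_mentioning(repository: dict[str, bytes], phrase: str) -> list[str]:
    """Return the names of packages whose main part mentions `phrase` (case-insensitive)."""
    hits = []
    for name, package in repository.items():
        with zipfile.ZipFile(io.BytesIO(package)) as zf:
            if "word/document.xml" not in zf.namelist():
                continue  # not a word-processing package in this sketch
            xml_text = zf.read("word/document.xml").decode("utf-8")
        # Crude tag stripping: enough for a sketch, not a real text extractor.
        plain = re.sub(r"<[^>]+>", " ", xml_text)
        if phrase.lower() in plain.lower():
            hits.append(name)
    return sorted(hits)
```

A real search would need to handle text that is split across several runs in the markup, which this substring check does not.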

6 Open XML Formats Architecture User view: single Office file Questionnaire.docx Document Parts Most parts are XML Each XML part is a discrete, compressed component Can add, extract and modify individual parts without using Office programs Corruption or absence of any part would not prohibit the file from being opened Developer view: modular file
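
The "developer view" on this slide can be illustrated with any ZIP library. The following is a minimal sketch using Python's standard `zipfile` module; the part names mirror the layout of the format as later published and are assumptions here, and the helper functions are hypothetical, not part of any Office tooling.

```python
import io
import zipfile


def build_sample_package() -> bytes:
    """Build a tiny in-memory ZIP standing in for Questionnaire.docx (illustrative only)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("[Content_Types].xml", "<Types/>")
        zf.writestr("word/document.xml", "<document>Hello</document>")
        zf.writestr("docProps/custom.xml", "<properties/>")
    return buf.getvalue()


def list_parts(package: bytes) -> list[str]:
    """Return the names of all parts in the package."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        return zf.namelist()


def read_part(package: bytes, name: str) -> str:
    """Extract one part as text without touching the others."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        return zf.read(name).decode("utf-8")


def replace_part(package: bytes, name: str, new_content: str) -> bytes:
    """Return a new package in which one part has been swapped out."""
    out = io.BytesIO()
    with zipfile.ZipFile(io.BytesIO(package)) as src, \
            zipfile.ZipFile(out, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            if info.filename == name:
                dst.writestr(info.filename, new_content.encode("utf-8"))
            else:
                dst.writestr(info.filename, src.read(info.filename))
    return out.getvalue()
```

The same three functions work on a file saved by Office if the bytes are read from disk instead of built in memory.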

7 Create A Document From Scratch Brian Jones Program Manager Microsoft Word Demo 3

8 Components Of The New Formats We make heavy use of the Open Packaging Conventions These are the same conventions used by the XPS guys, and you can leverage the same APIs for accessing Office files Package – ZIP Container Part – The files inside the ZIP Content Types – Each part has a content type that is enforced on open Relationships – Any part that references another part must do so via a relationship
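
The four components named above (package, part, content type, relationship) can be seen together in a sketch that builds a small package from scratch and then navigates it the way a consumer would. This is a sketch under stated assumptions: the namespace URIs and content-type strings are those of the Open Packaging Conventions and WordprocessingML as later standardized, which may differ from the pre-release bits shown in the session, and the function names are hypothetical.

```python
import io
import posixpath
import zipfile
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Assumed values, taken from the later published standard.
CT_NS = "http://schemas.openxmlformats.org/package/2006/content-types"
REL_NS = "http://schemas.openxmlformats.org/package/2006/relationships"
OFFICE_DOC_REL = ("http://schemas.openxmlformats.org/officeDocument/2006/"
                  "relationships/officeDocument")
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
RELS_CT = "application/vnd.openxmlformats-package.relationships+xml"
MAIN_CT = ("application/vnd.openxmlformats-officedocument."
           "wordprocessingml.document.main+xml")


def build_minimal_docx(text: str) -> bytes:
    """Assemble a one-paragraph package: content types, root relationships, main part."""
    content_types = (
        f'<Types xmlns="{CT_NS}">'
        f'<Default Extension="rels" ContentType="{RELS_CT}"/>'
        '<Default Extension="xml" ContentType="application/xml"/>'
        f'<Override PartName="/word/document.xml" ContentType="{MAIN_CT}"/>'
        '</Types>'
    )
    root_rels = (
        f'<Relationships xmlns="{REL_NS}">'
        f'<Relationship Id="rId1" Type="{OFFICE_DOC_REL}" Target="word/document.xml"/>'
        '</Relationships>'
    )
    document = (
        f'<w:document xmlns:w="{W_NS}"><w:body><w:p><w:r>'
        f'<w:t>{escape(text)}</w:t>'
        '</w:r></w:p></w:body></w:document>'
    )
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("[Content_Types].xml", content_types)
        zf.writestr("_rels/.rels", root_rels)
        zf.writestr("word/document.xml", document)
    return buf.getvalue()


def main_document_part(zf: zipfile.ZipFile):
    """Follow the package-level relationship to the main document part name."""
    rels = ET.fromstring(zf.read("_rels/.rels"))
    for rel in rels.findall(f"{{{REL_NS}}}Relationship"):
        if rel.get("Type") == OFFICE_DOC_REL:
            return posixpath.normpath(posixpath.join("/", rel.get("Target")))
    return None


def content_type_of(zf: zipfile.ZipFile, part_name: str):
    """Look up a part's content type: Override entries win over extension Defaults."""
    types = ET.fromstring(zf.read("[Content_Types].xml"))
    for override in types.findall(f"{{{CT_NS}}}Override"):
        if override.get("PartName") == part_name:
            return override.get("ContentType")
    extension = part_name.rsplit(".", 1)[-1].lower()
    for default in types.findall(f"{{{CT_NS}}}Default"):
        if default.get("Extension").lower() == extension:
            return default.get("ContentType")
    return None
```

Writing the result of `build_minimal_docx("Hello")` to a file with a `.docx` extension should give a document that a conforming consumer can open, although that was not verified against any particular Office build here.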

9 Modifying An Excel Spreadsheet Brian Jones Program Manager Microsoft Word Demos 4 & 5

10 The Role Of XML Reference and custom-defined schemas XML Reference Schemas Display-oriented (e.g. Bold, Italics, Tables, Paragraphs, Styles) Open Document Format Enable Archival & File Formats Interoperability Custom-defined Schemas Data-oriented (e.g., Price, Invoice) Represents the business information stored in the document Enable System Integration

11 The Role Of XML Reference and custom-defined schemas XML Reference Schemas Display-oriented (e.g. Bold, Italics, Tables, Paragraphs, Styles) Open Document Format Enable Archival & File Formats Interoperability John Doe Health Agency

12 The Role Of XML Reference and custom-defined schemas Custom-defined Schemas Data-oriented (e.g., Price, Invoice) Represents the business information stored in the document Enable System Integration Sample data: Health Agency 100 25% …
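
To make the "data-oriented" idea concrete, here is a minimal sketch of business data marked up with a custom schema and read back by a line-of-business system. Everything in it is hypothetical: the `urn:example:invoice` namespace, the element names and the sample values are invented for illustration and do not come from the session.

```python
import xml.etree.ElementTree as ET
from decimal import Decimal

# Hypothetical customer-defined namespace and markup.
NS = {"inv": "urn:example:invoice"}

SAMPLE_INVOICE = """\
<inv:Invoice xmlns:inv="urn:example:invoice">
  <inv:Customer>Example Customer</inv:Customer>
  <inv:Item>
    <inv:Description>Widget</inv:Description>
    <inv:Price>100.00</inv:Price>
  </inv:Item>
  <inv:Item>
    <inv:Description>Gadget</inv:Description>
    <inv:Price>25.50</inv:Price>
  </inv:Item>
</inv:Invoice>
"""


def customer_name(xml_text: str) -> str:
    """Read the customer name from the custom-schema markup."""
    root = ET.fromstring(xml_text)
    return root.findtext("inv:Customer", default="", namespaces=NS)


def invoice_total(xml_text: str) -> Decimal:
    """Sum every Price element, using Decimal to avoid float rounding."""
    root = ET.fromstring(xml_text)
    return sum((Decimal(p.text) for p in root.findall("inv:Item/inv:Price", NS)),
               Decimal("0"))
```

The point of the sketch is that the consuming code refers only to business terms such as `Price` and never to display markup such as bold or paragraphs.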

13 XML Data Store Brian Jones Program Manager Microsoft Word Demo 7

14 Developing Against The Formats More Reliable Solutions Third-party tools were the main cause of document corruption Fully Documented Formats Freely available for download with a royalty-free license Office file format schemas - Used to validate content for a given part Samples, samples, samples In the form of code snippets for easier use and integration into your VSTO solutions WinFX Packaging APIs Office Open XML Formats use the Open Packaging Conventions Access/maintain parts and relationships within a file Takes care of all ZIP level functionality XPath Navigation within content XML DOM Manipulating content Office Open XML Resource Kit Tools for constructing and deconstructing the new file formats Design time Validation tool Parses a file and reports on schema, relationship errors and warnings Runtime serialization tool Flattens package into a single file for ease of development in simple construction scenarios
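
The "XPath for navigation, XML DOM for manipulation" split can be sketched without the WinFX APIs. The example below uses Python's `xml.etree.ElementTree`, which supports only a subset of XPath, as a stand-in; the WordprocessingML namespace is the one from the later published standard, and `SAMPLE_PART`, the `[NAME]` placeholder and both helper functions are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Assumed namespace, taken from the later published standard.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}
ET.register_namespace("w", W_NS)  # keep the w: prefix when serializing

# Hypothetical content of a main document part with a placeholder run.
SAMPLE_PART = (
    f'<w:document xmlns:w="{W_NS}"><w:body>'
    '<w:p><w:r><w:t>Dear </w:t></w:r><w:r><w:t>[NAME]</w:t></w:r></w:p>'
    '<w:p><w:r><w:t>Thank you.</w:t></w:r></w:p>'
    '</w:body></w:document>'
)


def paragraph_texts(root: ET.Element) -> list[str]:
    """Navigate with path expressions: one string per paragraph, runs joined."""
    return [
        "".join(t.text or "" for t in p.findall(".//w:t", NS))
        for p in root.findall(".//w:p", NS)
    ]


def fill_placeholder(part_xml: str, placeholder: str, value: str) -> str:
    """Manipulate the tree in place, then serialize the part back to XML text."""
    root = ET.fromstring(part_xml)
    for t in root.findall(".//w:t", NS):
        if t.text and placeholder in t.text:
            t.text = t.text.replace(placeholder, value)
    return ET.tostring(root, encoding="unicode")
```

The updated XML string would then be written back into the package as the replacement for the original part.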

15 Programming Against The Formats Brian Jones Program Manager Microsoft Word Demo 8

16 VSTO Support For XML Formats VSTO application manifest becomes a part Enables easier deployment and redeployment of VSTO solutions Cached data feature of VSTO will be fully supported in new file formats VSTO's ServerDocument object will be able to manipulate the new file formats without starting Office applications

17 Sample Solution Scenarios Data interoperability Content manipulation Content sharing and reuse Document assembly Document security Managing sensitive information Document styling Document profiling

18 Next Steps Schemas: Sneak peek at the Office 12 schemas We will provide an initial draft of the schemas by the end of this week. See my blog for more details Beta 1 Register for Beta 1 which comes out in the fourth quarter of this year

19 Recommended Sessions & Labs Upcoming Sessions OFF 316: Word 12: Integrating Business Data into Documents using XML-based Data/View Separation and Programmability Tristan Davis - Room 406AB (Thursday @ 11:30) PRS 333: Advances in Document Workflow, Securing, Viewing, and Printing Your Content Gregg Brown - Room 502AB (Wednesday @ 3:15) OFF 322: Building a Solution Using a Spreadsheet in Server-Based Scenarios Danny Khen - Room 404AB (Friday @ 8:30) Prior Sessions OFF 201: Office 12: Introduction to the Programmable Customization Model for the Office 12 User Experience (Part 1) Jensen Harris - Room 515AB (Today @ 1) OFF 302: Office 12: Developing with the Programmable Customization Model for the Office 12 User Experience (Part 2) Andy Himberger - Room 402AB (Today @ 2:45) DAT 304: Unleashing the Power of XPS-Based File Formats for your Application Jesse McGatha - Room 408AB (Today @ 2:45)

20 Resources Office Preview Site: http://www.microsoft.com/office/preview/ Brian Jones's Blog: http://blogs.msdn.com/Brian_Jones/ Office 2003 Reference Schema Information: http://www.microsoft.com/office/xml/ Office Developer Center: http://msdn.microsoft.com/office/

21 Questions

22 © 2005 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

