Training Session B: Preparing and Importing Data

Slides:



Advertisements
Similar presentations
Lesson 12 Getting Started with Excel Essentials
Advertisements

© Paradigm Publishing, Inc Excel 2013 Level 2 Unit 2Managing and Integrating Data and the Excel Environment Chapter 7Automating Repetitive Tasks.
 Use the Left and Right arrow keys or the Page Up and Page Down keys to move between the pages. You can also click on the pages to move forward.  To.
COMPREHENSIVE Excel Tutorial 8 Developing an Excel Application.
MICROSOFT WORD GETTING STARTED WITH WORD. CONTENTS 1.STARTING THE PROGRAMSTARTING THE PROGRAM 2.BASIC TEXT EDITINGBASIC TEXT EDITING 3.SAVING A DOCUMENTSAVING.
CHAPTER 9 Introducing Microsoft Office Learning Objectives Start Office programs and explore common elements Use the Ribbon Work with files Use.
CHAPTER 9 Introducing Microsoft Office Learning Objectives Start Office programs and explore common elements Use the Ribbon Work with files Use.
XP 1 Microsoft Access 2003 Introduction To Microsoft Access 2003.
Excel Lesson 1 Excel Basics
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. WORD 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 15 Advanced Tables.
1 Lesson 12 Getting Started with Excel Essentials Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
Chapter 2: Excel Basics and Formatting Spreadsheet-Based Decision Support Systems Prof. Name Position (123) University Name.
Microsoft Office 2013 Try It! Chapter 4 Storing Data in Access.
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 18 Getting Started with Excel Essentials 1 Morrison / Wells / Ruffolo.
Pasewark & Pasewark 1 Excel Lesson 1 Excel Basics Microsoft Office 2007: Introductory.
Microsoft Excel Illustrated Introductory Workbooks and Preparing them for the Web Managing.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
Chapter 8 Using Document Collaboration, Integration, and Charting Tools Microsoft Word 2013.
Exploring Office Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Chapter 1 – Introduction to Excel: What is a Spreadsheet?
Chapter 11 Enhancing an Online Form and Using Macros Microsoft Word 2013.
Understanding Microsoft Excel
Lecturer: Dalia Mirghani
Lesson 11 Exploring Microsoft Office 2007
Excel Tutorial 8 Developing an Excel Application
Practical Office 2007 Chapter 2
Understanding Microsoft Excel
5.2 Microsoft Excel.
Chapter 2: The Visual Studio .NET Development Environment
International Computer Driving Licence Syllabus version 5.0
Building a User Interface with Forms
Microsoft Excel Basics
Managing Worksheets And Workbooks
Microsoft Excel A Spreadsheet Program.
Guide To UNIX Using Linux Third Edition
Tutorial 6: Managing Multiple Worksheets and Workbooks
Using Excel with Google Maps
Tutorial 1 – Creating a Document
EXCEL Introduction.
Microsoft Excel 101.
Training Session A: Introduction to the StatPlanet Interface
Training Session D: Loading Shapefiles into StatPlanet
Unit 4: Using Spreadsheets to Make Economic Choices Lessons 20–26
Introducing Microsoft Office 2010
Understanding Microsoft Excel
5.2 Microsoft Excel.
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
MODULE 7 Microsoft Access 2010
Lesson 17 Getting Started with Excel Essentials
Lesson 18 Getting Started with Excel Essentials
Microsoft Excel 101.
Using Charts in a Presentation
Chapter 2 Creating a Research Paper with References and Sources
Word offers a number of features to help you streamline the formatting of documents. In this chapter, you will learn how to use predesigned building blocks.
Microsoft Official Academic Course, Access 2016
Creating a Word Document – Part 1
Using Cascading Style Sheets (CSS)
Microsoft PowerPoint 2007 – Unit 2
Understanding Microsoft Excel
Objectives At the end of this session, students will be able to:
Guidelines for Microsoft® Office 2013
Running a Java Program using Blue Jay.
Key Applications Module Lesson 16 — Excel Essentials
Exploring Microsoft® Office 2016 Series Editor Mary Anne Poatsy
Using Microsoft Outlook: Outlook Support Number
Chapter 8 Using Document Collaboration and Integration Tools
Key Applications Module Lesson 14 — Working with Tables
Microsoft Excel 2007 – Level 2
Exploring Microsoft Excel 2003
Presentation transcript:

Training Session B: Preparing and Importing Data Nov 2013 Training Session B: Preparing and Importing Data www.statsilk.com Creative Commons BY-SA 3.0

Preparing & Importing Data Into StatPlanet Anatomy of Data Editor Worksheets StatPlanet Data Structure Importing Data Identifying & Fixing Import Errors StatPlanet User Guide: Chapter 5

Anatomy of the StatPlanet Data Editor StatPlanet User Guide: Chapter 5.2 Worksheet descriptions

Key StatPlanet Data Editor Worksheets Purpose Import Set up your Shapefile map Import, organise and customise data Alter display names for territories Import Names Stores the names, aliases & variant names of places in the map Indicates import failures Data Identical to ‘Import’ but uses map IDs rather than names Generated by the ‘Save Data’ macro in ‘Import’ sheet The data.csv file generated is identical to the contents of this sheet Map regions Specify custom groupings of regions Specify custom zooms to regions Settings Customise appearance and functionality of StatPlanet StatPlanet User Guide: Chapter 5.2

Other StatPlanet Data Editor Worksheets Purpose Text-Translations Contains translations of interface elements Ability to add new languages, modify interface text Tools A range of tools relating to data management – use with caution Includes ability to upgrade from older versions of StatPlanet Help Some help on common Data Editor tasks License Description of license terms and conditions StatPlanet User Guide: Chapter 5.2 - Languages – non-Latin languages usually require a special version of StatPlanet to be prepared, e.g. for Hindi or Chinese fonts, and are not supported by default.

Importing into the StatPlanet Data Structure StatPlanet User Guide: Chapter 5.1, 5.4 How the Data Editor stores imported data

The StatPlanet Data Editor Open the StatPlanet Data Editor in the Training_Materials \StatPlanet_Vietnam folder, and go to the worksheet ‘Import’ Examine how column A contains the categories, column B the time period, and column C the indicators. Columns D to K are for various optional customizations and can be empty. Map regions are in the columns from L onwards. Run StatPlanet.exe in the same folder, and see how the data is displayed. Categories are displayed in the top menu – try clicking on them to switch to a different map. Explore the interface a bit, e.g. select an indicator in the top left panel, move the mouse over the map regions and over the graph ‘bars’. Close StatPlanet StatPlanet Training 2013 11/11/2018

Importing Prepared Data Go back to the StatPlanet Data Editor, and: Click on ‘Clear data’, select ‘Yes’ to remove all data. Click on the ‘Import data’ button Navigate to the Training_Materials\Example_Data folder and select the MDG_one_category_example.xlsx file The data will then automatically be imported. It shouldn’t report any errors. You may wish to change ‘New category’ towards the end of the Import data to ‘Income’ Click on the Save Data button. This saves the StatPlanet Data Editor, and updates the data.csv & settings.csv files (or data.txt & settings.txt files) Run StatPlanet.exe from the StatPlanet_Vietnam folder to view the imported data within StatPlanet. StatPlanet User Guide: Chapter 5.1, 5.4

A closer look at how Import works Examine the data structure in the Data Editor: Place names are listed horizontally across the top of the worksheet Data is organized first by Category (column A), then by Time Period (column B) and then by Indicator (column C) Indicator names are listed vertically Open the MDG_one_category_example.xlsx spreadsheet: Indicator names are listed horizontally, place names are listed vertically There are multiple columns repeating indicators for 4 years Fortunately, the Import macro is clever enough to know all this: It will identify the data structure and transpose vertical / horizontal if needed It can also be ‘taught’ to identify multiple naming variations for regions (the next exercise) StatPlanet User Guide: Chapter 5.1, 5.4

The StatPlanet Data Structure CATEGORY TIME INDICATOR Category A 2012   Indicator A(1) Indicator A(2) Indicator A(3) 2011 2010 Category B 2007 Indicator B(1) Indicator B(2) 2006 2005 Category C 2001 Indicator C(1) Indicator C(2) Basic rules for the data structure: Only one instance of each region name Data is structured first by category, then by time, and then by indicator. Time must be in order from most recent to oldest. Within one category, the list of indicators must be exactly the same for each time period Spacing between the different categories, times and indicators needs to be maintained as shown in the example left. The Import “macro” automatically structures your data this way StatPlanet User Guide: Chapter 5.5, 5.8

Common Data Import Failure Fail – multiple rows or columns for a map area / region Success StatPlanet User Guide: Chapter 5.8 Troubleshooting data import

Importing Unprepared Data Open MDG_one_category_example_errors.xlsx in Excel from the Training_Materials\Example_Data folder Restructure the data as shown in the previous slide (see MDG_one_category_example_errors_fixed.xlsx for an example) Return to the StatPlanet Data Editor (sheet “Import”) Import the file MDG_one_category_example_errors.xlsx Data imports but on completion, you are routed to the ‘Import names’ worksheet to see a list of failed imports Re-open MDG_one_category_example_errors.xlsx in Excel Note that not all the names in this file match the names in the StatPlanet Data Editor StatPlanet User Guide: Chapter 5.1, 5.4

Two ways to resolve naming issues 1. Fix Source Data 2. Add names to Data Editor Do this where name variant is unlikely to be used very often Adjust the names that failed to import in the source data so they will be recognised on import Do this where name version is likely to be used often Also consider doing this if the edits to the source data are very extensive Add the region names in the ‘Import names’ worksheet from row 3 onwards, in the corresponding column

Fix the names in the Data Editor Go to the ‘Import names’ worksheet and locate the list of regions that failed to import (in column B). This is a list of names in the source data file which was not recognized – because it is not listed in the sheet ‘Import names’. (The names in the sheet ‘Import names’ were taken from the shapefile map DBF file). Please note that punctuation & non-standard characters will have been removed during the import Find the corresponding name that failed to import in the first row of the worksheet For example, ‘TPHaNoi’ in the source data is ‘HaNoiCityHanoi’ in the sheet ‘Import names’. A useful approach to find the corresponding names is to press CTRL+F to search for part of the name in the document. For example, to find the name corresponding to “TPHaNoi”, search for ‘Hanoi’. To find the name corresponding to “DakNong”, search for “Nong”. Add TPHaNoi to the list of ‘recognized name variants’ of Hanoi city, by copying the name TPHaNoi and inserting it in column GI, row 3. Repeat this for the other unrecognized names. StatPlanet User Guide: Chapter 5.1, 5.4, 5.8 (troubleshooting) TPHoChiMinh TPHaiPhong HaTay TPHaNoi DakNong DakLak TPDaNang TPCanTho BacCan BaRiaVung

Fix the names in the Data Editor “HaTay” cannot be found anywhere, because it does not exist in the map since it became part of “Hanoi”. You need to go to the sheet “Import” to add it (if required). Scroll all the way to the right, to column PI, and enter it in row 2. Click on “Save data”. This adds “HaTay” to the sheets “Import names” and “Data”, and it will now be displayed as a “map point” in StatPlanet. In the same sheet (“Import”) remove the imported rows They need to be removed as data for some map regions was not imported the first time you did it. Save, close and re-open the StatPlanet Data Editor. Then import the source file again There should be no more ‘Import names’ errors. and this time data for all map regions should be imported. StatPlanet User Guide: Chapter 5.1, 5.4, 5.8 (troubleshooting)

Importing Bulk Data Return to the StatPlanet Data Editor (Import spreadsheet) Import “MDG_multiple_categories_example.xlsx” from the Training_Materials\Example_Data folder Note the large number of rows added by this import Open the file MDG_multiple_categories_example.xlsx and note that the sheet names correspond to the categories imported in the StatPlanet Data Editor. Click on the Save Data button & re-open StatPlanet.exe to see new data for multiple categories. StatPlanet User Guide: Chapter 5.1, 5.4

Data Editor – Preventing errors Sorting is strongly discouraged within the Data Editor – many features will only continue to work if they are left in the right order: The relationships and column ordering between the map columns in the worksheets Import, Import names and Data are crucial Sorting on the Import worksheet could destroy the data structure Where data fails to import properly, make necessary fixes and re-import the data. Importing data is preferable over inserting or re-arranging data manually, as importing automatically sets the right data structure. Please note that the ‘Clear data’ button clears everything and it is not possible to undo this. Making frequent backups is a good idea to be able to go back to a previous working version in case things go wrong.

Successfully Preparing Data for Import - Summary Preparing the Source data file There should be only one row (or column) of data for each region in the source data, per Excel worksheet. Make sure that each row and column in your source data has clear headers, such as the indicator name, or the map region name. For example, indicator names in row 1, and region names in column A. Clearly identify dates in the source data (if data is time-series) If you wish to import data as separate categories, organize it so that there is one category in each Excel worksheet. Correlate names in source data and in the Data Editor Either fix the source data, or add new names (spelling variants) into the StatPlanet Data Editor, sheet ‘Import names’ StatPlanet User Guide: Chapter 5.1, 5.4, 5.8 (troubleshooting)