Tina Fletcher| Targeting Quality| Oct 9, 2015 Why Getting Good Test Data is Hard, and What to Do About It.

Slides:



Advertisements
Similar presentations
Usage Statistics in Context: related standards and tools Oliver Pesch Chief Strategist, E-Resources EBSCO Information Services Usage Statistics and Publishers:
Advertisements

SolidWorks Enterprise PDM Data Loading Strategies
Annoucements  Next labs 9 and 10 are paired for everyone. So don’t miss the lab.  There is a review session for the quiz on Monday, November 4, at 8:00.
Futures – Alpha Cloud Deployment and Application Management.
© 2010 Bennett, McRobb and Farmer1 Use Case Description Supplementary material to support Bennett, McRobb and Farmer: Object Oriented Systems Analysis.
Monash's Mock RQF − Lessons learnt David Groenewegen ARROW Project Manager.
Non-Coding Activities a Development Team Needs a.k.a ”I don’t code, am I no longer useful?” Maaret Pyhäjärvi| | Twitter: maaretp Test Granlund.
Skills Contest Everyone’s a winner! Transcomm 2013 Connie Rus Multimedia Communications Manager May 15, 2012.
Administrivia  Review Deliverable 2 –Overview (audience) –Excellent additions  User Goals  Usability Goals  User Group (who are you designing for?)
Software Engineering CSE470: Requirements Analysis 1 Requirements Analysis Defining the WHAT.
User studies. Why user studies? How do we know security and privacy solutions are really usable? Have to observe users! –you may be surprised by what.
Overview of Software Requirements
Database Design Concepts Info 1408 Lecture 2 An Introduction to Data Storage.
Database Design Concepts Info 1408 Lecture 2 An Introduction to Data Storage.
BTT12OI.  Do you know someone who has been scammed? What happened?  Been tricked into sending someone else money (not who they thought they were) 
Troy Eversen | 19 May 2015 Data Integrity Workshop.
Individual User Logins
Load Test Planning Especially with HP LoadRunner >>>>>>>>>>>>>>>>>>>>>>
Scenario testing Tor Stålhane. Scenario testing – 1 There are two types of scenario testing. Type 1 – scenarios used as to define input/output sequences.
Welcome Assessment Centres David Phillips Senior Assessment Partner DfT Resourcing Group.
Lecture 7 Page 1 CS 236 Online Password Management Limit login attempts Encrypt your passwords Protecting the password file Forgotten passwords Generating.
MASFAA 2013 October 6 th – 9 th, 2013 Indianapolis, Indiana Scholarship Administration and Software Tools: Getting Started Presenters Phil Knight – Next.
1 Design and Integration: Part 1 Nuggets about Design vs Project Management.
Computers & Employment By Andrew Attard and Stephen Calleja.
Copyright © IRI, Confidential and proprietary. Expanding Your Training Outlook with E-Learning IRI eSuite See - Act – Win.
The Five Step Sales Process The Five Step Sales Process Step One: Plan and Prepare May 11, 2011.
Systems Used for Collaboration When to achieve a common goal, result or work product.
Dr Lisa Wise 9/08/2002 The Evolution and Devolution of the Webmaster Role Dr Lisa Wise.
242/102/49 0/51/59 181/172/166 Primary colors 248/152/29 PMS 172 PMS 137 PMS 546 PMS /206/ /227/ /129/123 Secondary colors 114/181/204.
Computing Fundamentals Module Lesson 19 — Using Technology to Solve Problems Computer Literacy BASICS.
Business Correspondence: Letters, Memos, and s
Understanding Technology Crime Investigation for Managers.
What is Usability? Usability Is a measure of how easy it is to use something: –How easy will the use of the software be for a typical user to understand,
Moodle (Course Management Systems). Managing Your class In this Lecture, we’ll cover course management, including understanding and using roles, arranging.
ECE450 - Software Engineering II1 ECE450 – Software Engineering II Today: An Aside: The Quickest Tour through the UML that you will ever get.
Hipikat: A Project Memory for Software Development The CISC 864 Analysis By Lionel Marks.
EXPLORING “BEST PRACTICES” Let’s Test Mission Help me build the closing keynote: “A Critical Look at Best Practices”A Critical Look at Best Practices.
Client/User Analysis Website Design. 2 Questions to be answered: What is the purpose of the site? What is the purpose of the site? Who is the site for?
Midterm Stats Min: 16/38 (42%) Max: 36.5/38 (96%) Average: 29.5/36 (78%)
Designing & Testing Information Systems Notes Information Systems Design & Development: Purpose, features functionality, users & Testing.
Troubleshooting 101: Effective Information Gathering Martha Lundgren Texas Association of School Boards Copyright 2005, Texas Association of School Boards.
Formulating a Simulation Project Proposal Chapter3.
Use Cases Use Cases are employed to describe the functionality or behavior of a system. Each use case describes a different capability that the system.
1 Chapter 5 Lecture 5: Understanding Requirements Slide Set to accompany Software Engineering: A Practitioner’s Approach, 7/e by Roger S. Pressman Slides.
1 Design and Integration: Part 2. 2 Plus Delta Feedback Reading and lecture repeat Ambiguous questions on quizzes Attendance quizzes Boring white lecture.
Task Analysis Methods IST 331. March 16 th
Tips for Attending Conferences Deborah S. Ray Ashley Burton.
CS 5150 Software Engineering Lecture 7 Requirements 1.
By Germaine Cheung Hong Kong Computer Institute
Computing Fundamentals Module Lesson 6 — Using Technology to Solve Problems Computer Literacy BASICS.
Coaching Pack 9 – 11 Years. What Am I Coaching Today? What Might the Players Learn or Get Better at? TechnicalPsychological example PhysicalSocial example.
Copyright 2010, The World Bank Group. All Rights Reserved. Testing and Documentation Part II.
MIS 7003 MBA Core Course in MIS Professor Akhilesh Bajaj The University of Tulsa Introduction to S/W Engineering © All slides in this presentation Akhilesh.
E-BILLING MOTIVATION. Introduction  E-billing is the electronic delivery of financial documents to the customer, that represents and replaces the conventional.
Lecture 15 Page 1 CS 236 Online Evaluating Running Systems Evaluating system security requires knowing what’s going on Many steps are necessary for a full.
SharePoint Governance And the role of the Site Owner.
It’s Where the Fortune Is The Power of Follow Up.
Sitecore.net Training, Oct ECM 2.1 UPDATE 2 PART 1 CRAWL BEFORE YOU WALK.
The Concepts of Business Intelligence Microsoft® Business Intelligence Solutions.
Repository Manager 1.3 Product Overview Name Title Date.
Copyright Office Material Copyright Request System.
SAP Security Online Training Online | classroom| Corporate Training | certifications | placements| support CONTACT US: MAGNIFIC TRAINING INDIA
Experience sharing - Introducing a tool for announcing and tracking attendance at conferences and congresses Global Regulatory Policy – Regulatory Sciences:
Tool Support for Testing
Password Management Limit login attempts Encrypt your passwords
Computer Aided Software Engineering (CASE)
Call Now For more details Visit at:- support-numberus.com/yahoo-support- number/ support-numberus.com/yahoo-support-
Introducing Automation in Traditional Software Testing Best Practices.
Implementing Security in ASP.NET Core: Claims, Patterns, and Policies
Computer Literacy BASICS
Presentation transcript:

Tina Fletcher| Targeting Quality| Oct 9, 2015 Why Getting Good Test Data is Hard, and What to Do About It

 Currently a Test Strategist; have also been a Test Lead, Team Lead, and Test Specialist  But really… someone who cares about doing thoughtful, responsible testing that (hopefully) helps lead to better quality releases  I’m also and Who am I?

 We don’t talk about it enough – Not glamorous  avoided – Sounds easy  underestimated – Not “testing”  forgotten  In my experience: – Not glamorous  necessary prerequisite – Sounds easy  surprisingly challenging – Not “testing”  forgetting is expensive Why am I talking about test data? Software Testing Test Data

 Have made enough mistakes to start offering advice?  What the rest of this talk will look like: – 10 useful questions to ask when starting a new software project  Example applications to help solidify concepts: – Media player – client – Learning management system for online courses Why am I talking about test data? (…cont.)

10 Useful Test-Data Related Questions to Ask When Starting a New Software Project

 General definition – Research shows: “test data is the data you test with” – Tina’s best attempt: “something a user might have, enter, or do while using your feature that you care about testing before release” 1. What is test data, in our context?

 For your project – Define a test data unit  Media files, s, online courses, students  Rows of database entries, JSON objects – Examples help to identify scope and variables  <1 sec 128kbps mp3 file  sent to 2 recipients, 3 image attachments, HTML table in body, AS 12.1 provider  Course with >1500 students, 6 quizzes and heavy group discussion usage  User registered as a student in 3 courses and a teaching assistant in another 1. What is test data, in our context? (…cont.)

 General reasons – Need test data in order to execute tests – Test under conditions that resemble production – Useful for many team members (developer, tester, UX designer, product owner) 2. Why does test data matter to us?

 Project-specific examples – We want to be confident that users can play all common media file types without encountering any errors – We want to be confident that users will be able to read and interact with any message they receive, regardless of its properties – We want to be confident that the online learning system’s performance is not impacted by large classes with many concurrent user sessions – We want to be confident that permissions in the online learning system are properly restricted according to user roles 2. Why does test data matter to us? (…cont.)

 Random vs controlled – Mailing list subscription or crafting specific s  Realistic vs simplified – Testing combinations of things or one at a time  Static vs changing – Same set of files or rotating/expanding collection  Able to tweak on demand as interesting things are observed – Something weird happens after ~2 minutes of playback 3. What should our test data look like?

 Understand functionality and performance expectations – Support claims for media file types – Load time before playback starts (large files)  Prioritize risk areas and how things might fail – s that have been forwarded across multiple providers  Decide what is in and out of scope – Top 25 providers  Understand your customers – Common configurations of permission settings 3. What should our test data look like? (…cont.)

 Complete test data management suites (generation, analysis, cleaning): – IBM, Informatica, Grid Tools  Copy from existing source: – Test data repository – Production  Lightweight/freeware (rows of data): –  Mock data (APIs): – How can we get test data?

 Use your team/company to generate data (“dogfooding”) – Internal training courses  Build your own tool to generate data – spammer (volume)  Use automated tests to generate data – Simulate student activities in a course  Generate it manually – Sometimes the only way, or most realistic way 4. How can we get test data? (…cont.)

 Using multiple methods is usually best – More variation – Each has strengths/limitations 4. How can we get test data? (…cont.)

 Inconsistencies in generated data  Large number of variations to obtain  Copyright  Privacy of “real” data – sharing – Masking – reduced usefulness? 5. What test data challenges might we encounter?

 “Old” data – Database migration – Time stamps  Cost – Paid accounts – Paid tools – Hosting multiple test environments – Time!! … 5. What test data challenges might we encounter? (…cont)

 Copy from a central location – Security/privacy concerns – Ability to partition data on demand – DRM restrictions  Stored in shared accounts or test environments – Return to known state / avoid irreversible change – Conflicts during simultaneous usage  Back up your test data! 6. Where are we going to keep test data? 7. How are we going to move/share test data?

 Are you finding bugs? – Are they valid? Important?  Are people outside of your team finding bugs? – e.g. beta, sales, other dev teams?  Are you able to test any scenarios that you randomly think of during testing? – Note: save/share additions and improvements to the data set 8. How will we know if our test data is effective?

 “Full coverage” is a myth; important to understand what’s missing  Ideas for mitigating actions: – Beta or crowd source testing – More unit tests – Review usage patterns (priority) – Communicate  New ideas from unexpected sources  Prepare: support, ops 9. What test data were we not able to get?

 Future you, or other teams, probably need to make changes in future – Avoid time cost for “reinventing the wheel”  Documentation: – What you have – What you don’t have – Where to find it – How to use it – How to add/tweak data 10. What test data legacy will we leave behind?

 Post mortem notes: – How you arrived at the current set of test data (and why) – Key things that need to be repeated – Ideas that didn’t pan out – Things that could have gone better – Things you didn’t get to – Ideas and recommendations for future 10. What test data legacy will we leave behind? (…cont.)

 What other questions would you add to this list?  What challenges have you encountered related to test data?  What creative test data solutions are you proud of?  What did I miss?  … or anything else! Over to you…

 Why is getting good test data hard? – Lots of variables and data characteristics to consider – Choosing sources and generation methods – Balancing data realism, privacy, $$, time – Can also be difficult to move, store, share, and tweak  What can you do about it? – Plan ahead – Ask yourself the 10 questions – Keep improving Conclusion

Thank you!