Big Data & Test Automation
Agenda: What is Big Data?; Big Data architecture; Big Data technologies; Testing strategy; Functional test automation on data warehousing
What is Big Data? Is it just a huge amount of data?
Why do we need Big Data? For effective marketing; to make better business decisions; to gather customer feedback; to attain customer satisfaction; to increase revenue
4 V’s of Big Data
Big Data Architecture
Big Data Technologies: HDFS (Hadoop Distributed File System)
Big Data Technologies. MapReduce (framework): Map performs filtering and sorting on data sets; Reduce performs a summary operation on the results of the Map step. HBase: a column-oriented database management system. Hive: a data warehousing infrastructure for Hadoop that provides data summarization, query, and analysis. Pig: a platform for creating MapReduce programs used with Hadoop.
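The Map and Reduce steps described above can be sketched in plain Python. This is only an illustration of the idea on a single machine, not Hadoop's API: the function names (`map_phase`, `shuffle_and_sort`, `reduce_phase`, `word_count`) and the word-count task are assumptions chosen for the example, and the sorting step is shown as a separate helper for readability.

```python
from itertools import groupby
from operator import itemgetter


def map_phase(records):
    """Map: filter out blank lines and emit one (word, 1) pair per word."""
    for line in records:
        if not line.strip():
            continue  # filtering: skip empty input records
        for word in line.lower().split():
            yield word, 1


def shuffle_and_sort(pairs):
    """Sort the mapped pairs by key and group the values that share a key."""
    ordered = sorted(pairs, key=itemgetter(0))
    for key, group in groupby(ordered, key=itemgetter(0)):
        yield key, [value for _, value in group]


def reduce_phase(grouped):
    """Reduce: summarize each key's values (here, by summing them)."""
    return {key: sum(values) for key, values in grouped}


def word_count(records):
    """Run the three steps in order on an in-memory list of lines."""
    return reduce_phase(shuffle_and_sort(map_phase(records)))


if __name__ == "__main__":
    sample = ["big data test", "", "test automation on big data"]
    print(word_count(sample))
```

In a real cluster the same three roles are distributed across nodes; here they run in one process so the data flow is easy to follow.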
Test Strategy
Big Data Ecosystem Testing covers four areas: Big Data Extraction Testing, Data Transformation/Migration Testing, Data Analytics and Visualization Testing, and Non-Functional Testing.
Big Data Extraction Testing: Pre-Hadoop Validation; Metadata Analysis and Validation; Impala & HDFS Data Storage Validation; Validation of Data Extraction from Source; Referential Integrity & Constraint Validation; Heterogeneous Data Integration Validation.
Data Transformation/Migration Testing: Data Quality Validation; Data Correctness/Completeness Validation; Business Rule Validation.
Data Analytics and Visualization Testing: HDFS to SSAS Validation; Dashboard Validation; Visualization Validation; Report Generation and Validation.
Non-Functional Testing: Performance Validation; Security Validation.
Big Data Tools: Query Surge (functional).
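Two of the checks named in the test strategy, completeness validation and referential integrity validation, could be automated roughly as follows. This is a minimal sketch that uses Python's built-in `sqlite3` module as a stand-in for the real source and target stores; the table names (`src_customers`, `tgt_customers`, `tgt_orders`), the sample rows, and the helper functions are all hypothetical and not taken from the slides.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with hypothetical source and target tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE src_customers (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE tgt_customers (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE tgt_orders (
            id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL
        );
        INSERT INTO src_customers VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Cy');
        INSERT INTO tgt_customers VALUES (1, 'Ann'), (2, 'Bob');
        INSERT INTO tgt_orders VALUES (10, 1, 25.0), (11, 4, 40.0);
        """
    )
    return conn


def missing_in_target(conn):
    """Completeness check: source rows that never arrived in the target."""
    rows = conn.execute(
        """
        SELECT id, name FROM src_customers
        EXCEPT
        SELECT id, name FROM tgt_customers
        ORDER BY 1
        """
    )
    return rows.fetchall()


def orphaned_orders(conn):
    """Referential integrity check: orders that point at a missing customer."""
    rows = conn.execute(
        """
        SELECT o.id
        FROM tgt_orders AS o
        LEFT JOIN tgt_customers AS c ON c.id = o.customer_id
        WHERE c.id IS NULL
        ORDER BY o.id
        """
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_demo_db()
    print("Missing in target:", missing_in_target(conn))
    print("Orphaned orders:", orphaned_orders(conn))
```

Both helpers return the offending rows rather than a pass/fail flag, so a test report can list exactly which records failed validation.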
Query Surge Tool
Conclusion
Thank You