Syncsort Data Integration Update, 05.28.2013

Presentation transcript:

Syncsort Data Integration Update

Summary
- Helping Data Intensive Organizations Across the Big Data Continuum
- Hadoop – The Operating System for Big Data
- Strategic Focus on Hadoop & Collaboration with Open Source
  – Patch New Feature Commitment
- Spring 13 Release – Delivering a Smarter Approach to Hadoop ETL with 2 New Offerings:
  – DMX-h ETL Edition
  – DMX-h Sort Edition
- Closing Thoughts
  – Delivering Better ETL through Hadoop & Better Hadoop with Enhanced ETL Capabilities

The Big Data Continuum
Stages: Evolved, Dynamic, Plateauing, Advancing, Awakening
Stage descriptions:
- Early Hadoop adoption prototyping & experimentation
- Hand-coding: SQL, JCL
- Standardization & platforms for enterprise connectivity
- Arch limits + exponential costs. Growing MIPS, missed SLAs
- Big Data is the new standard for both MF & open systems data
Challenges:
- Long development cycles
- Unsustainable costs
- Hadoop ingestion & usability gaps
- Efficiency, ETL & skills gaps
- Hand-coding nightmare
Value axis: Max, Min
Integrating Big Data… Smarter: MFX, SQL Migration, Hadoop Sort & ETL, ETL & Rehosting Optimization, High-performance ETL, DMX, DMX-h

Hadoop – The Operating System for Big Data
- Ingest / Propagate (Flume, Sqoop)
- Describe & Develop (Hive, Pig, HCatalog)
- Persist (HDFS)
- Monitor / Admin (ZooKeeper, Oozie)
- Databases (Cassandra, HBase)
- Analytics (Mahout, Datameer)
- ETL

Syncsort DMX-h: Introducing Two New Hadoop Offerings
Features:
- GUI for developing & maintaining MR jobs
- Test & debug locally in Windows; deploy on Hadoop
- Use-case Accelerators to fast-track development
- Broad based connectivity with automated parallelism
- Best in class mainframe data access
- Improved per node scalability and throughput
- Improved Hadoop Sort performance
- Sort-work compression for storage savings
- Support for native Hadoop data types & Hive queries
- Automated deployment
DMX-h ETL: Everything you need to turn Hadoop into a feature-rich ETL Solution
DMX-h Sort: Seamlessly Optimize Map-sort and Reduce-merge Operations
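
For readers unfamiliar with the map-sort and reduce-merge operations named on this slide, the following is a minimal sketch of the general idea in plain Python. It is not Syncsort's or Hadoop's implementation: the function names `map_sort` and `reduce_merge` and the sample records are hypothetical, chosen only to show that each mapper's output is sorted by key and that the reducer then merges the pre-sorted runs before aggregating.

```python
import heapq
from itertools import groupby
from operator import itemgetter


def map_sort(records):
    """Map side: sort one mapper's (key, value) output by key."""
    return sorted(records, key=itemgetter(0))


def reduce_merge(sorted_runs):
    """Reduce side: merge already-sorted runs, then sum the values per key."""
    merged = heapq.merge(*sorted_runs, key=itemgetter(0))
    return [
        (key, sum(value for _, value in group))
        for key, group in groupby(merged, key=itemgetter(0))
    ]


# Hypothetical output of two mappers (illustrative data only)
mapper_a = [("pig", 1), ("hive", 2), ("hdfs", 1)]
mapper_b = [("hive", 1), ("sqoop", 4), ("hdfs", 3)]

runs = [map_sort(mapper_a), map_sort(mapper_b)]
totals = reduce_merge(runs)
print(totals)
```

Because every run is already sorted, the merge step only needs to compare the head of each run, which is why the cost of the sort and merge phases matters so much to overall job time.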

Strategic Focus on Big Data & Hadoop
1. Smart Contributions to the Open Source Community
2. Embrace Syncsort Technology Differentiators & Mainframe Heritage
3. Strengthen Partnerships with Strategic Big Data & Hadoop Players
4. Leverage Customers' Hadoop Expertise
- Light footprint
- Self-tuning engine
- Single install. No 3rd party dependencies
- Files & Mainframe Expertise
- JIRA: and more!

Syncsort DMX-h: Smarter ETL through Hadoop
- Smart architecture with engine running natively within MapReduce framework
- Smart connectivity delivers faster loads/extracts to HDFS
- Smart development hides complexity of MapReduce
- Smart use-case accelerators to jump-start Hadoop developers' productivity
PLUS Smarter Hadoop…
- Enhanced vertical scalability
- Smart contributions to open source community

Syncsort Spring 13 Release
Smart architecture
- DMX-h Sort & ETL product offerings
- Performance improvements
  o Smart reformatting
- Kerberos security authentication
  o HDFS Load/Extract, ETL execution
Smart development
- Support for Job, all transformations
- Graphical UI for development and debug
Smart use-case accelerators
- Documented templates for common use cases (CDC, web log aggregations, Join, Lookup, mainframe, etc.)
Smart connectivity
- Native connectivity including mainframe & DBMSs
Ecosystem
- OpenSync Phase I
- Data Connector API
Announced May 20, 2013
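
As an illustration of one use case listed above, change data capture (CDC), here is a minimal sketch that compares two snapshots of a dataset and classifies the differences. It is an assumption-laden toy and not one of the documented DMX-h templates: the function name `detect_changes`, the snapshot layout (a dict keyed by record ID), and the sample data are all hypothetical.

```python
def detect_changes(previous, current):
    """Compare two snapshots keyed by record ID and classify the differences."""
    # Records present only in the new snapshot
    inserts = {key: row for key, row in current.items() if key not in previous}
    # Records present only in the old snapshot
    deletes = {key: row for key, row in previous.items() if key not in current}
    # Records present in both snapshots whose contents differ: (old, new)
    updates = {
        key: (previous[key], row)
        for key, row in current.items()
        if key in previous and previous[key] != row
    }
    return inserts, updates, deletes


# Hypothetical snapshots: record ID -> row contents
yesterday = {1: "alice,NY", 2: "bob,LA", 3: "carol,SF"}
today = {1: "alice,NY", 2: "bob,TX", 4: "dave,DC"}

inserts, updates, deletes = detect_changes(yesterday, today)
print(inserts, updates, deletes)
```

On a cluster the same comparison would typically be expressed as a join of the two snapshots on the record key, so that each key's old and new rows meet in one reducer.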

Closing Thoughts
- Gaining insights from Big Data is becoming a required business competency
- Big Data is fundamentally different from small data and breaks traditional data management architectures
- Hadoop is emerging as the operating system for Big Data management
- ETL will be the killer app that drives adoption of the new OS
- Syncsort's sort technology can become a standard Hadoop component
- Syncsort's unique integration and core processing engine provides a unique approach to supporting large scale ETL in Hadoop with disruptive TCO
- Recent contributions to open source & new Hadoop centric offerings will deliver better ETL through Hadoop while strengthening Hadoop with ETL

Syncsort Confidential and Proprietary - do not copy or distribute