Tuplejump: The data engineering platform

A startup with a vision to simplify data engineering and empower the next generation of data-powered miracles!
Tuplejump
Rohit, Founder and CEO
Satya, Founder and CTO

What do we do?
The Tuplejump Platform provides ready-to-use, out-of-the-box, fully integrated end-to-end data pipeline components to bring your idea to life fast! Most startups spend a lot of time studying and integrating various open-source software (OSS). We have done this for you and assembled a system from best-of-breed components. Our service engineers can assist you or develop anything from your PoCs to entire solutions in record time.

The Data Pipeline
COLLECT, TRANSFORM, PREDICT, STORE, EXPLORE, VISUALIZE, OpsCenter

The Tuplejump Platform | COLLECT
Hydra: The tentacled framework for gathering high-volume, high-velocity data from push sources (devices, page alerts, forms, etc.) and pull sources (web scraping, blogs, social networks, etc.). Powered by Akka, it reacts to events on demand and streams data to Spark for batch processing.

The Tuplejump Platform | TRANSFORM
Spark + Calliope: Use the friendly Spark API, with added features to easily consume data from and load data to Cassandra-powered storage. Transform structured and unstructured data and join it with other data sets using simple drag and drop. Join delta transformations on real-time feeds with existing data using Spark Streaming.

The Tuplejump Platform | STORE
DStore (Cassandra++): Cassandra, enriched with our custom components to provide a single storage mechanism for files, (un)structured data, generic data formats like XML and JSON, etc.
Stargate: A Lucene-powered indexing mechanism built right into Cassandra (C*) to allow for advanced indexing and searching of data.
SnackFS: An HDFS-compatible, fat-driver distributed file system over Cassandra.

The Tuplejump Platform | EXPLORE
Shark + Calliope: The Shark analytical engine shines at exploring large structured and unstructured data sets. With Calliope, you can get comprehensive reporting on data from Cassandra in seconds and minutes, not hours. Using Stargate indexes, you can filter large amounts of data in Cassandra, saving those agonizing hours of batch jobs.
UberCube: Our patent-pending UberCube (™) technology is a distributed OLAP cube engine designed from the ground up for interactive exploration of very large datasets.

The Tuplejump Platform | PREDICT
MinerBot: Builds on Spark's ML framework, adding EA and ANN/DL frameworks to take ML to the next level. Drag-and-drop machine learning coming soon!

The Tuplejump Platform | VISUALIZE
Pissaro: A modern, game-changing data frontend providing highly interactive and reactive visualizations. Not just reports!

The Tuplejump Platform | OpsCenter
OpsCenter: A deployment, monitoring and management framework built specifically for deploying, maintaining and scaling our platform without touching your server.
Click to cluster: One-click deployment to take your application from development to cluster.
BigData PaaS: A PaaS is coming soon, so you can focus on your idea and let us worry about the rest.

Tuplejump Advantage
All the advantages of Spark + all the advantages of Cassandra + much more!
Over 500x faster than traditional Hadoop solutions (much more in the case of filtered data).
Shark + C* provide superfast ad hoc querying.
UberCube enables sub-millisecond responses on very large cubes.
MinerBot provides ready-to-use ML algorithms, plus the possibility of much more complex algorithms and mechanisms than just MapReduce.
Ready to use, no integration required.
Easy to develop, deploy, monitor and scale.

Case Study I - IoT

Hydra was designed for IoT in the first place.
It supports MQTT for messaging from and to devices/sensors and for communication between devices.
Use message processing to raise alerts.
Use batch processing for advanced data analytics.
DStore provides highly scalable, write-optimized distributed storage for events and messages.
MinerBot powers anomaly detection and automation based on event analysis and patterns.
Build a multidimensional analytics cube on the event features with UberCube.
Visualize and understand the events in charts with Pissaro.
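As a rough illustration of the "message processing to raise alerts" step, the sketch below flags sensor readings that cross a threshold. It is plain Python and does not use Hydra, MQTT, or any Tuplejump API; the `Event` type, the `raise_alerts` function, and the threshold value are hypothetical names chosen only for this example.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Event:
    """A single sensor reading (hypothetical shape, not a Hydra type)."""
    device_id: str
    metric: str
    value: float


def raise_alerts(events: Iterable[Event], metric: str, threshold: float) -> List[str]:
    """Return one alert message per event of `metric` whose value exceeds `threshold`."""
    alerts = []
    for event in events:
        if event.metric == metric and event.value > threshold:
            alerts.append(
                f"ALERT {event.device_id}: {metric}={event.value} > {threshold}"
            )
    return alerts


# Example input: in a real deployment these would arrive as device messages.
readings = [
    Event("sensor-1", "temperature", 21.5),
    Event("sensor-2", "temperature", 78.0),
    Event("sensor-2", "humidity", 90.0),
]
alerts = raise_alerts(readings, metric="temperature", threshold=60.0)
```

In a streaming setup the same check would presumably run per message as it arrives, while the stored events remain available for the batch analytics mentioned above.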

Case Study II - Advertising

Hydra enables high-volume, high-velocity data collection to gather page clicks, user events, user behaviour, etc.
Event processing to trigger and handle real-time bidding (RTB).
MinerBot to optimize ad-user matching based on previous success/failure records.
Pissaro to power the advertiser dashboard and reports.

Let's talk!