Take Away Thoughts On Big Data, June 2012, Bob Gourley, CTOlabs.com. Some Context You Can Use: A Summary and Discussion of Big Data Presentations at the USGIF.

Presentation transcript:

Take Away Thoughts On Big Data
June 2012
Bob Gourley, CTOlabs.com
Some Context You Can Use: A Summary and Discussion of Big Data Presentations at the USGIF Technology Days

How Much Data Is There?
All of this afternoon’s presentations reminded me of a quote that tries to spell out how big the universe is: "Space," it says, "is big. Really big. You just won't believe how vastly, hugely, mindbogglingly big it is." (From The Hitchhiker's Guide to the Galaxy)

Cross-Over in Hype
CTOlabs projected in 2008 that the hype over Cloud Computing was headed to overtake the hype over SOA, and it did. The hype over Big Data is showing signs of faster growth, but we are unsure when it will overtake that of cloud computing.
Big reason to track this: it underscores that we need to decide what we mean by this term and not let others decide for us.

Big Data
This is not business as usual. And this is not just lots of data. It is not just exponential growth of data. It is new ways of making sense of data that require changes to existing architectures.
Big Data, the term, in its current use, implies many other things, like:
• Apache Hadoop framework
• Commodity hardware leveraging Moore’s law
• Infinite scalability
• No data temples
Caution: Big Data, the term, may soon lose all meaning.
I will define Big Data now.

What Is Big Data?
Big Data is a term applied to data sets whose size is beyond the ability of legacy approaches to capture, manage and process the data within a tolerable elapsed time. Big data sizes are a constantly moving target.
For the enterprise CTO, speaking of “big data” implies the need for a strategy for sensemaking over large quantities of data.
More context follows.
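The phrase “tolerable elapsed time” in the definition above can be made concrete with a little arithmetic. The sketch below is only an illustration: the 100 MB/s per-node read rate and the assumption of perfectly parallel reads are hypothetical round numbers chosen for the example, not figures from the presentation.

```python
TERABYTE = 10**12
PETABYTE = 10**15


def scan_hours(dataset_bytes, mb_per_second_per_node=100.0, nodes=1):
    """Hours needed to read a dataset once.

    Assumes every node reads at the same (hypothetical) rate and that
    the work divides perfectly across nodes, which real systems only
    approximate.
    """
    bytes_per_second = mb_per_second_per_node * 1_000_000 * nodes
    return dataset_bytes / bytes_per_second / 3600


if __name__ == "__main__":
    for label, size in [("1 TB", TERABYTE), ("1 PB", PETABYTE)]:
        for nodes in (1, 1000):
            hours = scan_hours(size, nodes=nodes)
            print(f"{label} on {nodes} node(s): {hours:.1f} hours")
```

Under these assumptions a single machine needs a few hours to read one terabyte but thousands of hours to read one petabyte, while a thousand commodity nodes bring the petabyte back to a few hours. That is the sense in which the size that counts as “big” is a moving target tied to the approach used.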

Big Data
Think of the challenges facing humanity: they all need solutions. But first we have to address a key problem: old ways of doing IT must change. When you hear the phrase “Big Data”, think of this need for change.
[Chart labels: Ability for humans to analyze data; Amount of data to analyze; Area of need (and opportunity)]
The ability to collect, parse and analyze machine data in real time, whether on premise or in the cloud, will continue to grow.

Ignite Style Presentations
• Introductory Keynote: “Big Data: Friend or Foe” – Barry Barlow, Director, Online GEOINT Services, National Geospatial-Intelligence Agency
• Management of Multi-Terabyte Video Files in Broadcast Television – Steve Atkinson, Director Federal Sales, Front Porch Digital Inc.
• 3,000,000 km2 of Satellite Imagery Every Day: Finding the Relevant Pixels – Luke Barrington, CTO, Tomnod; and John Lucier, Senior Manager Analysis Offerings, DigitalGlobe
• The Value of Stream Computing in Big Data – Gabe Chang, Senior Consulting Client IT Architect, IBM
• Big Data: Security and Scale in the Era of On-Demand IT – MG (Ret.) John M. Custer, Director, Federal Missions and Programs, EMC2
• Intelligence Products in the Age of Big Data – George Demmy, CTO, TerraGo
• Data Intensity and Datacenter Consolidation via Federated Cloud “Big Data” – Eng Lim Goh Ph.D., CTO and Senior VP, Silicon Graphics International Corp.
• A Big Ocean: Handling High Volume Bathymetry – Michael Henheffer, Senior Software Developer Marine Division, CARIS
• Big Features: A WISE Approach to Scalable Geospatial ISR Fusion – Joshua Lieberman, Senior Manager, Deloitte FAS LLP
• OMAR: Open Source Software for Big Data – Mark R. Lucas, Principal Scientist, RadiantBlue Technologies
• Interacting with Big Data using Mobile Devices – Kumar Navular, Director, Next Generation Products, DigitalGlobe
• Predictive Analytics in the Cloud: The Art of the Possible – Anthony Quartararo, President and CEO, Spatial Networks Inc.
• Big Data: The Diversity Factor – Dan Quinn, Vice President of Sales and Marketing, Progressive Technology Federal Systems Inc.
• Data Harmonization Through the Use of Complex Event Processors – Dennis Groseclose, President, TransVoyant
• Standardizing Web Interfaces to Distribute Wide Area Motion Imagery – Rahul Thakar, VP of Technology, PIXIA Corp.
• Discussant – Bob Gourley, Chief Technology Officer, Crucial Point LLC

Recap
• Big Data Friend or Foe: Barry Barlow says there is lots of data. Can't analyze it all with people. Mentioned MapStory, RecordedFuture, Hadoop.
• Big Data in Television: DBX is the current format. TB-size files. Lessons regarding metadata and workflow are relevant to our world. Suggested use of “proxy workflows.”
• Finding Relevant Pixels: New crowdsourcing methods of review.
• Stream and Big Data: Interconnected world with fast-moving data; analyze in stream.
• Game-Changing Tech for Data Explosion: Value comes from analytics. Storage is nothing. It is not a tsunami, it is a rising tide. Use Hadoop. Put compute into storage.
• Intel reporting in the age of Big Data: Data must be operationalized.
• Big Ocean: Bathymetry data smartly handled.
• Big Features: Context for collaboration while working big data is critical.
• OMAR, Open Source for Big Data: These guys are working really BIG data, in a way that is smooth and easy for users.
• Big Data and Mobile: Well-engineered solutions can make it incredibly easy on users.
• Data Intensity and Data Center Consolidation: Think BIG. Keep data where it is. Have a vision.
• Predictive Analytics in the Cloud: Know your challenge. User experience is everything.
• Activity-Based Intelligence: Must engineer for ABI. Sometimes bring data together.
• Global Standardization of Web Interfaces for WAMI: Web services for MI.
• Diverse Requirements: Says Big Data is not new. Good ways to present to users.
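The “analyze in stream” point from the recap can be sketched in a few lines. This is a minimal illustration and not the presenter's product or method: the function name, the window size and the threshold are all hypothetical. The idea it shows is that each reading is examined as it arrives, using only a small fixed-size window of recent values, so the full data set never has to be stored before analysis starts.

```python
from collections import deque


def rolling_alerts(readings, window=3, threshold=10.0):
    """Yield (index, mean) whenever the mean of the most recent
    `window` readings exceeds `threshold`.

    `readings` can be any iterable, including an unbounded generator,
    because only the last `window` values are kept in memory.
    """
    recent = deque(maxlen=window)  # old values fall off automatically
    for index, value in enumerate(readings):
        recent.append(value)
        if len(recent) == window:
            mean = sum(recent) / window
            if mean > threshold:
                yield index, mean


if __name__ == "__main__":
    # Hypothetical sensor values with a short burst in the middle.
    for index, mean in rolling_alerts([5, 6, 7, 20, 30, 4, 3]):
        print(f"reading {index}: rolling mean {mean:.1f} exceeds threshold")
```

Because the function is a generator over an iterable, the same code works whether the readings come from a list, a file or a live feed.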

Some discussion items
• Presenters at times used varying definitions of the term “big data”.
• We can’t command that the term always be used the way the tech community uses it. But we can push back whenever someone decides to invent their own definition.

What To Do Now:
• Sign up for the Government Big Data Newsletter: send me an email, at crucialpointllc.com, or sign up at CTOvision.com.
• Leverage the Apache Hadoop framework. Download the open source distribution of CDH4 from Cloudera.
• Share your lessons learned, and seek lessons from others on your Big Data use.
• Be precise in how you use the term, or expect its meaning to disappear.

Backup Slides

Big Data Use Cases in Government
• Security: rapid real-time analysis of all relevant data
• Rapid return of geospatial data
• Location-based push of data: ads now, but watch for more
• Real-time return of relevant search: Google, Cloudera and USA.gov
• Real-time suggestion of topics: Google, Cloudera and USA.gov
• Bioinformatics: Human Genome, Hadoop
• Bioinformatics: patient location, treatment, outcomes

Big Data and the Special Case of Cyber
• Cyber security has long generated large quantities of data.
• Enterprises need access to all the data to look for evidence of coordinated or sophisticated adversary action. Old approaches do not enable that.
• New CDH4-enabled “Big Data” approaches enable “Enterprise Security Intelligence” solutions.
• This includes non-cyber data: rapid computer emergency response needs all-source data along with the cyber data.
• New approaches to cyber security analysis, including incident detection, incident response, forensics and remediation, require “Big Data” thinking and designs built to bring all the data together.

The Intent of the Government Big Data Solutions Award
• Established to help facilitate exchange of best practices, lessons learned and creative ideas for solutions to hard data challenges
• Special focus on solutions built around the Apache Hadoop framework
• Nominees and award winners to be written up in CTOlabs.com technology reviews
• Award meant to help generate exchange of lessons learned
We established a team of judges, asked them to consider mission impact as the primary criterion, and solicited award nominations via sites frequented by government IT professionals and solution providers.

The Government Needs More Agility*
• The government can rapidly benefit from the lessons of high tech by being a faster follower, especially when it comes to Big Data constructs.
• Thesis: If the Big Data community understands more about federal missions, challenges and successes, we can improve the speed and effectiveness of federal solutions.
“High tech runs three-times faster than normal businesses. And the government runs three-times slower than normal businesses. So we have a nine-times gap” – Andy Grove
*Among other needs

Most active fed solution areas:
• Federal integrators: Spending internal research and development funds to create prototypes and full solutions relevant to fed missions
• DoD and IC agencies: Using Big Data approaches to solve “needle in the haystack” and “connect the dots” problems
• National Labs: Bioinformatics solutions have been put in place by federal researchers
• OMB and GSA: Ensuring sharing of lessons and solutions. Key exemplars around web search methods. Solutions inside government agencies and on citizen-facing properties
Big Data solutions are already making a difference in government service to citizens. Highlighting some of this virtuous work is a goal of our Government Big Data Solutions Award.

Top Nominees for 2011
• USA Search: Best-in-class hosted search services over more than 400 gov sites. Great use of CDH3.
• GCE Federal: Cloud-based financial management solutions. Apache Hadoop, HBase, Lucene for Dept of Labor.
• PNNL Bioinformatics: Leading researcher Dr. Taylor of PNNL is advancing understanding of health, biology, genetics and computing using Apache Hadoop/MapReduce/HBase.
• SherpaSurfing: Use of CDH as a cybersecurity solution. Ingest packet capture in any format, analyze trends, find malware, alert.
• US Department of State: Bureau of Consular Affairs. Large data with important applications for citizen service and national security.
Each of these is making a difference for government missions right now.

USA Search
• Program of the General Services Administration’s (GSA) Office of Citizen Services and Information Technologies.
• Hosted search services for USA.gov and over 500 other government websites.
• Solves big data challenges with open source capabilities.
• CDH3 since fall. HDFS, Hadoop and Hive used in a cost-effective, resilient, scalable solution.
• Search results. Search suggestions. Trend analysis. Analytic dashboards.
Bottom Line: USA Search brings the best of the open source community to multiple government missions, including direct citizen support.
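To give a feel for how Hadoop-style processing can turn raw search logs into suggestions and trend counts, here is a minimal in-process sketch of the map, shuffle and reduce steps. It is plain Python for illustration only: it does not use the Hadoop or Hive APIs, it is not the USA Search implementation, and the function names and sample queries are hypothetical.

```python
from collections import defaultdict


def map_query(log_line):
    """Map step: emit (normalized query, 1) for one log line."""
    query = log_line.strip().lower()
    if query:  # skip empty lines
        yield query, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key, values):
    """Reduce step: sum the counts for one query."""
    return key, sum(values)


def top_queries(log_lines, n=3):
    """Return the n most frequent queries, ties broken alphabetically."""
    pairs = (pair for line in log_lines for pair in map_query(line))
    counts = [reduce_counts(k, v) for k, v in shuffle(pairs).items()]
    return sorted(counts, key=lambda kv: (-kv[1], kv[0]))[:n]


if __name__ == "__main__":
    # Hypothetical query log, one search per line.
    logs = ["passport renewal", "Passport Renewal ", "tax forms",
            "", "tax forms", "passport renewal", "jobs"]
    print(top_queries(logs, n=2))
```

In a real cluster the map and reduce functions run on many nodes against data stored in HDFS, and the framework performs the shuffle; the structure of the computation is the same.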

Some Requests
• Sign up for the Government Big Data Newsletter at:
• Watch for the 2012 Government Big Data Solutions Award
• Stay in touch!

Thank You! Please give feedback and find more info at: CTOvision.com