Order from chaos Simon Brown DBI317

Dec 23, 2015

Transcript
Page 1: Simon Brown. a generic label for describing any corporate information that is not in a database.

Order from chaos

Simon Brown

DBI317

Page 2:

What Is Unstructured Data?

Page 3:

What is unstructured data?

A generic label for describing any corporate information that is not in a database

Data that does not reside in fixed locations

Any data that has no identifiable structure

Information that either does not have a pre-defined data model or is not organized in a pre-defined manner

Page 4:

Structure

Page 5:

Why is unstructured data important?

Unstructured data doubles every three months

7 million web pages are added every day

80% of business is conducted on unstructured information

85% of all data stored is held in an unstructured format

Page 6:

What are we working with?

Page 7:

Technology

Unstructured: Excel, File shares/Folders, Internet Data

Structured: SQL Server 2012, Analysis Services, Integration Services

Semi-structured: SharePoint, Excel Services, Service Manager

Business Intelligence: SharePoint, PowerPivot, Power View, Power Query, Power Map, BI Semantic Model

Page 8:

Meet Bob

Business Requirements

Fast report creation

Turning personal information into BI

Will use outcome to plan special pricing and inventory management

To be discarded after use

Page 9:

Demo

Retail Measurements

Page 10:

Review

What We Did

Took unstructured data and joined it to structured data

Bob has used this for personal reporting

Identified this as tactical data
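
The join described on this slide can be illustrated outside the demo tooling. The following is a minimal Python sketch, not the method used in the session: it assumes the unstructured side is a CSV export of a personal spreadsheet and the structured side is a database table. The store names, column names, and the helpers `build_store_db` and `join_sales_to_stores` are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical ad hoc spreadsheet export (the "unstructured" side).
SPREADSHEET_CSV = """store,week,units_sold
Sydney,1,120
Melbourne,1,95
Sydney,2,130
"""


def build_store_db():
    """Create a small structured reference table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE stores (name TEXT PRIMARY KEY, region TEXT)")
    conn.executemany(
        "INSERT INTO stores VALUES (?, ?)",
        [("Sydney", "NSW"), ("Melbourne", "VIC")],
    )
    return conn


def join_sales_to_stores(csv_text, conn):
    """Join loose spreadsheet rows to the structured store table by name."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Stage the spreadsheet rows in a temporary table: the data is tactical
    # and disappears when the connection is closed.
    conn.execute(
        "CREATE TEMP TABLE sales (store TEXT, week INTEGER, units_sold INTEGER)"
    )
    conn.executemany(
        "INSERT INTO sales VALUES (:store, :week, :units_sold)", rows
    )
    cursor = conn.execute(
        "SELECT s.store, st.region, SUM(s.units_sold) "
        "FROM sales AS s JOIN stores AS st ON st.name = s.store "
        "GROUP BY s.store, st.region ORDER BY s.store"
    )
    return cursor.fetchall()
```

Calling `join_sales_to_stores(SPREADSHEET_CSV, build_store_db())` returns one row per store with its region and total units sold. Staging into a temporary table mirrors the "to be discarded after use" requirement: nothing is persisted once the connection closes.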

Page 11:

How We Did It

Unstructured: Excel, File shares/Folders, Internet Data

Structured: Analysis Services

Semi-structured

Business Intelligence: BI Semantic Model

Page 12:

Success

Page 13:

Meet Sally

Business Requirements

Gain insight for planning meeting

Need to understand what current state looks like

Need to plan for future state

Work out who needs to be engaged in private sector to align planning

Need to know if more funding will be required

Page 14:

Demo

Hospital Growth

Page 15:

Review

What We Did

Took structured data and combined it

Created a personal data source

Published reports from the data source

Used reports for the basis of a planning presentation

Page 16:

How We Did It

Unstructured: Excel, Internet Data

Structured: SQL Server 2012, Analysis Services

Semi-structured: SharePoint, Excel Services

Business Intelligence: SharePoint, PowerPivot, Power View, Power Query, Power Map, BI Semantic Model

Page 17:

Success

Page 18:

Meet Simon

Business Requirements

Keep data for other uses

Feel good about house purchase

Understand which agent is likely to give the best result if needing to sell

Page 19:

Demo

House Pricing

Page 20:

Review

What We Did

Took unstructured data from the internet and transformed it into a data set

No longer a personal data source but a consumable data source

Used ETL to transform to structured data

Used Power View to visualise data
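
The extract, transform, load (ETL) step named on this slide can be sketched in a few lines. The session lists Integration Services as the tool; the Python below is only an illustration of the same idea, under the assumption that the internet data arrives as free-text sale listings. The listing format, the sample addresses and agents, the `sales` table, and every function name are hypothetical.

```python
import re
import sqlite3

# Hypothetical free-text listings scraped from a web page.
RAW_LISTINGS = [
    "SOLD 12 Example St, Suburbia - $650,000 - Agent: Acme Realty",
    "SOLD 7 Sample Rd, Suburbia - $720,500 - Agent: Best Homes",
    "listing withdrawn",
]

# Assumed listing layout: address, price, and agent separated by " - ".
LISTING_PATTERN = re.compile(
    r"SOLD (?P<address>.+?) - \$(?P<price>[\d,]+) - Agent: (?P<agent>.+)"
)


def extract(lines):
    """Extract: keep only lines that match the assumed listing layout."""
    for line in lines:
        match = LISTING_PATTERN.fullmatch(line.strip())
        if match:
            yield match.groupdict()


def transform(record):
    """Transform: turn the price text into an integer and fix column order."""
    price = int(record["price"].replace(",", ""))
    return (record["address"], price, record["agent"])


def load(rows, conn):
    """Load: write the cleaned rows into a structured table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(address TEXT, price INTEGER, agent TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(lines, conn):
    """Run the three steps in order against an open database connection."""
    load([transform(record) for record in extract(lines)], conn)
```

After `run_etl(RAW_LISTINGS, sqlite3.connect("house_prices.db"))`, the `sales` table can be queried by any reporting tool, which is what makes the result a consumable data source rather than a personal one. Lines that do not match the pattern are skipped silently here; a production pipeline would more likely log or quarantine them.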

Page 21:

How We Did It

Unstructured: Excel, Internet Data

Structured: SQL Server 2012, Integration Services

Semi-structured: SharePoint, Excel Services

Business Intelligence: SharePoint, PowerPivot, Power View, Power Query, BI Semantic Model

Page 22:

Total Chaos

Page 23:

Chaos

"We have over 10,000 Excel spreadsheets in the organisation. I am going to ban Excel."

- Manager

Page 24:

Discovery

Page 25:

Data Source

Page 26:

Change Control

Page 27:

Success

Page 28:

Questions

Page 29:

Resources

Developer Network

Resources for Developers

http://msdn.microsoft.com/en-au/

Learning

Virtual Academy

http://www.microsoftvirtualacademy.com/

TechNet

Resources for IT Professionals

http://technet.microsoft.com/en-au/

Sessions on Demand

http://channel9.msdn.com/Events/TechEd/Australia/2013

Page 30:

Track Resources

• Download the CTP for SQL Server 2014 and accelerate your queries using In-Memory OLTP - http://technet.microsoft.com/en-us/evalcenter/dn205290.aspx

• Get into the cloud with an Azure account - use SQL database in Windows Azure or take your workload into Azure VM - www.windowsazure.com

• Get big with big data – HDInsight on Azure and grab the latest Power BI features

HDInsight - http://www.windowsazure.com/en-us/documentation/services/hdinsight/?fb=en-us

Power BI - www.powerbi.com

Page 31:

© 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.