Oracle 10g DB Oracle Data Mining - Oracle | Integrated Cloud · PDF fileOracle Data Mining Overview 2. ... Data Warehousing ETL OLAP Data Mining OOracle 10racle 10g DB ... The role
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Data Mining: Find hidden Patterns • Data Mining can find previously hidden patterns and
relationships to help you: • Make informed predictions and…• Better understand customers
• Data Mining can help answer questions such as:• Which customers are likely to churn or attrite?• Which customers are likely to respond to this offer?• Which employees are likely to leave?• What “next product” should I recommend to this customer? • Which factors are most associated with a target attribute e.g. high value
customers • Which customer or transactions are most “unnatural” or possibly
Data Mining: Discover New Insights • Data Mining uncover hidden patterns and relationships to
help you: • Discover new segments, clusters, and subgroups and …
• Data Mining can help answer questions such as:• What are the profiles subpopulations or items of interest e.g. churners,
profitable customers, defective product, etc. • What natural segments or clusters exist in my data?• Which items are typically purchased together?• What items seems to fail together?• Which genes are most associated with this disease?
of a table and returns count, min, max, range, mean, stats_mode, variance, standard deviation, median, quantile values, +/- n sigma values, top/bottom 5 values
• Correlations• Pearson’s correlation coefficients, Spearman's and
Kendall's (both nonparametric).
• Cross Tabs• Enhanced with % statistics: chi squared, phi coefficient,
ease of use with ODM in-Database functionality & scalability
• Build, store, browse and score models in the Database for optimal performance
• For more information :• SPSS – Roger Lonsberry, (312) 651-3475 or [email protected]• Oracle – Alan Manewitz, (925) 984-9910 or [email protected]• Oracle – Charlie Berger, (781) 744-0324 or [email protected]
Benefits of Oracle’s ApproachIn-Database Analytics Benefit• Platform for Analytical
Applications• Eliminates data movement and
security exposure• Fastest: Data Information
• Wide range of data mining algorithms & statistical functions
• Supports most analytical problems
• Runs on multiple platforms • Applications may be developed and deployed
• Built on Oracle Technology • Grid, RAC, integrated BI,…• SQL & PL/SQL available• Leverage existing skills
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
The role of Data Mining in Rules-based Remote Services Delivery
Tracy E. ThieretPrincipal Scientist
Imaging and Systems Technology CenterXerox Innovation Group
Webster, New York
Oracle OpenWorld: October 2006 2
Talk Track
• Introduction to Xerox
• Business Metrics and Requirements
• How do we get data from our devices?
• OK, we have data. Now what?
• Before you can do Data Mining…
• The Process and some Results
• The Rewards
Oracle OpenWorld: October 2006 3
Xerox Innovation Group Locations
**XRCC
****PARCWCR&T/ISTC
**XRCE
**El Segundo**Stamford
Oracle OpenWorld: October 2006 4
Introduction to Xerox
It’s all about DocumentsCopying and PrintingFormat conversion – electronic to paper and back
How do we make money?Engineering design of marking productsChemistry and Physics of MaterialsServices around Marking and Scanning
Oracle OpenWorld: October 2006 5
Some Xerox Engineered ProductsFull Range: Desktop to Production
NuveraPhaser 6250
DocuColor iGen3WorkCenterPro 90
Oracle OpenWorld: October 2006 6
Document System DeviceModern document devices perform multiple functions
Print, Scan, Copy, Fax, E-Mail, OCR, and much more…Simultaneously
Complex Electro-Mechanical devices DocuColor iGen3 Digital Press
85 Computers, 5M Lines of S/W, 3.5 miles of wires, 192 Sensors, 102 Motors
StackingPrint
Engine
Smart Paper Trays
Steering & Motion Control Loops
Process Control Loops
Oracle OpenWorld: October 2006 7
US Consumables Industry$~37 Billion Annually
Toners/Carriers25,000 Freight Cars/Year
Photoreceptors 220,000/day
Fuser Components35 Million Rolls/Year
Paper & Transparencies4 Billion Sheets/Day
Specialty Materials
•Fuser Oil•Cleaner Blades Inks
490,000 Cartridges/Day
Copying Printing Faxing
Copying Printing Faxing
Oracle OpenWorld: October 2006 8
TonerA Highly Complex and Constrained Material
20 µm
Oracle OpenWorld: October 2006 9
Business Objectives:Reducing Costs in each LoB
Engineering DesignProviding Increased Functionality within Boundaries
Total Manufacturing CostsSoftware Development
Toner Chemistry and PhysicsNew DesignsImproved Functionality
Services DeliveryXerox’s Internal Service ForceParts and LaborAccelerate Collective Learning Product
EoLTime…ProductLaunch
Convergence to Mature Metrics
Mea
sure
of “
Goo
dnes
s”
$
Oracle OpenWorld: October 2006 10
Data from Devices
Web Presence and Back-Office
Information Flow
Informatio
n
Flow
Network/Systems Mgmt AppOn-site Solutions
Information Flow
Xerox & Partner Sites
Customer Site
Active Device Agent with embedded
intelligence
Enhanced web access to tools and
services
Make use of external and internal standards to speed development and deployment
of capabilities.
Devices
Oracle OpenWorld: October 2006 11
OK – we have the data. Now what?
Deliver Data-Centric Services to Customers
AMR: Automated Meter ReadingASR: Automated Supplies ReplenishmentOthers in the Pipe
Feed-forward to Service Reps for Repair Hints
Knowledge Development in Engineering
Oracle OpenWorld: October 2006 12
Focus on Break-Fix Service
A host of Questions before you start…How to deploy Knowledge to Field Personnel?Knowledge Representation?Transparency?Ease of Knowledge Development?Decoupling of Cycle Times?
Machine Software ReleasesKnowledge Discovery
RulesOut of Favor in the ’80sBack in with deployment of Business Rules
Oracle OpenWorld: October 2006 13
Where do the Rules come from?
From the knowledge of the experts
Interviews with Engineering and Service Reps.Computational Capture and Analysis“Same problem, Different Machine”
Fast cycle time discovery of hidden knowledge
Data MiningMany algorithms that deliver rule ready resultsTriage the rules with the SMEs before deploymentTest for effectiveness in the field
Oracle OpenWorld: October 2006 14
Competitive Benchmarking
Our Choice
Oracle OpenWorld: October 2006 15
Detecting the Unexpected
Oracle OpenWorld: October 2006 16
Domain Analysis
DeployKnowledge
Target Data Set(s)
Target Data Set(s)
DataPreprocessing
DataPreprocessing
DataReduction
DataReduction
Data MiningTask SelectionData Mining
Task Selection
AlgorithmSelection
AlgorithmSelection
Data MiningData Mining
Interpretationof Results
Interpretationof Results
Repeatas
necessary
Before you can do Data Mining…• Business Hypotheses – What problem are you trying to address?• Cost/benefit modeling• Domain Knowledge Acquisition
• Assemble Relevant Data Sources• Business Processes• Numerical and Textual
• SQL to summarize/aggregate data• Pre-computed fields
• Find useful variables
• Classification: Identify clusters that describe behaviors• Association: What variables describe the problem?
• Tools & documentation• Reports and proposals for Business Decisions and Implementation• Rollout & Feedback – Quantify Benefits
Data Mining Expertise can be utilized for:• Product and Architecture Decisions• Post Launch Product Improvement• Expand revenue opportunities • Post Sale Services Improvement• Customer Relationship Management