Top Banner
Graphs for Fraud Detec1on [email protected] @ rvanbruggen
31

201411203 goto night on graphs for fraud detection

Jul 16, 2015

Download

Technology

Rik Van Bruggen
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 201411203 goto night on graphs for fraud detection

Graphs  for  

Fraud  Detec1on  [email protected]  

@rvanbruggen  

Page 2: 201411203 goto night on graphs for fraud detection

Agenda  •  About  Graphs  •  About  Graph  Databases  •  Why  Graph  Databases  ma<er  for  Fraud  Detec?on  –  Short  demonstra?on  

•  Case  Studies  •  Q&A  

Page 3: 201411203 goto night on graphs for fraud detection

Introduc?on:  about  Graphs  

Page 4: 201411203 goto night on graphs for fraud detection
Page 5: 201411203 goto night on graphs for fraud detection

Meet ���Leonhard Euler•  Swiss  mathema?cian  •  Inventor  of  Graph  Theory  (1736)  

Page 6: 201411203 goto night on graphs for fraud detection

Königsberg  (Prussia)  -­‐  1736  

Page 7: 201411203 goto night on graphs for fraud detection

A

B

D

C

Page 8: 201411203 goto night on graphs for fraud detection

A

B

D

C

1

23

4

76

5

Page 9: 201411203 goto night on graphs for fraud detection

About  Graph  Databases  

Page 10: 201411203 goto night on graphs for fraud detection

So  what  is  a  graph  database?  

•  OLTP  database  – “end-­‐user”  transac?ons  

•  Model,  store,  manage  data  as  a  graph  

Page 11: 201411203 goto night on graphs for fraud detection

What  is  a  graph?  Node  

Rela?onship  

Page 12: 201411203 goto night on graphs for fraud detection

Contrast  with  Rela?onal  

Graphs are often referred to as “Whiteboard Friendly”. The data model reflects the way a domain expert would naturally

draw their data on a whiteboard“The schema is the data”. Schema flexibility allows the system

to change in response to a changing environment

Page 13: 201411203 goto night on graphs for fraud detection

What  are  graphs  good  for?  

Complex  Querying  

Page 14: 201411203 goto night on graphs for fraud detection

Examples  of  complex  queries?  1.  Semi-­‐structure  in  datasets  

14

– Normaliza?on  introduces  complexity  

– Forces  developers  to  develop  all  kinds  of  logic  to  deal  with  this  variability  in  their  applica?on  logic  

Page 15: 201411203 goto night on graphs for fraud detection

Examples  of  complex  queries:  2.  Connectedness  in  data  

Lots  of  normalized  rela?onships  between  the  different  en??es,  forces  developers  to  do  •  Deep  joins  •  Recursive  joins  •  Pathfinding  opera?ons  •  “open-­‐ended”  queries  

Page 16: 201411203 goto night on graphs for fraud detection

Examples  of  Connectedness  

Page 17: 201411203 goto night on graphs for fraud detection

Graphs  in  Fraud  Detec1on?  

Page 18: 201411203 goto night on graphs for fraud detection

Recommender  Fraud  Detec?on  systems  

Page 19: 201411203 goto night on graphs for fraud detection

Recommender  Fraud  Detec?on  systems  

Fraud Detection

Page 20: 201411203 goto night on graphs for fraud detection

Graphs  in  Fraud  Detec?on  Systems  

•  Real  ?me  aspect  •  Detec?on  Prac?ces  rely  on  Graph  Algorithms  

•  Opera?onal    efficiency  

Page 21: 201411203 goto night on graphs for fraud detection

Real  1me  fraud  detec?on?  •  Context  is  everything  –  You  don’t  want  to  be  blocking  credit  cards  for  no  

reason…  false  posi?ves  are  fatal…  •  Complexity  ma<ers  –  Need  to  outsmart  the  “bad  guys”  –  Assume  that  they  can  and  will  understand/beat  

the  system  •  Visualiza?on  ma<ers  –  Manual  interven?on  relies  on  fast  understanding  

of  context  –  and  visualiza?on  helps  there  

Page 22: 201411203 goto night on graphs for fraud detection

Fraud  Detec?on  prac?ces:  Graph  Algorithms  

●  Helpful  for  naviga?ng  complex  networks  ●  tell  me  how  A  and  B  are  related  ●  The  things  on  the  path  between  A  and  B  could  very  well  be  

interes?ng  ●  ShortestPath,  AllShortestPaths,  Weighted  ShortestPath  (Dijkstra,  A*)  

●  Helpful  for  understanding  the  important  parts  of  a  network  ●  Clusters  ●  Bridges  

●  Centrality  ●  Betweenness  Centrality  

●  (Page)Ranking  

Page 23: 201411203 goto night on graphs for fraud detection

Opera1onal  Efficiency  

•  Graph  datamodel  removes  the  need  for  many  “batch  opera?ons”  – No  need  to  precalculate  –  just  feed  it  into  the  graph  

•  Complex  pa<ern  matching  in  milliseconds  •  Graph  Locality  ==  Predictability  &  Speed,  even  over  large  datasets  

Page 24: 201411203 goto night on graphs for fraud detection

●  Circular  pa<erns  omen  indicate  some  kind  of  a<empt  to  “trick  the  system”  

 

Short  demo  -­‐  1  

Page 25: 201411203 goto night on graphs for fraud detection

Short  demo  -­‐  2  

Page 26: 201411203 goto night on graphs for fraud detection

Use  Cases  (neo4j.com/use-­‐cases)  

Page 27: 201411203 goto night on graphs for fraud detection

Customers  (neo4j.com/customers)  

Page 28: 201411203 goto night on graphs for fraud detection

Graph  Gists  (h<p://gist.neo4j.org/)  

Page 29: 201411203 goto night on graphs for fraud detection

Neo Technology, Inc Confidential

Neo4j License Overview

Developer!Seats!

($6K*/Developer/Year)

Test!Instances!

($6K/Instance/Year)

Production!Instances!

(Bundle / Core Pricing)

Instances whose purpose is to ensure that the software accessing

Neo4j is meeting specification.!!

(e.g. System Test, Integration Test, UAT, Performance Test, Staging)

Instances that store and process data in a way that benefits and

advances an organization’s goals.!!

May be accessed by applications and/or end users

Includes access by programmers to licensed test instances, and

private instances on the programmer’s personal machine for the sole purpose of writing, debugging, or testing software

designed to access Neo4j

*Or otherwise, depending on the Bundle, and negotiation

Neo4j  versions  /  licenses  

Personal  <  Startup  /  Departmental  <  Enterprise  deployment  models  Open  source  &  Commercial  license  terms  available  

Specific  OEM  models  

Page 30: 201411203 goto night on graphs for fraud detection

Future  trainings  &  events!  

30

Page 31: 201411203 goto night on graphs for fraud detection

Neo  Technology  www.neotechnology.com    Neo4j  www.neo4j.org      [email protected]  or  +32  478  686800  

Q&A,  Conclusion,  Next  Steps