Top Banner
Use Cases For Cassandra in Federal and State Government Chris Bradford and Matt Overstreet
22

Use cases for cassandra in federal and state government

Aug 17, 2015

Download

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Use cases for cassandra in federal and state government

Use Cases For Cassandra in Federal

and State Government

Chris Bradford and Matt Overstreet

Page 2: Use cases for cassandra in federal and state government

Matt Overstreet● Software Architect● Search relevancy engineer● Has worked on systems ranging

from Tractor Trailer weigh stations to celebrity websites

● Likes Cassandra

GitHub: omnifroodle

Page 3: Use cases for cassandra in federal and state government

● DataStax Cassandra Architect● Contributor to CQLEngine -

Python C* ORM● Developed Trireme -

a C* migration engine● Created the world’s smallest C*

cluster

Chris Bradford

Twitter: @bradfordcp

GitHub: bradfordcp

Page 4: Use cases for cassandra in federal and state government

Who we are● Consulting firm based in

Charlottesville Virginia● Founded in 2005● 30 consultants delivering projects● Focused on Search in 2010, specifically

Solr and Lucene● Delivering Cassandra Consulting since

2012● Datastax Gold partner● Great with Search, Analytics and

Discovery

Page 5: Use cases for cassandra in federal and state government

Blog & Publications● Blog: http://o19s.com/blog/● Twitter: @o19s● Books

o Relevant Search (Manning)

o Building a Search Server with Elasticsearch (Packt)

o Apache Solr Enterprise Search Server (Packt)

Page 6: Use cases for cassandra in federal and state government

How we got hereOpenSource Connections started with a deep expertise in full text search.

As the size and velocity of the data we interact with grew, so did our toolset for storing, presenting and processing that data.

Page 7: Use cases for cassandra in federal and state government

OSC Toolkit

Page 8: Use cases for cassandra in federal and state government

Some Use Cases- Analytics Workloads

- Welfare Fraud Detection- Intrusion Detection

- Distributed Data Warehousing- Data Warehouse/Sink- Replication & Recovery

Page 9: Use cases for cassandra in federal and state government

Analytics WorkloadsLook for patterns of user error, fraud and abuse in forms submitted to an agency.

Requires the ability to compare submissions to look for similar identifiers such like name, street address, etc

Page 10: Use cases for cassandra in federal and state government

Welfare Fraud Detection● Massive amounts of data● Hard to compare and find patterns● Difficult to incorporate human analysis

Page 11: Use cases for cassandra in federal and state government

Welfare Fraud Detection● Ingest data into the system or work on

data in place● Fraud Score Generation

o Automated ruleso Manually

● Employees can now focus on reviewing the flagged records

Page 12: Use cases for cassandra in federal and state government
Page 13: Use cases for cassandra in federal and state government

Intrusion Detection● Stream log data in to C* from

applications● Surface metrics through a security

dashboard● Perform analysis on records looking for

anomalies (Optional)CREATE TABLE ids ( window TIMESTAMP, route VARCHAR, status_code VARCHAR, request_id TIMEUUID, PRIMARY KEY ((window, route, status_code), request_id));

Page 14: Use cases for cassandra in federal and state government

Intrusion Detection

Page 15: Use cases for cassandra in federal and state government

Distributed Data Warehouse● Cassandra is designed in a peer

to peer architecture. There are no “masters” or “slaves”.

● True distributed load, write anywhere, read anywhere.

● Built-in replication between data centers.

Page 16: Use cases for cassandra in federal and state government

Simple Distributed Applications

Page 17: Use cases for cassandra in federal and state government
Page 18: Use cases for cassandra in federal and state government

Data Warehouse● Cassandra is used to house case data

from disparate systems● Data is then pushed into a full text

search index● Cases may now be searched through

an intuitive web interface

Page 19: Use cases for cassandra in federal and state government
Page 20: Use cases for cassandra in federal and state government

Operations

● Widely compatible with programming

languages used in enterprise

development

● OpsCenter monitoring tool

● Cassandra scales predictably

● Fault-tolerant

Page 21: Use cases for cassandra in federal and state government

Use Case Review● Analytics Workloads

○ Welfare Fraud Detection○ Intrusion Detection

● Distributed Data Warehousing○ Data Warehouse/Sink○ Replication & Recovery

Page 22: Use cases for cassandra in federal and state government

Q & A