Top Banner
Splunk Spark Integration Gang Tao
18

Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Feb 24, 2018

Download

Documents

vuongcong
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk Spark IntegrationGang Tao

Page 2: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

About Me• Software Engineer with 15+ Years experience • Now architect working on Data acquisition and Cloud App

• Used to be working on BI, ERP and other Enterprise application development

• Like data science and open source

Page 3: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk'Company'Overview'

3"

Company''

•  Global"HQs:""!  San"Francisco"!  London""!  Hong"Kong"

•  1,800+"employees"globally"•  Annual"Revenue:"$450.9M"(YoY"+49%)"

•  NASDAQ:"SPLK"

Products'

•  Free"trial"to"massive"scale"•  Splunk"products:""

!  Splunk"Enterprise"!  Splunk"Cloud"!  Hunk"!  Splunk"Light"!  Splunk"MINT"!  Premium"SoluWons"

Customers''

•  10,000+"customers"•  Across"100"countries"•  Small"to"large"organizaWons"

•  More"than"80"of"the"Fortune"100"

•  Largest"license:""!  400+"Terabytes/day"

Page 4: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk'–'a'Data'Pla-orm'

Mainframe)Data)

VMware)

Pla0orm)for)Machine)Data)

Exchange) PCI)Security)

Rela=onal)Databases)

Mobile)Forwarders) Syslog)/))TCP)/)Other)

Sensors)&)Control)Systems)

Wire))Data)

Mobile)Intel)

Splunk'Premium'Apps' Rich'Ecosystem'of'Apps'

MINT')

Splunk - a Machine Data Platform

Page 5: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Demo

Page 6: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk Technical Stack

PresentingProcessing

StoreAcquisition

Page 7: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk Deployment ArchitectureIndexer store  data,  transform  row  data  into  events  and  searches  the  indexed  data  in  response  to  search  requests.  

Search  Headdirects  search  requests  to  a  set  of  indexers,  merges  the  results  and  presents  them  to  the  user  

Forwarderget  data  into  indexers  

Page 8: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk VS Open Source

Page 9: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Splunk VS Open Source

Page 10: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

SQL of Machine Data - SPLSPL  –  Splunk  Processing  Language  

SQL  *nix  Pipe  Google  Search

Page 11: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Extensibility - Splunk Apph0p://apps.splunk.com/    

Enterprise  Security  ITSI  DB  Connect  Technology  Add-­‐ons

Page 12: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Why Integration?• Splunk to Spark

• Data Ingestion

• Unstructure/Semi Structure data Indexing

• Data processing with Splunk search

• Data Presenting

• Spark to Splunk

• Powerful computing capability

• Machine Learning

• Open Source community

Page 13: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Solution A

Page 14: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Solution B

Page 15: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Solution C

Indexer

Virtual Indexer (Spark)

SPL

Enhanced Search Command

Spark Driver

(SPL Parser)

Spark Worker

Spark Worker

Spark Worker

Page 16: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Challenges• Avoid big data movement

• keep good user experience

• Adapt to SPL concept

Page 17: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created
Page 18: Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created