Top Banner
1 SPLUNK OVERVIEW ALEXANDER FOK BIG DATA ARCHITECT FEBRUARY 2017
16

Splunk Architecture overview

Jan 21, 2018

Download

Technology

Alex Fok
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Splunk Architecture overview

1

SPLUNK OVERVIEW

ALEXANDER FOK

BIG DATA ARCHITECT

FEBRUARY 2017

Page 2: Splunk Architecture overview

2

• What is Splunk?

• Splunk Main Functionality

• Why Splunk?

• Demo

Agenda

Page 3: Splunk Architecture overview

3

• Real Time logs collection, indexing and data analytics

• Time Series data management

• Search query language

tail –f ALL_LOGS*| grep “WHATEVER YOU NEED”

tail –f SOME_LOGS| grep “WHATEVER YOU NEED”| count by InterestingField

Commands Pipe

tail –f SOME_LOGS| grep “WHATEVER YOU NEED”| count by InterestingField

3

Splunk Main Functionality

Page 4: Splunk Architecture overview

4

• Presentation level – graphs, tables, etc

• Historical analysis

• Automation capabilities

• APIs

REST

Command line

Data Exploration and Visualization Capabilities

Page 5: Splunk Architecture overview

5

• Rolls Royce in the field

• Proven field success

• Flexible, User friendly, modern tool

• Enterprise grade – users access management, security, multitenant platform, data retention management policy

•Reach Echo system

•Splunk appstore –splunkbase.splunk.com

5

Why Splunk?

Page 6: Splunk Architecture overview

6

•Strong Visualization Capabilities – reports, dashboards

•Infinite scale – up to hundreds of TB logs per day

•Strong Post processing capabilities - Calculated and Extracted Fields

•Various Optimizations

Frequent Reports precalculation

Lookup tables

Field Tags

•Advanced Data Models - CIM

Why Splunk?

Page 7: Splunk Architecture overview

7

sourcetype=mysql_config OR sourcetype=remedy_changeticket

| dedup _raw, User

| transaction TicketId, User

| eval hasTicket = if(eventcount > 1, "Yes", "No")

| rename PrevPropValue as "Original_Value", NewPropValue

as "New_Value", hasTicket as "Change_Ticket"

| fields _time, User, Property, "Original_Value",

"New_Value", "Change_Ticket"

Page 8: Splunk Architecture overview

8

• Web logs• Log4J, JMS, JMX• .NET events• Code and scripts

• Configurations• syslog• SNMP• netflow

• Configurations• Audit/query logs• Tables• Schemas

• Hypervisor• Guest OS, Apps• Cloud

• Configurations• syslog• File system• ps, iostat, top

• Registry• Event logs• File system• sysinternals

Logfiles Configs Messages Traps Alerts

Metrics Scripts TicketsChanges

Linux/UnixWindows NetworkingDatabasesApplicationsVirtualization

& Cloud

• Click-stream data• Shopping cart data• Online transaction

data

Customer Facing Data

Outside the Datacenter

• Manufacturing, logistics…

• CDRs & IPDRs• Power consumption• RFID data• GPS data

No predefined schema, no custom connectors, no RDBMS, no need to filter/forward.

Splunk – The Big Picture

8

Page 9: Splunk Architecture overview

9

Splunk Architecture

Page 10: Splunk Architecture overview

10

Splunk’s MapReduce-based Architecture10

Chunk 1

Chunk 2

Chunk 3

Chunk 4

Chunk 1

Chunk 2

Chunk 3

Chunk 4

Chunk 1

Chunk 2

Chunk 3

Chunk 4

Search Head

map

map

map

map

map

map

map

map

map

Answer

reduce

Server 1 Server 2 Server N

time

Page 11: Splunk Architecture overview

11

•Events, Indexes, Fields – key value pairs, columns

•Index Time

events are processed, classified, time stamp is extracted indexed

Predefined Fields are extracted

events can be enriched

Events can trigger logic -> alerts, reports, dashboards updates etc

•Search Time

events are searched

fields are extracted or calculated

transactions are closed

Visualizations can be built

Splunk Typical WorkFlow

Page 12: Splunk Architecture overview

12

•Show events counts by SFlow

•SFlow|stats count by SFlow

•| transaction SAUPID startswith="Product Start" endswith="Product End"

Demo

Page 13: Splunk Architecture overview

13

What is An App?

• Terminology

• Apps – A workspace that solves a specific use case with a navigable view

• Add-on – A reusable Splunk component that does not contain a view

• Example

• Splunk for Cisco Security is an App

• The collection of field extractions/sourcetypes/transforms/eventypes thatmap raw firewall logs is an Add-on

Page 14: Splunk Architecture overview

14

•CIM – Common Information Model

•Domain centric data models – OSSEC, networking, ticket management

•Data normalization

•Validation

•Visualization

•Action generation

Splunk as SIEM

Page 15: Splunk Architecture overview

15

Marathon Tel Aviv 2017 – See you tomorrow

Page 16: Splunk Architecture overview

16

Alexander Fok, Big Data Architect

THANK YOU