1
Porcupine: A Highly Available Cluster-based Mail Service
Yasushi Saito, Brian Bershad, Hank Levy
University of Washington Department of Computer Science and Engineering,
Seattle, WA
http://porcupine.cs.washington.edu/
2
Why Email?
• Mail is important: real demand
• Mail is hard: write intensive, low locality
• Mail is easy: well-defined API, large parallelism, weak consistency
3
Goals
Use commodity hardware to build a large, scalable mail service
Three facets of scalability:
• Performance: linear increase with cluster size
• Manageability: react to changes automatically
• Availability: survive failures gracefully
4
Conventional Mail Solution
Static partitioning
• Performance problems: no dynamic load balancing
• Manageability problems: manual data-partitioning decisions
• Availability problems: limited fault tolerance
[Diagram: an SMTP/IMAP/POP front end statically mapped onto NFS servers, each holding a fixed set of mailboxes (Bob's, Ann's, Joe's, Suzy's).]
5
Presentation Outline
• Overview
• Porcupine Architecture: key concepts and techniques; basic operations and data structures; advantages
• Challenges and solutions
• Conclusion
6
Key Techniques and Relationships
Framework: functional homogeneity ("any node can perform any task")
Techniques: automatic reconfiguration, load balancing, replication
Goals: manageability, performance, availability
7
Porcupine Architecture
[Diagram: identical nodes A, B, ..., Z connected by RPC. Every node runs the same components: SMTP, POP, and IMAP servers, a load balancer, a replication manager, and a membership manager, together with the shared data structures: user map, mail map, user profile, and mailbox storage.]
8
Porcupine Operations
[Diagram: delivery flow across nodes A, B, and C; a code sketch follows.]
1. "Send mail to bob" arrives from the Internet; DNS-RR selection hands the connection to node B (protocol handling).
2. B consults its user map: who manages bob? Answer: A.
3. B asks A to "verify bob" (user lookup).
4. A replies: "OK, bob has msgs on C and D."
5. B's load balancer picks the best node to store the new msg: C.
6. B sends "store msg" to C (message store).
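Expressed as code, the delivery path is a short chain of these stages. Below is a minimal C++ sketch; every identifier (DeliverMail, LookupManager, and so on) is a hypothetical stand-in for Porcupine's internal interfaces, with the RPCs stubbed out.

    #include <string>
    #include <vector>

    // Hypothetical stand-ins; each body is a stub for an RPC.
    std::string LookupManager(const std::string& user) {
      return "A";  // step 2: hash(user) indexes the replicated user map
    }
    bool VerifyUser(const std::string& manager, const std::string& user) {
      return true;  // step 3: the managing node checks the user profile
    }
    std::vector<std::string> MailMapOf(const std::string& user) {
      return {"C", "D"};  // step 4: nodes already holding this mailbox
    }
    std::string PickBestNode(const std::vector<std::string>& spread) {
      return spread.front();  // step 5: load balancer picks within the spread
    }
    void StoreMessage(const std::string& node, const std::string& msg) {
      // step 6: write the message on the chosen node's mailbox storage
    }

    // Any node can run this; there is no distinguished front end.
    void DeliverMail(const std::string& user, const std::string& msg) {
      std::string manager = LookupManager(user);         // step 2
      if (!VerifyUser(manager, user)) return;            // step 3: bounce
      StoreMessage(PickBestNode(MailMapOf(user)), msg);  // steps 4-6
    }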
9
Basic Data Structures
[Diagram: the three structures, shown for nodes A, B, and C.]
• User map: apply a hash function to the user name ("bob"); the resulting bucket table (B C A C A B A C), replicated identically on every node, names the managing node.
• Mail map / user info: kept on the managing node, e.g. bob: {A,C}, ann: {B}, suzy: {A,C}, joe: {B}.
• Mailbox storage: each node stores message fragments; here A holds Bob's and Suzy's msgs, B holds Joe's and Ann's msgs, and C holds Bob's and Suzy's msgs.
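For concreteness, here is a small, compilable C++ sketch of the two lookup structures; UserMap and MailMap are illustrative names, not Porcupine's real types.

    #include <functional>
    #include <map>
    #include <set>
    #include <string>
    #include <vector>

    // User map: a small bucket table, identically replicated on every
    // node, mapping hash(user) to the managing node; the row
    // B C A C A B A C above is such a table with eight buckets.
    struct UserMap {
      std::vector<char> buckets;
      char ManagerOf(const std::string& user) const {
        return buckets[std::hash<std::string>{}(user) % buckets.size()];
      }
    };

    // Mail map: kept only on the managing node; lists the nodes that
    // hold fragments of each user's mailbox.
    using MailMap = std::map<std::string, std::set<char>>;

    int main() {
      UserMap um{{'B', 'C', 'A', 'C', 'A', 'B', 'A', 'C'}};
      MailMap mm = {{"bob", {'A', 'C'}}, {"ann", {'B'}}};
      char manager = um.ManagerOf("bob");  // same answer on every node
      (void)manager;
      (void)mm;
    }

Because the user map is tiny and identical everywhere, a membership change only requires recomputing the table, not moving user data.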
10
Porcupine Advantages
Advantages:
• Optimal resource utilization
• Automatic reconfiguration and task redistribution upon node failure/recovery
• Fine-grain load balancing
Results: better availability, better manageability, better performance
11
Presentation Outline
• Overview
• Porcupine Architecture
• Challenges and solutions: scaling performance; handling failures and recoveries (automatic soft-state reconstruction, hard-state replication); load balancing
• Conclusion
12
Performance
Goal: scale performance linearly with cluster size
Strategy: avoid creating hot spots
• Partition data uniformly among nodes
• Use fine-grain data partitioning
13
Measurement Environment
• 30-node cluster of not-quite-identical PCs
• 100 Mb/s Ethernet + 1 Gb/s hubs
• Linux 2.2.7
• 42,000 lines of C++ code
• Synthetic load
• Compared against sendmail+popd
14
How does Performance Scale?
[Graph: messages/second (0 to 800) vs. cluster size (0 to 30 nodes). Porcupine scales roughly linearly, reaching 68 million messages/day at 30 nodes; sendmail+popd reaches only 25 million messages/day.]
15
Availability
Goals:
• Maintain function after failures
• React quickly to changes regardless of cluster size
• Degrade and improve performance gracefully
Strategy: two complementary mechanisms
• Hard state (email messages, user profile): optimistic fine-grain replication
• Soft state (user map, mail map): reconstruction after membership change
16
Soft-state Reconstruction
[Diagram: a timeline on nodes A and B after a membership change; a code sketch follows.]
1. Membership protocol and user map recomputation: every node reruns the same computation, so the user map (e.g. B C A B A B A C) converges to a new assignment that excludes the failed node (e.g. B A A B A B A B).
2. Distributed disk scan: mail map entries that the failed node managed (suzy: {A,B}, ann: {B}) start out empty on their new managers; each node scans its local mailbox storage and reports what it holds, refilling them. Entries unaffected by the change (bob: {A,C}, joe: {C}) survive intact.
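A compressed C++ sketch of the two steps, using assumed helper names; the membership agreement protocol itself is out of scope here, and the round-robin recomputation rule is only an illustration of a deterministic assignment.

    #include <map>
    #include <set>
    #include <string>
    #include <vector>

    using NodeId = char;
    using MailMap = std::map<std::string, std::set<NodeId>>;

    // Step 1: after the membership protocol agrees on the live set,
    // every node reruns the same deterministic rule, so all nodes
    // converge on an identical bucket-to-node assignment.
    std::vector<NodeId> RecomputeUserMap(const std::vector<NodeId>& live,
                                         size_t buckets) {
      std::vector<NodeId> usermap(buckets);
      for (size_t i = 0; i < buckets; ++i)
        usermap[i] = live[i % live.size()];  // illustrative round-robin
      return usermap;
    }

    // Step 2: each node scans its local mailbox storage and reports
    // which users have mail there; the new manager merges the reports
    // to rebuild its mail map entries, e.g. suzy: {A}, then {A,B}.
    void MergeDiskScanReport(NodeId reporter,
                             const std::vector<std::string>& users_on_disk,
                             MailMap& mailmap) {
      for (const std::string& u : users_on_disk)
        mailmap[u].insert(reporter);
    }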
17
How does Porcupine React to Configuration Changes?
[Graph: messages/second (300 to 700) over time (0 to 800 seconds) for no failure, one node failure, three node failures, and six node failures. Annotations mark when nodes fail, when the new membership is determined, when nodes recover, and when membership is determined again; throughput dips at each change and recovers once the new membership is determined.]
18
Hard-state Replication
Goals:
• Keep serving hard state after failures
• Handle unusual failure modes
Strategy: exploit Internet semantics (see the sketch below)
• Optimistic, eventually consistent replication
• Per-message, per-user-profile replication
• Efficient during normal operation
• Small window of inconsistency
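A hedged C++ sketch of the update push, assuming a SendUpdate RPC and a per-message set of unacknowledged replicas; the real protocol carries more machinery than this.

    #include <iterator>
    #include <set>
    #include <string>

    struct ReplicatedMsg {
      std::string id, body;
      std::set<std::string> pending;  // replicas not yet known to hold it
    };

    // Hypothetical RPC; true means the peer applied and acknowledged.
    bool SendUpdate(const std::string& peer, const ReplicatedMsg& m) {
      return true;
    }

    // The initiating node applies the update locally first and answers
    // the client right away, then pushes to each replica until all
    // acknowledge. The gap between the local apply and the last ack is
    // the "small window of inconsistency"; a background retry loop
    // drains pending sets that survive a peer failure, which is what
    // makes the scheme eventually consistent.
    void PushUpdate(ReplicatedMsg& m) {
      for (auto it = m.pending.begin(); it != m.pending.end();)
        it = SendUpdate(*it, m) ? m.pending.erase(it) : std::next(it);
    }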
19
How Efficient is Replication?
[Graph: messages/second (0 to 800) vs. cluster size (0 to 30 nodes) for Porcupine with no replication and with replication=2. At 30 nodes, no replication reaches 68 million messages/day; replication=2 reaches 24 million messages/day.]
20
How Efficient is Replication?
[Graph: same axes as the previous slide, adding a third line for replication=2 with NVRAM. At 30 nodes: no replication, 68m/day; replication=2, 24m/day; replication=2 with NVRAM, 33m/day.]
21
Load balancing: Deciding where to store messages
Goals:
• Handle skewed workloads well
• Support hardware heterogeneity
• No voodoo parameter tuning
Strategy: spread-based load balancing (sketched below)
• Spread: a soft limit on the number of nodes per mailbox
• Large spread: better load balance; small spread: better affinity
• Load is balanced within the spread, using the number of pending I/O requests as the load measure
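The selection rule fits in a few lines. The sketch below is an assumed rendering (PickStorageNode is not Porcupine's identifier) of how the spread bounds the candidate set and pending I/O picks the winner.

    #include <algorithm>
    #include <string>
    #include <vector>

    struct Node {
      std::string name;
      int pending_io;  // the load measure: outstanding disk requests
    };

    // Candidates are nodes already holding the user's mailbox (plus,
    // if fewer than `spread`, fresh nodes may be considered; not
    // shown). Within the spread, the least-loaded node wins.
    const Node* PickStorageNode(const std::vector<const Node*>& candidates,
                                size_t spread) {
      size_t n = std::min(candidates.size(), spread);
      return *std::min_element(candidates.begin(), candidates.begin() + n,
                               [](const Node* a, const Node* b) {
                                 return a->pending_io < b->pending_io;
                               });
    }

Using pending I/O rather than, say, CPU load is what makes the scheme parameter-free across heterogeneous hardware: a faster disk simply drains its queue sooner and attracts more messages.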
22
How Well does Porcupine Support Heterogeneous Clusters?
[Graph: throughput increase (0% to 30%) vs. number of fast nodes (0% to 10% of total). With spread=4, throughput rises by 16.8m/day (+25%); with static partitioning, by only 0.5m/day (+0.8%).]
23
Conclusions
Fast, available, and manageable clusters can be built for write-intensive services.
Key ideas can be extended beyond mail:
• Functional homogeneity
• Automatic reconfiguration
• Replication
• Load balancing