Top Banner
für Radioastronomie Max-Planck-Institut Bareos in Radio Astronomy - Scaling up using Virtual Full Backups Bareos in Radio Astronomy – Scaling up using Virtual Full Backups Jan Behrend Max Planck Institute for Radio Astronomy Open Source Backup Conference September 23 rd 2014
29

Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

May 29, 2018

Download

Documents

Dang Thu
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Bareos in Radio Astronomy –Scaling up using Virtual Full Backups

Jan Behrend

Max Planck Institute for Radio Astronomy

Open Source Backup ConferenceSeptember 23rd 2014

Page 2: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

OverviewAbout the InstituteBackup Goals and LimitationsThe ChallengeImplementationConfiguraton StrategyVirtual Full BackupsIntegration with DRBDIntegration with REARWishlist

Page 3: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Max Planck Institute for Radio Astronomy

– Founded in 1966– Main Building Bonn / 100m Radio Telescope Effelsberg– Apex Telescope in the Atacama Desert, Chile

– IT staff of 11 people– 204 servers (132 virtualized, 110 Bareos clients)– 64 node cluster– . . .

http://mpifr.de

Page 4: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Lofar Antenna Field

“Software” Telescope44 Stations

Page 5: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Scientific Raw Data

Project Output per Observation RunK-Band receiver: 2 TiBH1-Survey: 3 TiBPulsar search: 4 TiBLeap: 7 TiBLofar: 50 TiBGlow: 120 TiB. . .

∑186 TiB

Forget it!

Page 6: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Scientific Raw Data

Project Output per Observation RunK-Band receiver: 2 TiBH1-Survey: 3 TiBPulsar search: 4 TiBLeap: 7 TiBLofar: 50 TiBGlow: 120 TiB. . .

∑186 TiB

Forget it!

Page 7: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

GoalsFast recovery of:

specific files / directories(many) complete systems

LimitationsTime

Backup window (scientists are night birds)Network bandwidth (usually 1GBit/s)Resources on backup clients (I/O, RAM, CPU)

Storage space (disk and tape)

What can we do?parallel jobs, Virtual Full Backupsvolume retention

Page 8: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

GoalsFast recovery of:

specific files / directories(many) complete systems

LimitationsTime

Backup window (scientists are night birds)Network bandwidth (usually 1GBit/s)Resources on backup clients (I/O, RAM, CPU)

Storage space (disk and tape)

What can we do?parallel jobs, Virtual Full Backupsvolume retention

Page 9: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

GoalsFast recovery of:

specific files / directories(many) complete systems

LimitationsTime

Backup window (scientists are night birds)Network bandwidth (usually 1GBit/s)Resources on backup clients (I/O, RAM, CPU)

Storage space (disk and tape)

What can we do?parallel jobs, Virtual Full Backupsvolume retention

Page 10: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2 · 2 · 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 11: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2 · 2 · 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 12: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2

· 2 · 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 13: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2 · 2

· 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 14: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2 · 2 · 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 15: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

“The Remains of the Day” without Raw Data

Full Backup Volume: 55 TiB (1.5M files / TiB)Differential Backup Volume: 566 GiB (4K files / GiB)Incremental Backup Volume: 102 GiBTwo backup copies

Time for a complete Full Backup and its 2nd copy

55 TiB130 MiB

s

· 2 · 2 · 1 d60 · 60 · 24 s

= 20.54 days

High Volume “Longterm” ClientsVirtual Fulls every half year, Incrementals every SaturdayLongterm Full Backup Volume: - 50 TiB (1M files / TiB)⇒ Full Volume every 8 weeks: 5 TiB (6M files / TiB)

Page 16: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Backup Hardware2 Tape Libraries: Spectra Logic T950

2 x LTO5 drives80 Slots a 1,5 TiB = 160 TiB (too small by now)Dedicated cleaning partition

2 physical servers (Fujitsu RX300S6)2x 30 TiB RAID storage (Nexan Satabeast2)(Mostly) dedicated 1GbE/10GbE backup network

Page 17: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Hardware Infrastructure

FC SAN FC SAN

BACKUPSRV2cold standby

BACKUPSRV1

BACKUP LAN BACKUP LAN

INTRANET BONNINTRANET EFFELSBERG

TAPE LIBRARY 1TAPE LIBRARY 2

8GBit

1GBit

3x10GBit

10GBit 10GBit

SATABEAST (30 TiB)

40km

SATABEAST (30 TiB)

Page 18: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Configuration StrategyIncrementals to disk for better restore performanceFulls and Differentials to cheaper tape storagePer client config (DIR and SD) for easier configmanagement:

compression, quota, reservation, encryption ...easy templating for new backup clients

Focus on schedules / retentionDaily backup summary (3rd party tools)Dedicated DB partition· · ·

Page 19: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Virtual Full Backups

Reorganizes existing backups to create a new Full

Last Full (2 months ago)FullPool

Last Differential (1 week ago)DiffPool

Incrementals (x days ago)IncrementalPool

New Virtual Full (today)VirtualPool

Subsequent Migration

Page 20: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Virtual Full Backups

+++ No backup client interaction!(basically) no backup time limit

– Needs designated pool to avoid lockups– No “MaxFullInterval” config option

Page 21: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

What is DRBD?

Taken from http://www.drbd.org/home/what-is-drbd

Page 22: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Integrating Bareos with HA / DRBD

DRBD volume backed up by HA controlled BAREOS instance

IMAP FailoverIP: 134.104.18.110BAREOS on /REAR enabled

IMAP FailoverIP: 134.104.18.111BAREOS on /REAR enabled

/var/mail on DRBDBAREOS on /var/mailIP: 134.104.18.77...

Client {Address = 134.104.18.110...FileSet {File = /

Client {Address = 134.104.18.77...FileSet {File = /var/mail

Client {Address = 134.104.18.111...FileSet {File = /

Page 23: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

REAR: Relax and Recover

Linux bare metal disaster recovery solutionCreates bootable mediaRecovery takes care of all disk configurationCan use Bareos to restore content

Page 24: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Bareos Client config

/etc/rear/local.confGRUB_RESCUE=OUTPUT=ISOBACKUP=BAREOSBACKUP_URL="rsync://backupsrv/srv/rear/"

/etc/bareos/bconsole.confConsole {

Name = client-restorePassword = "password"

}

Page 25: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Bareos Client config

/etc/rear/local.confGRUB_RESCUE=OUTPUT=ISOBACKUP=BAREOSBACKUP_URL="rsync://backupsrv/srv/rear/"

/etc/bareos/bconsole.confConsole {

Name = client-restorePassword = "password"

}

Page 26: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Bareos Server config

/etc/bareos/conf.d/client.dir.confConsole {Name = client-restorePassword = "password"ClientACL = client-fdJobACL = Restore, clientFileSetACL = clientCommandACL = status, restore[...]

}

http://relax-and-recover.org

Page 27: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

WishlistIntegration of virtualization solutions on hypervisor levelWrite to more than one destination at onceRolling Spool/DespoolAutomatic spooling when writing to tapeNo copy of jobs using a base job?”MaxFullInterval“ for Virtual Full Backups

Page 28: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Want configs? Send me a mail to [email protected]

Any questions?

Notable reference:http://myunix.dk/category/bacula

Page 29: Bareos in Radio Astronomy Scaling up using Virtual Full ... · Bareos in Radio Astronomy - Scaling up using Virtual Full Backups ... Open Source Backup Conference ... Jan Behrend

für Radioastronomie

Max-Planck-Institut

Bareos in Radio Astronomy - Scaling up using Virtual Full Backups

Want configs? Send me a mail to [email protected]

Any questions?

Notable reference:http://myunix.dk/category/bacula