Rethinking Data Warehousing & Analytics Ashish Thusoo, Facebook Data Infrastructure Team Why Another Data Warehousing System? Data, data and more data 200GB per day in…
1. Proprietary and Presenter: Konstantin Gredeskoul CTO, Wanelo.com Based on work of Atasay Gökkaya and other engineers "It's a Unix System! I know this!"…
1. Hive User Group Meeting August 2009 2. Hive Overview 3. Why Another Data Warehousing System? Data, data and more data 200GB per day in March 20085+TB(compressed) raw data…
1. Hive: A Petabyte Scale Data Warehouse System on Hadoop Ning Zhang Data Infrastructure Team Facebook 2. Overview Motivations Real world problems we faced at Facebook Why…
1. Hive User Group Meeting August 2009 2. Hive Overview 3. Why Another Data Warehousing System? Data, data and more data 200GB per day in March 20085+TB(compressed) raw data…
Slide 1CST8177 awk Slide 2 The awk program is not named after the sea-bird (that's auk), nor is it a cry from a parrot (awwwk!). It's the initials of the authors,…
Slide 1CSCI 330 T HE UNIX S YSTEM Awk Slide 2 W HAT IS AWK ? created by: Aho, Weinberger, and Kernighan scripting language used for manipulating data and generating reports…
1.Data Warehousing & Analytics on Hadoop Ashish Thusoo, Prasad Chakka Facebook Data Team 2. Why Another Data Warehousing System? Data, data and more data 200GB per day…
1.Hadoop and Hive Large Scale Data Processing using Commodity HW/SW Joydeep Sen Sarma2. Outline Introduction Hadoop Hive Hadoop/Hive Usage @Facebook Wishlists/Projects Questions…