1. Runtime Internals 连城 [email protected][email protected] 2. What is • A fast and general engine for large-scale data processing • An open source implementation…
1. Evan Sparks and Ameet Talwalkar UC Berkeley UC Berkeley baseML baseML M ML M 2. Three Converging Trends 3. Big Data Three Converging Trends 4. Distributed…
Connected Homes Jim Anning, Head of Data & Analytics Josep Casals, Lead Data Engineer Data Science! Analytics! Data Engineering Hive 100K - 2 minutes 2.3 Billion Events…
1. Spark as a Platform to Support Multi-Tenancy and Many Kinds of Data Applications Kelvin Chu @ Uber 2. About Myself • Started with Spark 0.7 • Co-created Spark Job…
Slide 1 www.spark-project.org Spark Lightning-Fast Cluster Computing UC BERKELEY 1 This Meetup Project history and plans Spark tutorial Running locally and on EC2 Interactive…