1. Scaling ETL with Hadoop
   Gwen Shapira, Solutions Architect
   @gwenshap
3. ETL is…
   • Extracting data from outside sources
   • Transforming…