1. Apache Spark The Emerging Platform for Distributed Analytics July 2014 Thomas W. Dinsmore 2. What is Apache Spark? • Distributed in-memory analytics engine • Runs…