DOCUMENT RESOURCES FOR EVERYONE
Documents tagged
Data & Analytics Similarity at scale

This is a presentation I gave at Hadoop Summit San Jose 2014, on doing fuzzy matching at large scale using combinations of Hadoop & Solr-based techniques.