Spark

Frequent Pattern Mining

TL; DR The field of Frequent Pattern Mining (FPM) encompasses a series of techniques for finding patterns within a dataset. This article will cover some of those techniques and how they can be used to extract behavioral patterns from anonymous interactions, in the context of an ecommerce site. Terms and

Creating Component Tests for Spark Applications

One of the main engineering challenges faced by the Empathy.co Data Team is creating robust tests for our Spark applications. Since these applications are constantly evolving, as for any application, we needed a way to ensure changes wouldn’t break the code; a guarantee that the output from our