This post is about the algorithms discussed by Ted Dunning in Spark Summit 2015
Algorithms are:
- Hashing and Sketching
- Hyper log log = count distinct
- Count min = count(s)
- Streaming k-means
- Quantiles via t-digest
You can see the video from this link.