|
You are here |
www.evanmiller.org | ||
| | | | |
highlyscalable.wordpress.com
|
|
| | | | | Statistical analysis and mining of huge multi-terabyte data sets is a common task nowadays, especially in the areas like web analytics and Internet advertising. Analysis of such large data sets often requires powerful distributed data stores like Hadoop and heavy data processing with techniques like MapReduce. This approach often leads to heavyweight high-latencyanalytical processes and... | |
| | | | |
will-keleher.com
|
|
| | | | | HyperLogLogs leverage the power of hashing strings into random numbers to create remarkably accurate cardinality estimates. | |
| | | | |
geo.rocks
|
|
| | | | | Have you ever wondered how HyperLogLog works? Or have you never heard of it at all? In this post I explain the wonderful algorithm of Flajolet et al. from scratch and in a very simple manner. | |
| | | | |
orlp.net
|
|
| | | |||