HyperLogLog - an algorithm for approximating the number of distinct elements

An improved version of HyperLogLog for the count-distinct problem, approximating the number of distinct elements in a multiset using 33-50% less space than other common HyperLogLog implementations.

This work is based on "Better with fewer bits: Improving the performance of cardinality estimation of large data streams - Qingjun Xiao, You Zhou, Shigang Chen".

The core differences between this and other implementations are:

- A sparse representation for lower cardinalities (like HyperLogLog++).
- LogLog-Beta for dynamic bias correction at medium and high cardinalities.
- 4-bit registers instead of 5-bit (HLL) and 6-bit (HLL++) ones; most implementations use 1-byte registers out of convenience.

In general it borrows a lot from InfluxData's fork of Clark Duvall's HyperLogLog++ implementation, but uses 50% less space.

Results

A direct comparison with the HyperLogLog++ implementation used by InfluxDB yielded the following results:

[Results table not recovered; its first column was "Exact".]

A big thank you to Prof. Shigang Chen and his team at the University of Florida, who are actively conducting research around "Big Network Data".
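The 4-bit register point in the list of core differences is easier to picture with a small sketch of how two registers can share one byte. This is not the library's code: `NibbleRegisters`, `get`, and `set_max` are hypothetical names, and clamping values at 15 is an assumption made to keep the sketch short, since the text does not say how larger values are handled.

```python
class NibbleRegisters:
    """Fixed-size array of 4-bit registers, packed two per byte (illustrative)."""

    def __init__(self, count: int) -> None:
        if count <= 0 or count % 2:
            raise ValueError("count must be a positive even number")
        self.count = count
        self.data = bytearray(count // 2)  # half a byte per register

    def get(self, i: int) -> int:
        """Return register i: even indices use the high nibble, odd the low."""
        byte = self.data[i // 2]
        return byte >> 4 if i % 2 == 0 else byte & 0x0F

    def set_max(self, i: int, value: int) -> None:
        """Keep the larger of the stored value and `value`.

        Assumption: values above 15 are clamped, because a 4-bit register
        cannot hold more. A real implementation may handle this differently.
        """
        value = min(value, 15)
        if value <= self.get(i):
            return
        j = i // 2
        if i % 2 == 0:
            self.data[j] = (value << 4) | (self.data[j] & 0x0F)
        else:
            self.data[j] = (self.data[j] & 0xF0) | value


# 16384 registers is an illustrative size (precision 14), not a library default.
regs = NibbleRegisters(16384)
regs.set_max(0, 5)
regs.set_max(1, 20)      # clamped to 15 in this sketch
print(len(regs.data))    # 8192
print(regs.get(0), regs.get(1))  # 5 15
```

With 16384 registers, the packed array takes 8192 bytes, against 16384 bytes for 1-byte registers (50% less) and 12288 bytes for tightly packed 6-bit registers (about 33% less). That arithmetic is consistent with the 33-50% range quoted above.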