WavingSketch

Introduction

Finding top-k items in data streams is a fundamental problem in data mining. Existing algorithms that can achieve unbiased estimation suffer from poor accuracy. In this paper, we propose a new sketch, WavingSketch, which is much more accurate than existing unbiased algorithms. WavingSketch is generic, and we show how it can be applied to four applications: finding top-k frequent items, finding top-k heavy changes, finding top-k persistent items, and finding top-k Super-Spreaders. We theoretically prove that WavingSketch can provide unbiased estimation, and then give an error bound of our algorithm. Our experimental results show that, compared with the state-of-the-art, WavingSketch has 6 times higher insertion speed and at least 1000 times lower error rate in finding frequent items when memory size is tight. For other applications, WavingSketch also achieves higher accuracy and faster insertion speed. All related codes are open-sourced and available at Github.

About this repo

Waving contains codes of WavingSketch (including Multi-Counter WavingSketch) and the algorithms we compared with in the four tasks.
SIMD contains codes of WavingSketch (including Multi-Counter WavingSketch) and the related algortihms implemented using SIMD optimization in the task of finding frequent items. It also contains the codes of our compression and expansion experiments.
Flink contains codes of WavingSketch implemented on top of Apache Flink and a sample dataset used in Flink experiments.
More details can be found in the folders.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
Flink		Flink
SIMD		SIMD
Waving		Waving
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WavingSketch

Introduction

About this repo

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

IceGiraffe/SlidingFilter_Waving

Folders and files

Latest commit

History

Repository files navigation

WavingSketch

Introduction

About this repo

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages