Sliding aggregation

This is a Spark program implementing the sliding aggregation algorithm from

Prerequisites

Set up PySpark on your cluster, then run the program with:
spark-submit [spark_options] sliding_aggregation.py path_to_input_on_hdfs window_size path_to_output_on_hdfs number_of_partitions

The input is a collection of files in which each line has the form: number weight

e.g. file1.txt

1 2
2 5
5 8

file2.txt

2 3
5 6
9 1
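
To make the input format and the role of window_size concrete, here is a minimal single-machine sketch in plain Python. It is not the code in sliding_aggregation.py. It assumes that numbers and weights are integers, that records from all files are merged and sorted by number, and that the aggregate for a record is the sum of the weights of that record and the window_size - 1 records preceding it in sorted order. The function names parse_records and sliding_sums are hypothetical, and the window definition used by the actual program may differ.

```python
from itertools import chain


def parse_records(lines):
    """Parse 'number weight' lines into (number, weight) integer pairs."""
    records = []
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        records.append((int(parts[0]), int(parts[1])))
    return records


def sliding_sums(records, window_size):
    """Assumed definition: for each record in sorted order, sum the weights
    of that record and the (window_size - 1) records just before it."""
    ordered = sorted(records)
    result = []
    running = 0
    for i, (number, weight) in enumerate(ordered):
        running += weight
        if i >= window_size:
            # Drop the weight of the record that just left the window.
            running -= ordered[i - window_size][1]
        result.append((number, running))
    return result


# The two example files above, merged into one collection of records.
file1 = ["1 2", "2 5", "5 8"]
file2 = ["2 3", "5 6", "9 1"]
records = parse_records(chain(file1, file2))
sums = sliding_sums(records, window_size=2)
```

Under these assumptions the merged, sorted records are (1, 2), (2, 3), (2, 5), (5, 6), (5, 8), (9, 1), and each output pair holds a number together with the sum of the last two weights up to and including that record.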
