*** This repository is a modification of the following one: ***
https://github.com/infolab-usc/SCAWG
- Added straightforward running parameters to easily generate synthetic/synthesis data. Please see main.Entry for all parameters.
- Added support for other dataset formats.
- Real datasets should be put in `dataset/real/{name}` folders. Please see each processor under org.geocrowd.synthesis for details.
- For synthesis data, the result is put in `dataset/{name}/task` and `dataset/{name}/worker`. For synthetic data, the result is put in `dataset/{distribution}/task` and `dataset/{distribution}/worker`.
- Please remember to clean the result folder before each run, so that the output of different runs does not mix together.
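
Cleaning the result folders can be scripted rather than done by hand. The following is only a minimal sketch, assuming the output layout described above (a `task` and a `worker` folder under `dataset/{name}` or `dataset/{distribution}`). The class `CleanResults` and its methods are hypothetical helpers and are not part of this repository.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of this repository.
public class CleanResults {

    /** Deletes everything under dir but keeps dir itself; does nothing if dir is missing. */
    static void cleanDirectory(Path dir) throws IOException {
        if (!Files.isDirectory(dir)) {
            return;
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(dir)) {
            // Reverse order puts children before their parents, so folders are empty when deleted.
            paths = walk.sorted(Comparator.reverseOrder()).collect(Collectors.toList());
        }
        for (Path p : paths) {
            if (!p.equals(dir)) {
                Files.delete(p);
            }
        }
    }

    /** Cleans {datasetRoot}/{name}/task and {datasetRoot}/{name}/worker. */
    static void cleanResults(Path datasetRoot, String name) throws IOException {
        cleanDirectory(datasetRoot.resolve(name).resolve("task"));
        cleanDirectory(datasetRoot.resolve(name).resolve("worker"));
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java CleanResults <name-or-distribution>");
            return;
        }
        // Assumes the program is started from the repository root, next to the dataset folder.
        cleanResults(Paths.get("dataset"), args[0]);
    }
}
```

Run it with the dataset name or distribution name as the only argument before starting a new generation run. The real input under `dataset/real/{name}` is left alone as long as the argument is not `real`.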