multivariate Time Series Anomaly Detection Benchmark Suites

Benchmark Suites for Anomaly Detection on Multivariate Time Series

Features

Two benchmark suites
1. Fully synthetic benchmark suite
2. Semi-realistic benchmark suite
Only multivariate sequences
Expressive and compact
Full knowledge about the sequences and the anomalies
Compatible to TimeEval
Online Dataset visualizer: https://2er0.github.io/mTADS

Fully synthetic benchmark suite samples

Semi-realistic benchmark suite sample

Usage

Sequence iterator

Use a file iterator to load the sequences or the build in loading functionality with caching capabilities.

from loading import load_all_stored_datasets

generator = load_all_stored_datasets("fsb")  # or srb
for gen in generator:
    group, parameters, train_sequence, test_sequence = gen
    ...

group contains the name of the loaded sequence and may contain no-anomaly in the end, which means that the training sequences does not contain any anomaly and is for unsupervised and semi-supervised algorithms.
parameters contains the generation configuration and varies between the GutenTAG generated ones and the added.
train_sequence is a Pandas DataFrame containing the sequence for training with the column structure timestamp,value-0,value-1,…,is_anomaly and is fully compatible to GutenTAG and TimeEval
test_sequence is a Pandas DataFrame containing the sequence for testing with the column structure timestamp,value-0,value-1,…,is_anomaly and is fully compatible to GutenTAG and TimeEval

Compatibility note

The semi-realistic benchmark suite is not fully compatible with the TimeEval framework because the sequences include additional information. They include in the which channel contains the anomaly and the magnitude of the artificial change. This information is automatically filtered out by using the provided loader in loading.py.

Generate sequences

Executing the main.py file generates the sequences that are not covered via the GutenTAG framework:

usage: main.py [-h] [--suite {FSB,SRB}] [--save SAVE] [--plot PLOT] [--interactive INTERACTIVE] [--remove_orphan_only REMOVE_ORPHAN_ONLY]

optional arguments:
  -h, --help            show this help message and exit
  --suite {FSB,SRB}     Name name of the benchmark suite to generate: 'FSB' or 'SRB'
  --save SAVE           Save the generated sequences: 'yes' or 'no'
  --plot PLOT           Render each sequences after generation: 'yes' or 'no'
  --interactive INTERACTIVE
                        Generate sequences interactively with with user input to continue: 'yes' or 'no'
  --remove_orphan_only REMOVE_ORPHAN_ONLY
                        Remove orphan sequences only: 'yes' or 'no'

Generate the GutenTAG framework covered sequences:

pip install timeeval-gutenTAG
gutenTAG --config-yaml GutenTAG-config.yaml --seed 11

It is required to have GutenTAG installed on the system to generate the sequences.

Generate new sequences

Changing parameters

To generate new sequences that are covered by the GutenTAG framework, then please follow their documentation and update the configuration in GutenTAG-config.yaml accordingly and generate the sequences with the command above.

To generate new sequences that ere not covered by the GutenTAG framework, then update parameterization in the file generators.py and execute the main.py after. Be aware that not all generators configured in the generator.py file are generated. They need to be added to the main.py file to be generated.

Citation

If you use mTADS in your project or research, please cite our demonstration paper:

@INPROCEEDINGS{baumgartner_mtads_2023,
  author={Baumgartner, David and Langseth, Helge and Ramampiaro, Heri and Engø-Monsen, Kenth},
  booktitle={2023 IEEE International Conference on Big Data (BigData)}, 
  title={mTADS: Multivariate Time Series Anomaly Detection Benchmark Suites}, 
  year={2023},
  pages={588-597},
  keywords={Uncertainty;Manufacturing processes;Machine learning algorithms;Time series analysis;Medical services;Benchmark testing;Distance measurement;Machine learning;Anomaly detection;Time series},
  doi={10.1109/BigData59044.2023.10386980}}
}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
fsb_timeseries		fsb_timeseries
images		images
results		results
srb_timeseries		srb_timeseries
.gitattributes		.gitattributes
.gitignore		.gitignore
GutenTAG-config.yaml		GutenTAG-config.yaml
LICENSE		LICENSE
README.md		README.md
generators.py		generators.py
loading.py		loading.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

multivariate Time Series Anomaly Detection Benchmark Suites

Features

Fully synthetic benchmark suite samples

Semi-realistic benchmark suite sample

Usage

Sequence iterator

Compatibility note

Generate sequences

Generate new sequences

Changing parameters

Citation

Contributors

About

Releases 1

Packages

Languages

License

2er0/mTADS

Folders and files

Latest commit

History

Repository files navigation

multivariate Time Series Anomaly Detection Benchmark Suites

Features

Fully synthetic benchmark suite samples

Semi-realistic benchmark suite sample

Usage

Sequence iterator

Compatibility note

Generate sequences

Generate new sequences

Changing parameters

Citation

Contributors

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages