GitHub - EmorZz1G/STAND: Labels Matter More Than Models: Quantifying the Benefit of Supervised Time Series Anomaly Detection

📊 Labels Matter More Than Models: Quantifying the Benefit of Supervised Time Series Anomaly Detection

This repository contains the implementation of STAND 🚀, a simple supervised time series anomaly detection baseline, as described in our paper. STAND demonstrates that with proper supervision, even simple models can outperform complex unsupervised approaches in time series anomaly detection tasks.

Fig. 1: Classification of time series anomaly detection methods.

💡 Key Findings

Labels Matter More Than Models: With sufficient labeled data, even simple supervised models consistently outperform sophisticated unsupervised methods.
Supervision Brings Higher Returns: The performance gain from using labeled data far exceeds the improvement from using more complex models.
Better Predictive Consistency and Anomaly Localization: Supervised methods, especially STAND, show superior performance in predicting consistent anomaly scores and precisely localizing anomalies.

📁 Project Structure

STAND/
├── src/
│   ├── data_utils/     # Data loading utilities
│   ├── exp/            # Experiment configurations and runners
│   ├── models/         # Model implementations
│   │   ├── supervised/ # Supervised models (including STAND)
│   │   └── ...         # Other model implementations
│   ├── scripts/        # Run scripts for different experiments
│   └── utils/          # Utility functions and dataset processing
├── logs/               # Experiment results and logs
├── docs/               # Documentation
├── tests/              # Testing scripts
├── requirements.txt    # Dependencies
└── README.md           # This file

🔧 Installation

# Create a virtual environment (recommended)
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

🚀 Usage

🎯 Supervised Experiments

Run supervised experiments with STAND and other baselines:

# Navigate to scripts directory
cd src/scripts

# Run STAND on all datasets
python run_supervised.py

# To run specific configurations, modify the script parameters
# For example, to run only STAND on specific datasets:
cd ../exp  # i.e. src/exp, relative to src/scripts from the step above
python supervised.py --model_name STAND --dataset_name PSM
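
The single-dataset command above can be repeated for each dataset. The following is a minimal sketch, not part of the repository: `build_commands` and `run_all` are hypothetical helpers, and the dataset identifiers are assumed to match the names in the Datasets section (only `PSM` is confirmed by the example above). It uses only the `--model_name` and `--dataset_name` flags shown in this README.

```python
import subprocess
import sys

# Assumed identifiers, taken from the Datasets section of this README.
# Only "PSM" appears in the README's own command example.
DATASETS = ["PSM", "SWaT", "WADI", "Swan", "Water"]


def build_commands(model_name="STAND", datasets=DATASETS):
    """Return one argv list per dataset for supervised.py."""
    return [
        [sys.executable, "supervised.py",
         "--model_name", model_name,
         "--dataset_name", name]
        for name in datasets
    ]


def run_all(cwd="src/exp", dry_run=True):
    """Print each command; run it from `cwd` only when dry_run is False.

    `cwd` is relative to the repository root (an assumption).
    """
    for cmd in build_commands():
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, cwd=cwd, check=True)


if __name__ == "__main__":
    run_all()  # dry run: prints the commands without launching them
```

Leaving `dry_run=True` by default makes it easy to check the generated commands before starting long experiments.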

🔍 Unsupervised Experiments

# Run unsupervised baselines
python run_unsupervised.py

🔄 Semi-supervised Experiments

# Run semi-supervised approaches
python run_semisupervised.py

📚 Datasets

The code supports five real-world time series datasets (PSM, SWaT, WADI, Swan, Water) for anomaly detection.

To download the datasets, please refer to the FTSAD project.

🏆 Key Results

📈 Performance Comparison

Supervised methods (STAD) significantly outperform traditional unsupervised methods (UTAD-I) and deep learning-based unsupervised methods (UTAD-II).
The STAND baseline achieves the best overall performance across multiple datasets.

📊 Supervision Benefit Analysis

Using only 10% of the labeled PSM data, a simple ExtraTrees model already outperforms the best unsupervised method.
Performance scales consistently with more labeled data, showing the substantial benefit of supervision.
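
To make the labeled-fraction setting concrete, here is a minimal sketch of restricting training to a fraction of the labeled data before fitting a model. It assumes a chronological prefix split. This README does not say how the labeled subset is drawn, and `labeled_prefix` is a hypothetical helper rather than a function from this repository.

```python
def labeled_prefix(features, labels, fraction):
    """Keep the earliest `fraction` of a labeled series.

    Assumption: the labeled subset is a chronological prefix.
    """
    if len(features) != len(labels):
        raise ValueError("features and labels must have the same length")
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    # Keep at least one sample so that very small fractions stay usable.
    n_keep = max(1, int(len(features) * fraction))
    return features[:n_keep], labels[:n_keep]


if __name__ == "__main__":
    # Toy series: 20 time steps with 2 features each; label 1 marks an anomaly.
    X = [[float(t), float(t % 3)] for t in range(20)]
    y = [1 if t in (1, 7, 15) else 0 for t in range(20)]

    X_small, y_small = labeled_prefix(X, y, fraction=0.10)
    print(len(X_small), sum(y_small))
```

The resulting subset could then be passed to any supervised classifier, such as an ExtraTrees model, while evaluation stays on the full test split.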

📝 Citation

If you use this code in your research, please cite our paper:

@article{zhong2025labels,
  title={Labels Matter More Than Models: Quantifying the Benefit of Supervised Time Series Anomaly Detection},
  author={Zhijie Zhong and Zhiwen Yu and Kaixiang Yang and C. L. Philip Chen},
  journal={arXiv Preprint arXiv:2511.16145},
  year={2025},
  pages={1--16},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  doi={10.48550/arXiv.2511.16145},
}

📄 License

This project is licensed under the MIT License.
