Time-series classification
Time series classification is a supervised machine learning technique used to assign a predefined category or label to an entire sequence of time-ordered data points, rather than predicting a future value. It involves training a model on labeled examples of time series data and then using that model to classify new, unseen time series sequences into their correct classes, which is useful in applications like medical diagnosis, human activity recognition, and sensor data analysis.
Time series classification assigns labels to temporal sequences — ECG diagnosis, activity recognition, industrial fault detection. ROCKET (random convolution features) and InceptionTime dominate benchmarks on the UCR/UEA archive, while foundation models like Chronos are beginning to enable zero-shot classification.
History
UCR Time Series Archive established — becomes the standard evaluation suite
DTW (Dynamic Time Warping) + 1-NN shown to be a surprisingly strong baseline
BOSS (Bag of SFA Symbols) introduces dictionary-based approaches for TSC
COTE/HIVE-COTE ensemble combines multiple distance and feature-based classifiers
InceptionTime applies deep CNN with inception modules, matching HIVE-COTE
ROCKET generates 10K random convolution features — fast, accurate, and scalable
HIVE-COTE 2.0 becomes the most accurate single classifier on the UCR archive
MultiROCKET extends ROCKET with multi-scale random convolutions
QUANT and HYDRA push random feature methods further
Time series foundation models applied to classification via embedding extraction
How Time-series classification Works
Time Series Representation
Raw temporal data (univariate or multivariate) is optionally transformed — z-normalization, resampling, or feature extraction (ROCKET random convolutions, SFA words).
Feature Extraction
ROCKET: apply 10K random convolutional kernels, extract max value and proportion of positive values per kernel. Alternatively, deep networks (InceptionTime) learn features end-to-end.
Classification
For ROCKET: a simple ridge classifier or linear SVM on the random features. For deep learning: global average pooling followed by a dense classification layer.
Ensemble (optional)
HIVE-COTE combines multiple representation-specific classifiers (distance, dictionary, interval, shapelet) via weighted voting.
Current Landscape
Time series classification in 2025 has mature, well-benchmarked solutions. ROCKET-family methods offer the best speed-accuracy tradeoff, training in minutes on CPU while matching or exceeding deep learning. HIVE-COTE 2.0 is the accuracy champion but costly. InceptionTime is the go-to deep learning approach. The emerging disruption is foundation model embeddings — using Chronos or TimesFM to extract features from time series, then training a simple classifier on top, which works surprisingly well with few labeled examples.
Key Challenges
Variable-length sequences require alignment or padding strategies that can distort temporal patterns
Class imbalance — many real-world TSC tasks have rare but important classes (fault detection, rare diseases)
Multivariate complexity — correlations across channels in multivariate time series are hard to capture effectively
Interpretability — explaining why a time series was classified a certain way is important for medical and industrial applications
Scalability — HIVE-COTE is highly accurate but computationally expensive; ROCKET scales much better
Quick Recommendations
Fast and accurate baseline
ROCKET / MultiROCKET
Best accuracy-speed tradeoff — trains in minutes, competitive with deep learning
Maximum accuracy
HIVE-COTE 2.0
Most accurate on the UCR archive, but slower to train
Deep learning approach
InceptionTime
Best deep learning baseline, GPU-friendly, handles multivariate naturally
Transfer learning / few-shot
Chronos/TimesFM embeddings + classifier
Foundation model features enable classification with very few labeled examples
What's Next
The frontier is few-shot and zero-shot time series classification using foundation models, enabling classification on new domains without domain-specific training data. Expect advances in multivariate TSC (EEG, sensor arrays), interpretable classification (which temporal patterns drive the decision), and real-time streaming classification for edge deployment.
Benchmarks & SOTA
Related Tasks
Get notified when these results update
New models drop weekly. We track them so you don't have to.
Something wrong or missing?
Help keep Time-series classification benchmarks accurate. Report outdated results, missing benchmarks, or errors.