Spaces:
Running
Running
update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,69 @@ app_file: app.py
|
|
| 11 |
pinned: false
|
| 12 |
license: mit
|
| 13 |
short_description: space for VisionTSpp
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
pinned: false
|
| 12 |
license: mit
|
| 13 |
short_description: space for VisionTSpp
|
| 14 |
+
pipeline_tag: time-series-forecasting
|
| 15 |
---
|
| 16 |
|
| 17 |
+
# VisionTS++: Cross-Modal Time Series Foundation Model with Continual Pre-trained Visual Backbones
|
| 18 |
+
|
| 19 |
+
This repository hosts the **VisionTS++** model, a state-of-the-art time series foundation model based on continual pre-training of a visual Masked AutoEncoder (MAE) on large-scale time series data. It excels in multivariate and probabilistic time series forecasting by bridging modality gaps between vision and time series data.
|
| 20 |
+
|
| 21 |
+
The model was introduced in the paper:
|
| 22 |
+
[**VisionTS++: Cross-Modal Time Series Foundation Model with Continual Pre-trained Vision Backbones**](https://arxiv.org/abs/2508.04379)
|
| 23 |
+
|
| 24 |
+
Official GitHub repository: [https://github.com/HALF111/VisionTSpp](https://github.com/HALF111/VisionTSpp)
|
| 25 |
+
|
| 26 |
+
Experience **VisionTS++** directly in your browser on this [Hugging Face Space](https://huggingface.co/spaces/Lefei/VisionTSpp)! You can upload your own custom time series CSV file for zero-shot forecasting.
|
| 27 |
+
|
| 28 |
+
## About
|
| 29 |
+
VisionTS++ is built upon continual pre-training of a vision model on large-scale time series, addressing key discrepancies in cross-modal transfer from vision to time series. It introduces three key innovations:
|
| 30 |
+
|
| 31 |
+
1. **Vision-model-based filtering**: Identifies high-quality sequences to stabilize pre-training and mitigate the data-modality gap.
|
| 32 |
+
2. **Colorized multivariate conversion**: Encodes multivariate series as multi-subfigure RGB images to enhance cross-variate modeling.
|
| 33 |
+
3. **Multi-quantile forecasting**: Uses parallel reconstruction heads to generate quantile forecasts for probabilistic predictions without parametric assumptions.
|
| 34 |
+
|
| 35 |
+
These innovations allow VisionTS++ to achieve state-of-the-art performance in both in-distribution and out-of-distribution forecasting, demonstrating that vision models can effectively generalize to Time Series Forecasting with appropriate adaptation.
|
| 36 |
+
|
| 37 |
+
|
| 38 |
+
## Installation
|
| 39 |
+
|
| 40 |
+
The VisionTS++ model is available through the `visionts` package on PyPI.
|
| 41 |
+
|
| 42 |
+
First, install the package:
|
| 43 |
+
|
| 44 |
+
```shell
|
| 45 |
+
pip install visionts
|
| 46 |
+
```
|
| 47 |
+
|
| 48 |
+
If you want to develop the inference code, you can also build from source:
|
| 49 |
+
|
| 50 |
+
```shell
|
| 51 |
+
git clone https://github.com/HALF111/VisionTSpp.git
|
| 52 |
+
cd VisionTSpp
|
| 53 |
+
pip install -e .
|
| 54 |
+
```
|
| 55 |
+
|
| 56 |
+
For detailed inference examples and usage with clear visualizations of image reconstruction, please refer to the `demo.ipynb` notebook in the [official GitHub repository](https://github.com/HALF111/VisionTSpp/blob/main/demo.ipynb).
|
| 57 |
+
|
| 58 |
+
## Citation
|
| 59 |
+
If you're using VisionTS++ or VisionTS in your research or applications, please cite them using this BibTeX:
|
| 60 |
+
|
| 61 |
+
```bibtex
|
| 62 |
+
@misc{chen2024visionts,
|
| 63 |
+
title={VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters},
|
| 64 |
+
author={Mouxiang Chen and Lefei Shen and Zhuo Li and Xiaoyun Joy Wang and Jianling Sun and Chenghao Liu},
|
| 65 |
+
year={2024},
|
| 66 |
+
eprint={2408.17253},
|
| 67 |
+
archivePrefix={arXiv},
|
| 68 |
+
url={https://arxiv.org/abs/2408.17253},
|
| 69 |
+
}
|
| 70 |
+
@misc{shen2025visiontspp,
|
| 71 |
+
title={VisionTS++: Cross-Modal Time Series Foundation Model with Continual Pre-trained Visual Backbones},
|
| 72 |
+
author={Lefei Shen and Mouxiang Chen and Xu Liu and Han Fu and Xiaoxue Ren and Jianling Sun and Zhuo Li and Chenghao Liu},
|
| 73 |
+
year={2025},
|
| 74 |
+
eprint={2508.04379},
|
| 75 |
+
archivePrefix={arXiv},
|
| 76 |
+
primaryClass={cs.CV},
|
| 77 |
+
url={https://arxiv.org/abs/2508.04379},
|
| 78 |
+
}
|
| 79 |
+
```
|