SequentialLearning
/

SuperLinear

mixture-of-experts

Model card Files Files and versions

lirannoc commited on Sep 21

Commit

94a91a3

·

verified ·

1 Parent(s): 6263455

Upload README.md

Files changed (1) hide show

README.md +15 -11

README.md CHANGED Viewed

@@ -8,7 +8,7 @@ tags:
   - pytorch
   - fft
 model-index:
-  - name: Super-Linear
     results: []
 ---
@@ -42,18 +42,20 @@ import torch
 model = AutoModelForCausalLM.from_pretrained("SequentialLearning/SuperLinear", trust_remote_code=True)
 # Prepare input time series data
-# Shape: [batch_size, sequence_length, features]
-input_data = torch.randn(1, 512, 1)
 # Generate predictions
 with torch.no_grad():
-    outputs = model(inputs_embeds=input_data, pred_len=96)
-    predictions = outputs.logits  # Shape: [batch_size, prediction_length, features]
 ```
 ## Configuration
-Key configuration parameters:
 - `train_seq_len`: Training sequence length (default: 512)
 - `train_pred_len`: Training prediction length (default: 96)
@@ -62,18 +64,20 @@ Key configuration parameters:
 - `freq_experts`: Frequency-specific expert configuration
 - `moe_temp`: Temperature for expert selection during inference (default: 1)
-## Link to GitHub
-[https://github.com/azencot-group/SuperLinear](https://github.com/azencot-group/SuperLinear)
 ## Citation
 If you use SuperLinear in your research, please cite:
 ```bibtex
-@article{todo,
-  title={SuperLinear: todo},
-  author={Your Name},
   year={2025}
 }
 ```

   - pytorch
   - fft
 model-index:
+  - name: SuperLinear
     results: []
 ---
 model = AutoModelForCausalLM.from_pretrained("SequentialLearning/SuperLinear", trust_remote_code=True)
 # Prepare input time series data
+# Shape: [batch_size, channel, sequence_length] or [batch_size, sequence_length]
+input_data = torch.randn(1, 1, 512)
 # Generate predictions
 with torch.no_grad():
+    outputs = model(inputs_embeds=input_data, pred_len=96, get_prob = True)
+    preds = output.logits # Predicted values
+    probs = output.attentions  # Expert probabilities stored here
 ```
 ## Configuration
+Key parameters:
 - `train_seq_len`: Training sequence length (default: 512)
 - `train_pred_len`: Training prediction length (default: 96)
 - `freq_experts`: Frequency-specific expert configuration
 - `moe_temp`: Temperature for expert selection during inference (default: 1)
+## Links
+- **GitHub Repository**: [https://github.com/azencot-group/SuperLinear](https://github.com/azencot-group/SuperLinear)
+- **Paper**: [https://arxiv.org/abs/2509.15105](https://arxiv.org/abs/2509.15105)
 ## Citation
 If you use SuperLinear in your research, please cite:
 ```bibtex
+@article{nochumsohn2025super,
+  title={Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting},
+  author={Nochumsohn, Liran and Marshanski, Raz and Zisling, Hedi and Azencot, Omri},
+  journal={arXiv preprint arXiv:2509.15105},
   year={2025}
 }
 ```