* upload encoder checkpoints
Browse files- .gitattributes +1 -0
- .ipynb_checkpoints/README-checkpoint.md +29 -0
- README.md +26 -0
- encoder_100M_16B.ckpt +3 -0
- encoder_10M_16B.ckpt +3 -0
- encoder_1B_16B.ckpt +3 -0
- encoder_300M_16B.ckpt +3 -0
- encoder_30M_16B.ckpt +3 -0
- figures/subjective_results.jpg +3 -0
- figures/tsfm-scalinglaws.jpg +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.jpg filter=lfs diff=lfs merge=lfs -text
|
.ipynb_checkpoints/README-checkpoint.md
ADDED
|
@@ -0,0 +1,29 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
This repo provides the model weights released in the paper [Towards Neural Scaling Laws for Time Series Foundation Models](https://arxiv.org/abs/2410.12360).
|
| 6 |
+
|
| 7 |
+
The models have varying sizes, ranging from 1M to 1B parameters, and were trained on datasets spanning from 10M to 16B time points.
|
| 8 |
+
|
| 9 |
+
Code: https://github.com/Qingrenn/TSFM-ScalingLaws
|
| 10 |
+
|
| 11 |
+
Dataset: https://huggingface.co/datasets/Qingren/TSFMScalingLaws
|
| 12 |
+
|
| 13 |
+
<p align="center">
|
| 14 |
+
<img src="figures/tsfm-scalinglaws.jpg" width="100%">
|
| 15 |
+
<br />
|
| 16 |
+
<span>
|
| 17 |
+
Figure1: Scaling laws for NLL in relation to model size, compute, and dataset size. The blue lines represent ID performance, while the red and green lines show OOD performance on LSF subset and Monash subset.
|
| 18 |
+
</span>
|
| 19 |
+
</p>
|
| 20 |
+
|
| 21 |
+
<p align="center">
|
| 22 |
+
<img src="figures/subjective_results.jpg" width="100%">
|
| 23 |
+
<br />
|
| 24 |
+
<span>
|
| 25 |
+
Figure2: Prediction results of models with sizes 1B, 300M, 100M, and 10M.
|
| 26 |
+
</span>
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
|
README.md
CHANGED
|
@@ -1,3 +1,29 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
+
|
| 5 |
+
This repo provides the model weights released in the paper [Towards Neural Scaling Laws for Time Series Foundation Models](https://arxiv.org/abs/2410.12360).
|
| 6 |
+
|
| 7 |
+
The models have varying sizes, ranging from 1M to 1B parameters, and were trained on datasets spanning from 10M to 16B time points.
|
| 8 |
+
|
| 9 |
+
Code: https://github.com/Qingrenn/TSFM-ScalingLaws
|
| 10 |
+
|
| 11 |
+
Dataset: https://huggingface.co/datasets/Qingren/TSFMScalingLaws
|
| 12 |
+
|
| 13 |
+
<p align="center">
|
| 14 |
+
<img src="figures/tsfm-scalinglaws.jpg" width="100%">
|
| 15 |
+
<br />
|
| 16 |
+
<span>
|
| 17 |
+
Figure1: Scaling laws for NLL in relation to model size, compute, and dataset size. The blue lines represent ID performance, while the red and green lines show OOD performance on LSF subset and Monash subset.
|
| 18 |
+
</span>
|
| 19 |
+
</p>
|
| 20 |
+
|
| 21 |
+
<p align="center">
|
| 22 |
+
<img src="figures/subjective_results.jpg" width="100%">
|
| 23 |
+
<br />
|
| 24 |
+
<span>
|
| 25 |
+
Figure2: Prediction results of models with sizes 1B, 300M, 100M, and 10M.
|
| 26 |
+
</span>
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
|
encoder_100M_16B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bd8ee81c556a5cdb3e0d9b691d74bded014cf7b271d50a38f1addada85de553f
|
| 3 |
+
size 511513426
|
encoder_10M_16B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:de04533da64352f98a829d2dd1297e38e779332267d49e2a883ecfa5d366f3f3
|
| 3 |
+
size 57547814
|
encoder_1B_16B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0a171ccab4b8cda5c7164643e2ad822025f69343f654845c6050876a50267277
|
| 3 |
+
size 4080900576
|
encoder_300M_16B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d349f71f56a65d5a1288b150308d971251a33112a1761e872c6c6c972befec3
|
| 3 |
+
size 1212117408
|
encoder_30M_16B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a510df3e8ebf51d25181dc70d35b2372a2d7e96b4a94f2a285d4581e2dd1a0c6
|
| 3 |
+
size 152441950
|
figures/subjective_results.jpg
ADDED
|
Git LFS Details
|
figures/tsfm-scalinglaws.jpg
ADDED
|
Git LFS Details
|