* upload encoder checkpoints

Files changed (10)

.gitattributes CHANGED

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.jpg filter=lfs diff=lfs merge=lfs -text

.ipynb_checkpoints/README-checkpoint.md ADDED

+---
+license: apache-2.0
+---
+This repo provides the model weights released in the paper [Towards Neural Scaling Laws for Time Series Foundation Models](https://arxiv.org/abs/2410.12360).
+The models have varying sizes, ranging from 1M to 1B parameters, and were trained on datasets spanning from 10M to 16B time points.
+Code: https://github.com/Qingrenn/TSFM-ScalingLaws
+Dataset: https://huggingface.co/datasets/Qingren/TSFMScalingLaws
+<p align="center">
+  <img src="figures/tsfm-scalinglaws.jpg" width="100%">
+  <br />
+  <span>
+    Figure 1: Scaling laws for NLL in relation to model size, compute, and dataset size. The blue lines represent ID performance, while the red and green lines show OOD performance on the LSF and Monash subsets.
+  </span>
+</p>
+<p align="center">
+  <img src="figures/subjective_results.jpg" width="100%">
+  <br />
+  <span>
+    Figure 2: Prediction results of models with sizes 1B, 300M, 100M, and 10M.
+  </span>
+</p>

README.md CHANGED

@@ -1,3 +1,29 @@
 ---
 license: apache-2.0
 ---
+This repo provides the model weights released in the paper [Towards Neural Scaling Laws for Time Series Foundation Models](https://arxiv.org/abs/2410.12360).
+The models have varying sizes, ranging from 1M to 1B parameters, and were trained on datasets spanning from 10M to 16B time points.
+Code: https://github.com/Qingrenn/TSFM-ScalingLaws
+Dataset: https://huggingface.co/datasets/Qingren/TSFMScalingLaws
+<p align="center">
+  <img src="figures/tsfm-scalinglaws.jpg" width="100%">
+  <br />
+  <span>
+    Figure 1: Scaling laws for NLL in relation to model size, compute, and dataset size. The blue lines represent ID performance, while the red and green lines show OOD performance on the LSF and Monash subsets.
+  </span>
+</p>
+<p align="center">
+  <img src="figures/subjective_results.jpg" width="100%">
+  <br />
+  <span>
+    Figure 2: Prediction results of models with sizes 1B, 300M, 100M, and 10M.
+  </span>
+</p>

encoder_100M_16B.ckpt ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:bd8ee81c556a5cdb3e0d9b691d74bded014cf7b271d50a38f1addada85de553f
+size 511513426

encoder_10M_16B.ckpt ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:de04533da64352f98a829d2dd1297e38e779332267d49e2a883ecfa5d366f3f3
+size 57547814

encoder_1B_16B.ckpt ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0a171ccab4b8cda5c7164643e2ad822025f69343f654845c6050876a50267277
+size 4080900576

encoder_300M_16B.ckpt ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:2d349f71f56a65d5a1288b150308d971251a33112a1761e872c6c6c972befec3
+size 1212117408

encoder_30M_16B.ckpt ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a510df3e8ebf51d25181dc70d35b2372a2d7e96b4a94f2a285d4581e2dd1a0c6
+size 152441950
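
Each `.ckpt` entry above is a Git LFS pointer (spec version, `sha256` oid, byte size) rather than the checkpoint itself. The following is a minimal sketch, not part of this commit, of how a downloaded checkpoint could be checked against its pointer. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the text of a Git LFS pointer into its fields.

    Hypothetical helper: tolerates a leading '+' on each line so that
    pointer text copied straight from a diff can be pasted in unchanged.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip().lstrip("+").strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash both match the pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch means the download is incomplete.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte checkpoints are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for encoder_30M_16B.ckpt, copied from the diff above.
POINTER_30M = """
+version https://git-lfs.github.com/spec/v1
+oid sha256:a510df3e8ebf51d25181dc70d35b2372a2d7e96b4a94f2a285d4581e2dd1a0c6
+size 152441950
"""
```

Assuming the checkpoint was saved locally as `encoder_30M_16B.ckpt`, `verify_file("encoder_30M_16B.ckpt", parse_lfs_pointer(POINTER_30M))` should return `True` for an intact download. Comparing the size before hashing avoids reading a truncated file in full.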

figures/subjective_results.jpg ADDED

figures/tsfm-scalinglaws.jpg ADDED