indobert-large-p2_preprocessing_tuning

Browse files

Files changed (7) hide show

README.md +14 -17
config.json +2 -2
final_model/config.json +2 -2
final_model/model.safetensors +1 -1
final_model/training_args.bin +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [indobenchmark/indobert-large-p2](https://huggingface.co/indobenchmark/indobert-large-p2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2080
-- Accuracy: 0.7727
-- Precision: 0.7824
-- Recall: 0.7786
-- F1: 0.7792
 ## Model description
@@ -44,12 +44,11 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 4.574921995691458e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
-- optimizer: Use OptimizerNames.ADAFACTOR and the args are:
-No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 20
@@ -57,15 +56,13 @@ No additional optimizer arguments
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.8626        | 1.0   | 111  | 0.7792          | 0.7455   | 0.7639    | 0.7500 | 0.7359 |
-| 0.3943        | 2.0   | 222  | 0.8798          | 0.7364   | 0.7480    | 0.7483 | 0.7394 |
-| 0.1861        | 3.0   | 333  | 1.0242          | 0.7636   | 0.7808    | 0.7597 | 0.7672 |
-| 0.0715        | 4.0   | 444  | 1.2080          | 0.7727   | 0.7824    | 0.7786 | 0.7792 |
-| 0.0283        | 5.0   | 555  | 1.4991          | 0.7591   | 0.7709    | 0.7658 | 0.7638 |
-| 0.0248        | 6.0   | 666  | 1.5037          | 0.7591   | 0.7721    | 0.7576 | 0.7612 |
-| 0.0071        | 7.0   | 777  | 1.6257          | 0.7659   | 0.7676    | 0.7760 | 0.7696 |
-| 0.0024        | 8.0   | 888  | 1.7610          | 0.7614   | 0.7609    | 0.7723 | 0.7644 |
-| 0.0022        | 9.0   | 999  | 1.8253          | 0.7568   | 0.7563    | 0.7687 | 0.7597 |
 ### Framework versions

 This model is a fine-tuned version of [indobenchmark/indobert-large-p2](https://huggingface.co/indobenchmark/indobert-large-p2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6673
+- Accuracy: 0.7841
+- Precision: 0.7920
+- Recall: 0.7918
+- F1: 0.7901
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2.3352320097915953e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 20
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 1.2207        | 1.0   | 111  | 0.7383          | 0.7409   | 0.7463    | 0.7491 | 0.7435 |
+| 0.6702        | 2.0   | 222  | 0.6673          | 0.7841   | 0.7920    | 0.7918 | 0.7901 |
+| 0.4953        | 3.0   | 333  | 0.7161          | 0.7636   | 0.7707    | 0.7722 | 0.7711 |
+| 0.3754        | 4.0   | 444  | 0.8318          | 0.75     | 0.7552    | 0.7657 | 0.7569 |
+| 0.2769        | 5.0   | 555  | 0.8916          | 0.7591   | 0.7587    | 0.7732 | 0.7642 |
+| 0.2039        | 6.0   | 666  | 0.9693          | 0.7432   | 0.7533    | 0.7589 | 0.7524 |
+| 0.1525        | 7.0   | 777  | 1.0838          | 0.7477   | 0.7431    | 0.7610 | 0.7471 |
 ### Framework versions

config.json CHANGED Viewed

@@ -3,11 +3,11 @@
   "architectures": [
     "BertForSequenceClassification"
   ],
-  "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,
   "directionality": "bidi",
   "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

   "architectures": [
     "BertForSequenceClassification"
   ],
+  "attention_probs_dropout_prob": 0.3,
   "classifier_dropout": null,
   "directionality": "bidi",
   "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.3,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

final_model/config.json CHANGED Viewed

@@ -3,11 +3,11 @@
   "architectures": [
     "BertForSequenceClassification"
   ],
-  "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,
   "directionality": "bidi",
   "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

   "architectures": [
     "BertForSequenceClassification"
   ],
+  "attention_probs_dropout_prob": 0.3,
   "classifier_dropout": null,
   "directionality": "bidi",
   "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.3,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

final_model/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2ec3179bcc7bf1ad095411bdb9e8dfc46ad1e885175627d9e9ab1d8b753795fc
 size 1340635060

 version https://git-lfs.github.com/spec/v1
+oid sha256:12dce05ada3ba7f7a9b691e2a5449f92abdba7faedc68fbdff9a4b789cf0775c
 size 1340635060

final_model/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b4c6f1017d41aac00d808abc885e3baf9c6ea1fdda0db35908bdd2a827ba3b10
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:564cc7aa93f14e1bb96404a32c2266f2447c3039a96fa866935213511d4991a0
 size 5304

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2ec3179bcc7bf1ad095411bdb9e8dfc46ad1e885175627d9e9ab1d8b753795fc
 size 1340635060

 version https://git-lfs.github.com/spec/v1
+oid sha256:12dce05ada3ba7f7a9b691e2a5449f92abdba7faedc68fbdff9a4b789cf0775c
 size 1340635060

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b4c6f1017d41aac00d808abc885e3baf9c6ea1fdda0db35908bdd2a827ba3b10
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:564cc7aa93f14e1bb96404a32c2266f2447c3039a96fa866935213511d4991a0
 size 5304