Update README.md
README.md CHANGED
@@ -37,8 +37,8 @@ This may be due to the base model being trained for emoji classification and the
 This model is better if emojis are to be also included for sentiment analysis.
 No Evaluation is done for data with only text and no emojis.
 
-The model was fine-tuned with dataset: mteb/tweet_sentiment_extraction from
-converted to
+The model was fine-tuned with the dataset: mteb/tweet_sentiment_extraction from Hugging Face
+converted to Hinglish text.
 
 The model has a test loss of 0.6 and an f1 score of 0.74 on the unseen data from the dataset.
 
@@ -78,4 +78,7 @@ Text: tu mujhe pasandh heh
 Negative: 0.01
 Neutral: 0.22
 Positive: 0.76
-```
+```
+Possible Future Direction:
+
+1. Pre-train the Hinglish model with Hindi, Hinglish, and English datasets. Current tokens for Hinglish are very short, i.e. mostly low-priority vocabulary entries are used.
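
The future-direction note in this diff says that Hinglish text is currently covered mostly by very short tokens. One way to make that concrete is to compare the average number of sub-word pieces per word for English and Hinglish input. The sketch below is only an illustration under stated assumptions: `greedy_tokenize`, `tokens_per_word`, and `TOY_VOCAB` are hypothetical names, the vocabulary is invented, and the greedy longest-match splitter stands in for the model's real tokenizer, which the diff does not name.

```python
def greedy_tokenize(word, vocab):
    """Split a word into the longest vocabulary pieces, left to right.

    Falls back to a single character when no longer piece matches.
    """
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        # Shrink the candidate until it is in the vocabulary or is one character long.
        while end > start + 1 and word[start:end] not in vocab:
            end -= 1
        pieces.append(word[start:end])
        start = end
    return pieces


def tokens_per_word(text, vocab):
    """Average number of pieces per whitespace-separated word."""
    words = text.lower().split()
    if not words:
        return 0.0
    total = sum(len(greedy_tokenize(w, vocab)) for w in words)
    return total / len(words)


# Hypothetical toy vocabulary (an assumption, not the model's vocabulary):
# whole English words, but only short fragments for Hinglish words.
TOY_VOCAB = {"i", "like", "you", "tu", "mu", "jh", "pa", "sa", "nd", "he"}

english_ratio = tokens_per_word("i like you", TOY_VOCAB)
hinglish_ratio = tokens_per_word("tu mujhe pasandh heh", TOY_VOCAB)

print(f"English tokens/word:  {english_ratio:.2f}")
print(f"Hinglish tokens/word: {hinglish_ratio:.2f}")
```

A higher ratio for the Hinglish sentence would indicate the fragmentation the note describes. To check the actual model, the same ratio could be computed with its own tokenizer in place of `greedy_tokenize`.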