i was playing around with lstm networks for sequence classification. The dataset is "tweet_eval" (on huggingface) .
I made a small lstm network but no matter how i tune the hyperparameters the test accuracy isn't improving. Its just moving up and down in the range of (55-45%). Since the training dataset is about 45K samples big i think the model should be able to generalize to the test data, which isn't happening. Right after the start of the training the accuracy is around 50% and its about the same even in like 20 epochs.
here's the networks code: