Without peeking into the future, Oversample your imbalanced dataset to balance it

Nikhil Nanda
3 min readAug 29, 2020

For those constrained by your project deadline, feel free to skip directly to the code

Getting high accuracy on the test set has always been the end goal, especially while tackling a classification problem. This attitude led me to a pretty decent accuracy on a multi-class classification problem. The imbalance in the dataset did not seem to decrease the accuracy so I had ignored the imbalance problem. However, on looking at the distribution of the predictions, it was clearly visible that the…

--

--