This project includes implementation of Text classification from scratch and comparison with sklearn Naive Bayes Classifier.
Link : https://archive.ics.uci.edu/ml/machine-learning-databases/20newsgroups-mld/ For larger Dataset : 20_newsgroups.tar.gz For smaller Dataset : mini_newsgroups.tar.gz
if .ipynb file doesn't load, try https://nbviewer.jupyter.org/