In index.js, txtLower, txtWords, and txtNoPunct seem to be used with no rhyme or reason. Some explanation as to why they are used would be good, but you should probably just standardize on 1 variable.
Also, txtNoPunct could be generated much more efficiently using regex.