Suggestions

Hi Samprity,


This is a very good wiki page: you present a very interesting hypothesis and have designed the experiment well. It is a meaningful project that helps people decide whether a movie is worth spending money and time on.

My only suggestion is that, in the hypothesis section, it would be better to specify how you are going to examine the accuracy of the improved Naive Bayes algorithm. I also have one question about the experiment results. Furious Seven and Deadpool both have 8 stars, but they get totally different predictions. What might cause this? Is it because movie reviews are not written very formally? Or might there be slang specific to particular movies that your classifier failed to learn?


Best regards,

Jiahong Chen

JiahongChen (talk) 04:07, 21 April 2016

Thanks for the review! I will include how I measured the accuracy in the hypothesis.
Regarding the Fast and Furious 7 review, there could be some words that the classifier failed to learn. The Fast and Furious 7 review probably has fewer words that are similar to those in the training dataset of positive reviews. As TianQiChen has mentioned, the tool gave "extremely entertaining" a 58% positive score but "7" an 81% negative score.
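
To show how a single token such as "7" could pull a prediction toward negative, here is a minimal sketch. It assumes a word-level multinomial Naive Bayes with add-one smoothing; the training sentences and the names `fit` and `prob_positive` are made up for illustration and are not the project's actual classifier or data.

```python
import math
from collections import Counter

# Toy training data, invented purely for illustration (not the project's dataset).
train = [
    ("extremely entertaining and fun", "pos"),
    ("entertaining story great cast", "pos"),
    ("boring plot 7 sequels too many", "neg"),
    ("part 7 is a mess", "neg"),
]

def fit(examples):
    """Count word occurrences per class and the number of documents per class."""
    word_counts = {"pos": Counter(), "neg": Counter()}
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = set(word_counts["pos"]) | set(word_counts["neg"])
    return word_counts, doc_counts, vocab

def prob_positive(text, model):
    """Return P(pos | text) under Naive Bayes with add-one (Laplace) smoothing."""
    word_counts, doc_counts, vocab = model
    n_docs = sum(doc_counts.values())
    log_scores = {}
    for label in ("pos", "neg"):
        total_words = sum(word_counts[label].values())
        # Start from the log prior of the class.
        score = math.log(doc_counts[label] / n_docs)
        for word in text.lower().split():
            if word not in vocab:
                continue  # words never seen in training carry no evidence
            # Add the smoothed log likelihood of the word given the class.
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocab))
            )
        log_scores[label] = score
    # Normalize the two log scores into a probability.
    top = max(log_scores.values())
    weights = {k: math.exp(v - top) for k, v in log_scores.items()}
    return weights["pos"] / (weights["pos"] + weights["neg"])

model = fit(train)
for phrase in ("extremely entertaining", "7", "extremely entertaining 7"):
    print(f"{phrase!r}: P(pos) = {prob_positive(phrase, model):.2f}")
```

In this toy setup "7" only ever appears in negative training reviews, so it scores as negative on its own and lowers the positive score of any review that contains it, even though it is just part of the movie title.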

SamprityKashyap (talk) 04:30, 21 April 2016