Regression, Logistic Regression and Maximum Entropy part 2 (code + examples)

Posted on 4 CommentsPosted in Classification, Sentiment Analytics

Introduction In the previous blog we have seen the theory and mathematics behind the Maximum Entropy and Logistic Regression Classifiers. Logistic Regression is one of the most powerful classification methods within machine learning and can be used for a wide variety of tasks. Think of pre-policing or predictive analytics in health; it can be used to aid […]

Regression, Logistic Regression and Maximum Entropy

Posted on 4 CommentsPosted in Classification, Machine Learning, Sentiment Analytics

1. Introduction One of the most important tasks in Machine Learning are the Classification tasks (a.k.a. supervised machine learning). Classification is used to make an accurate prediction of the class of entries in a test set (a dataset of which the entries have not yet been labelled) with the model which was constructed from a training […]

Sentiment Analysis with the Naive Bayes Classifier

Posted on 6 CommentsPosted in Machine Learning, Sentiment Analytics

From the introductionary blog we know that the Naive Bayes Classifier is based on the bag-of-words model. With the bag-of-words model we check which word of the text-document appears in a positive-words-list or a negative-words-list. If the word appears in a positive-words-list the total score of the text is updated with +1 and vice versa. […]

Sentiment Analysis with bag-of-words

Posted on 4 CommentsPosted in Machine Learning, Sentiment Analytics

Introduction: In my previous post I have explained the Theory behind three of the most popular Text Classification methods (Naive Bayes, Maximum Entropy and Support Vector Machines) and told you that I will use these Classifiers for the automatic classification of the subjectivity of Amazon.com book reviews. The purpose is to get a better understanding of […]

Visualizing Data

Posted on Leave a commentPosted in Visualizations

We all know that visualizing data is an important part of Data Science. If it is done wrong, it can be boring not grabbing the attention of the readers, or even worse; convey the wrong message. If it done correctly, it can intrigue even the most indifferent reader (some people can even turn Data Visualizations into […]

Text Classification and Sentiment Analysis

Posted on 9 CommentsPosted in Machine Learning, Sentiment Analytics

Introduction: Natural Language Processing (NLP) is a vast area of Computer Science that is concerned with the interaction between Computers and Human Language[1]. Within NLP many tasks are – or can be reformulated as – classification tasks. In classification tasks we are trying to produce a classification function which can give the correlation between a […]