Question 1
As we get the wiki articles we want to calculate sentiment of each one. One easy NLP li
ary is TextBlob. TextBlob contains a basic polarity method rating article from -1 to 1. Here is a basic example of how to use it
#pip install TextBlo
from textblob import TextBlo
tb=TextBlob(text)
print(tb.sentiment.polarity)
For the Assignment Calculate the Overall Polarity for at least 5 different pages on wikipedia. Try to find them in different genres (politics, history, science, cele
ity, etc) Report on their totals.Â
Â
Try to construct your own very positive sentences and your own very negative sentences (try 2-3 of each). Report how it does.Â
Question 2
Project Description
Issue: Phishing attempts : Analysis of phishing emails vs non-phishing emails
Do an analysis of the issue using textual analysis algorithms. You will need to collect some data for the issue so that you can do the analysis.
Analysis techniques to choose from:
· Word Distribution / Frequency / N-Grams
· Part-of-Speech
· Named Recognition
· Sentiment Analysis
· Text Classification
· Topic Modeling
· Sentence Structure
Submission:
Part A (10%):Â One page describing the issue you want to address and hypothesis of what you think the analysis might show
Part B (90%): All code and 3 – 4 page conference paper describing the results on the analysis