Comp 6751 Project 4: Extending a Feature Grammar solution

$29.99

Category:

Description

5/5 - (3 votes)

Goal: extend your feature grammar for sentiment analysis to cover more complex cases. Compare resulting sentiment analysis to a baseline system.
Data: you should be guided by examples in the NLTK data > sentence polarity > rt-polarity.neg
You may simplify data samples to focus on the topic of the assignment, as for Project 3, but for Project
4, you should attempt more complex, longer sentences and also paragraphs.
Description: You are to extend your feature grammar for NLTK’s feature based Earley’s Chart parser
(parser class “FeatureEarleyChartParser”).
You are trying to develop the best grammar you can: wider coverage, less acceptance of ungrammatical
information, and you are trying to connect sentiment and discourse relation information.
Required coverage
1. complex declarative sentences with a sentiment-bearing words
2. conjunction (and, or, but) of sentiment bearing adjectives or nouns (I saw a dull and scary movie,
It was a mess and a hazard)
3. conjunction (and, or, but) of sentiment bearing sentences (It was too long but entertaining)
4. negation (of text including sentiment bearing words)
5. small paragraphs of text that contain sentiment words
Data: you may continue to use the polarity dataset of Project 3 as inspiration, or you may consult the
movie reviews dataset in NLTK.
SSAP baseline: run the aFinn Simplest Sentiment Analysis in Python
• download the aFinn sentiment lexicon (https://www2.imm.dtu.dk/pubdb/views/publication_details.
php?id=6010)
• get the SSAP to work (https://finnaarupnielsen.wordpress.com/2011/06/20/simplest-sentimentanalysis-in-python-with-af/)
• use your sentences for Project 3 and 4 and record the performance of SSAP
• record the performance of SSAP as a baseline and compare the performance of your Project 4 in
both tabular form and with error analysis on 10 examples of different complexity
1
Deliverables:
Create a file Good with your training sentences that your grammar parses and labels correctly.
Create a file False with your training sentences that your grammar does not parse or label correctly.
• 1 file: your well-annotated grammar (2pts, Grad Attr. 4,5,6)
• 1 file: Good with annotations (2pts, Grad Attr. 4,5,6)
• 1 file: False with annotations (2pts, Grad Attr. 4,5,6)
• 1 file: A report that includes:
– one page page explaining and critiquing your grammar design for complex declarative sentences
(2pts, Grad Attr. 1, 6)
– one page page explaining and critiquing your grammar design for conjunction (and, or, but)
of sentiment bearing adjectives or nouns (I saw a dull and scary movie, It was a mess and a
hazard) with and without negation (2pts, Grad Attr. 1, 6)
– one page page explaining and critiquing your grammar design for conjunction (and, or, but) of
sentiment bearing sentences (It was too long but entertaining) with and without negation (2pts,
Grad Attr. 1, 6)
– one page comparing SSAP and your grammar-based sentiment analysis of small paragraphs of
text that contain sentiment words (2pt, Grad Attr. 1, 6)
Note on grading: if your Project 4 is better than your Project 3, it will count for both projects
2