Aaron Plocharczyk's project idea and work plan post

Upload data files

Create main.py

Create analysis_tools.py

Create an iterative loop for the program to run in

Allow user to type “Help” at any time to gain insight on what to do next

Allow user to specify file name

Allow user to access max/min/average review length for all types of sentiment reviews (or for a specific sentiment type) in the file

Allow user to access max/min/average word count for all types of sentiment reviews (or for a specific sentiment type) in the file

Allow user to access most common words in all types of sentiment reviews (or for a specific sentiment type) in the file

Allow user to access least common words in all types of sentiment reviews (or for a specific sentiment type) in the file

Allow user to access most correlated words to a specific sentiment type in the file

Allow user to set a “threshold” for correlation so that a term must occur at least n times before it is considered

Allow user to see correlation of word count/review length to a specific sentiment type in the file

Allow user to view terms that are unique to a certain file

Allow user to visualize results of these queries

I’ve decided to shift my project from sentiment analysis to a program that actually trains a machine learning model and tests it. Here is my new work plan:
By 6/14:

program asks user for a training file and creates a “trainingFile” object from it
By 6/15:

program counts total number of occurrences for each term

program calculates correlation of terms with a sentiment
By 6/19:

program prints and visualizes correlation and total number of occurrences for each term

program allows user to set a minimum threshold for total number of occurrences for each term

program allows user to delete particular terms
By 6/22:

program asks user for a testing file and creates a “testingFile” object from it

program tests current model on the test file and saves the results locally to refer to in the next iteration of the program

program prints a visual representation of the test results as compared to previous tests

program allows user to type “help” at any time to learn about what they should do

program repeats
Stretch Goals:

program allows user to save model in a comma separated format

program allows user to open saved models

program allows user to alter saved models

program allows user to test with saved models