Data Analytics – Section 1

To Do

1 2 3 5

Calculating Chi Square

Some of you are asking about the calculating the Chi Square statistic – take a look at slides 19 & 20 from the decision tree deck – you’ll see the formula and reasoning.  This is a good formula to know for the Exam!

Remember:

  • High statistic (low p-value) from chi-squared test means the groups are different

Assignment #9 – Association Rule Mining with R

Here is the assignment and an answer sheet to submit (in Word format).

Here is the data file you’ll need [Groceries.csv].

Another note: Make sure you’ve included ALL the attachments (check the assignment instructions) as separate files or you will not receive credit for the assignment. Do not send me a ZIP or RAR file! 

This assignment is due April 25, 2016.

Weekly Question

Leave your response as a comment on this post by the beginning of class on April 18, 2016. Remember, it only needs to be three or four sentences. For these weekly questions, I’m mainly interested in your opinions, not so much particular “facts” from the class!

Now that the course is just about over, think about what you’ve learned.

  • For you, what is the most important takeaway from the course?
  • How would you explain what you’ve learned to a future employer in a job interview?

Weekly Question #5

Leave your response as a comment on this post by the beginning of class on April 11, 2016. Remember, it only needs to be three or four sentences. For these weekly questions, I’m mainly interested in your opinions, not so much particular “facts” from the class! If you sign in using your AccessNet ID and password you won’t have to fill in the name, email and captcha fields when you leave your comment.

Answer one of the two questions below (not both):

  1. Name and describe a business question that you could answer using a decision tree. What data would you collect to perform the analysis? Don’t use an example we’ve covered in class.
  2. What advice would you give someone regarding how to select the right predictor variables for a decision tree analysis?
1 2 3 5