Data Analytics – Section 1

Assignments

Assignment #9 – Association Rule Mining with R

Here is the assignment and an answer sheet to submit (in Word format).

Here is the data file you’ll need [Groceries.csv].

Another note: Make sure you’ve included ALL the attachments (check the assignment instructions) as separate files or you will not receive credit for the assignment. Do not send me a ZIP or RAR file! 

This assignment is due April 25, 2016.

Assignment #7 – Decision Trees in R

Here is the assignment and Assignment #7 – Decision Trees in R ANSWER SHEET (in Word format).

Here is the data file you’ll need [BankLoan.csv].

Note: If you try to open this file in Excel, you’ll get two error dialogs. The file is fine. Just click “Yes” and “OK” and the file will open.

Another note: Make sure you’ve included ALL the attachments (check the assignment instructions) as separate files or you will not receive credit for the assignment. Please do not send me a ZIP or RAR file! 

This assignment is due on 4/11 by the start of class.

Setting up R & R Studio

R is a widely-used, open source statistical analysis platform. RStudio is an integrated development environment for R – that means it makes using R easier!

You should install both software packages – R and RStudio! Don’t just install R or your life will be difficult!

We’ll be using this software to do some analytics! You can get a full copy of the software – PC or Mac – for free!

First download and install R:

  1. If you have Windows
    1. Download the installer file for R (this is a direct link to the install file).
    2. Make sure you keep track of where you save the installer.
    3. Double-click on the installer file and follow the instructions (just accept the default settings).
  2. If you have a Mac
    1. Download “R-3.2.1-snowleopard.pkg” if you have Mac OS X 10.6 or “R-3.2.4.pkg” if you have Mac OS X 10.9 or higher.
    2. Make sure you keep track of where you save the installer.
    3. Do whatever it is you “Mac people” do to install software! (I don’t have a Mac!)

Now download and install RStudio:

  1. Download the appropriate installer: Windows or Mac.
  2. Make sure you keep track of where you save the installer.
  3. Install the software! Just accept the default settings.

After both are installed, you’re always going to run RStudio, which will use R behind the scenes to give you a pleasing analytics experience!

Assignment 4 – ETL – Due March 11

Here’s the information you need to complete Assignment #4.  This assignment will walk you through the process of ETL in Excel.  You will learn some basic Excel formulas that will be helpful for the future as well!  Simply work through the instructions in the Word document and as always, let me know if you have questions!

Follow the instructions for submission as well – just email me your completed Excel workbook by the morning of 3/11.

Word Doc: Assignment #4 – ETL

Excel Workbook: ETL Workbook