MIS 0855: Data Science Fall 2018

Section 004, Instructor: Larry Dignan

Reading Quiz #9: Complete by Nov. 26

Some quick instructions:

  • You must complete the quiz by the start of class on Nov. 26, 2018.
  • When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to sign in. It will then take you to the quiz.
    If it says you don’t have access, make sure you’re signed out of your regular Gmail (non-TUMail) account!
  • You can only do the quiz once. If you submit multiple times, I’ll only use the first (oldest) one.
  • This is “open book” – you can use the articles to answer the questions – but do not get help from anyone else.

Ready? Take the quiz by clicking this link.

Reading Quiz #8: Complete by Nov. 12

Some quick instructions:

  • You must complete the quiz by the start of class on Nov. 12, 2018.
  • When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to sign in. It will then take you to the quiz.
    If it says you don’t have access, make sure you’re signed out of your regular Gmail (non-TUMail) account!
  • You can only do the quiz once. If you submit multiple times, I’ll only use the first (oldest) one.
  • This is “open book” – you can use the articles to answer the questions – but do not get help from anyone else.

Ready? Take the quiz by clicking this link.

Your reading for Modules 11.1, 11.2

A note about that first reading: It’s a bit dated and Hadoop has advanced since that article. Much of the focus in the open source community has been on side projects tied to Hadoop. One common theme is that analytics and better user interfaces are being layered onto Hadoop. Most companies would use Hadoop via companies like Cloudera and Hortonworks. These companies package Hadoop and sell services and support. To see what I mean re Hadoop and its other projects see the primary Apache page. For our purposes, we’ll keep Hadoop high level, but in the data science department, internship interviews etc you may want to know about projects like Hive, Cassandra, Pig and Spark.

Assignment 3: Group Project: Due Dec. 7 at 3 p.m.; Presentations Dec. 10

Here are the assignment instructions.  Groups MUST be 4 to 5 members.  You may not do this assignment on your own or in smaller groups than 5.

Input your teams into this Google Doc.

Note that the date on the assignment is incorrect.

Once we form groups Nov. 5 the following deadlines will apply:

Nov. 12 (end of class): Need your idea you’ll examine in the assignment for approval.

Nov. 27: Need a note that your group has met and set individual deliverables for the group.

For these interim deadlines all I need is an email from each group leader detailing the team, the topic and rough plan. The main goal is to account for idea changes (many of you will course correct after exploring the data). I’m here to help you focus, refine, find sources etc.

The assignment is due Dec. 7 at 3 p.m. We’ll do the presentations Monday, Dec. 10.