
Lawrence Dignan
Weekly Question #9: Complete by April 10, 2017
Leave your response as a comment on this post by the beginning of class on April 10, 2017.
Leave a post about your group project:
- What is the subject of your group project?
- Which of your fellow scholars are in your group?
Only one group member has to make the post (only one post per group), but for your other group members to get credit they need to be mentioned in the post.
Weekly Question #8: Complete by April 10, 2017
Leave your response as a comment on this post by the beginning of class on April 10, 2017. Remember, it only needs to be three or four sentences. For these weekly questions, I’m mainly interested in your opinions, not so much particular “facts” from the class!
Here is the question:
Once again, find another online article dated within the last two weeks from a credible source that has something to do with data and is interesting and relevant to you. Copy and paste the URL directly into your response, followed by a few sentences that explain what is interesting about it.
Your reading for April 10
A note about that first reading: It’s a bit dated, and Hadoop has advanced since that article was written. Much of the focus in the open source community has been on side projects tied to Hadoop. One common theme is that analytics and better user interfaces are being layered onto Hadoop. Most companies use Hadoop through vendors like Cloudera and Hortonworks, which package Hadoop and sell services and support. To see what I mean about Hadoop and its related projects, see the primary Apache page. For our purposes, we’ll keep Hadoop high level, but in the data science department, internship interviews, etc., you may want to know about projects like Hive, Cassandra, Pig and Spark.
In-Class Exercise 11.1: Creating a Database
Here is the exercise.
Good read on building teams, data science with other disciplines
This NYT article on how Google researched teams and their effectiveness is worth a read.
Study Guide for Exam 2, and exam prep office hours
Here is the study guide for the second exam. Nathan will hold an exam review from 8:30 a.m. to 10:00 a.m. on Friday (03/31/2017) in breakout room Alter 236C.
Assignment 4: Final (Group) Project: Due Friday, April 28 at 3 p.m.; Presentations May 1
Here are the assignment instructions. Groups MUST be 4 to 5 members. You may not do this assignment on your own or in groups of fewer than 4. Note that the date on the assignment is incorrect.
The assignment is due April 28, 2017 at 3 p.m. We’ll do the presentations Monday, May 1.
In-Class Exercise 9.2: Creating Interactive Dashboards
Here is the exercise.
And here is the Excel workbook you’ll need [Pew Story Data (Jan – May 2012).xlsx]
Reading Quiz #7: Complete by March 27, 2017
Some quick instructions:
- You must complete the quiz by the start of class on March 27, 2017.
- When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to sign in. It will then take you to the quiz.
  If it says you don’t have access, make sure you’re signed out of your regular Gmail (non-TUMail) account!
- You can only do the quiz once. If you submit multiple times, I’ll only use the first (oldest) one.
- This is “open book” – you can use the articles to answer the questions – but do not get help from anyone else.
Ready? Take the quiz by clicking this link.
In-Class Exercise 9.1: Connecting Diverse Data
Here is the exercise.
And here are the workbooks [2012 Presidential Election Results by District.xlsx and Portrait 113th Congress.xlsx]