Study Guide for Exam 3 (Final Exam)
Here is the study guide for the third (final) exam. And here’s the more detailed overview.
In-class exercise 13.2 driver download
Here is the link for the driver download.
Your reading for 13.1, 13.2
Here’s your reading for the week ahead:
Your reading for 12.1, 12.2
Here’s your reading for 12.1, 12.2 and sentiment analysis:
Extra credit assignment due April 16
For one point added to your final grade, here’s what I’m looking for. Read the following two Q&As and give me 100 words on one of them (your choice).
- Death and data science: How machine learning can improve end-of-life care
- A day in the data science life: Salesforce’s Dr. Shrestha Basu Mallick
The 100 words should focus on one of the following:
- What are the challenges with this topic with regard to data science?
- How do you foresee analytics affecting the fields that are in the interviews (end-of-life care and sales to assistance)?
- What were your biggest takeaways from these two data scientists?
This is due to me via email before class on April 16.
An unvarnished view of being a data scientist
Yes, data scientist is the hot career of the moment, but when someone asked on Quora what the downsides were, the answers were pretty telling. Here’s a look at what data scientists had to say.
Your reading for 11.1, 11.2
A note about that first reading: It’s a bit dated, and Hadoop has advanced since that article. Much of the focus in the open-source community has been on side projects tied to Hadoop. One common theme is that analytics and better user interfaces are being layered onto Hadoop. Most companies would use Hadoop via vendors like Cloudera and Hortonworks, which package Hadoop and sell services and support. To see what I mean regarding Hadoop and its related projects, see the primary Apache page. For our purposes, we’ll keep Hadoop high level, but in the data science department, internship interviews, etc., you may want to know about projects like Hive, Cassandra, Pig and Spark.
Study Guide for Exam 2
Here is the study guide for the second exam. And here’s the more detailed version.
Agenda for the exam will be to:
–Re-form groups for last group project at the beginning.
–Take test
Your reading for the week (data integration)
Data is beautiful data science, visualization link worth checking out
I came across this story about someone on Reddit visualizing his Tinder experience, only to find that another person had visualized their 500-day OKCupid outcomes.
Both of the data sets (along with a bunch of others) are on the Data is Beautiful Reddit. The thread highlights the democratization of visualizing data. Worth checking out for giggles.