• Log In
  • Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

Data Science

Department of Management Information Systems, Temple University

Data Science

MIS 0855.003 ■ Fall 2022 ■ Guohou Shan
  • Schedule
  • Announcements
  • Readings
  • About
    • Course details
    • Course syllabus
    • Grading
    • Get Tableau software
    • Guohou, your instructor
    • Gradebook
  • Gradebook

Week3 Quiz and Week2 Summary

September 1, 2022 Leave a Comment

Here are some things you may need to pay attention to:
 
Week 3 Quiz Due (you can click the highlights to get the related content)
 
  • please read the reading materials for Week 2 (four readings)
  • please finish the week 3 quiz before the class on 9/6 (Tuesday).
Week 3 guest speaker
  • I will invite Joe, the senior associate director from the MIS department to give us a 15 mins talk, helping us go over the ways of getting an MIS major, minor, and/or certificate in the class of 9/6.
Some key things from week 2:
 
In-class activity submission
  • You will get up to 5 points for submitting the complete dictionary
  • The points you get from this assignment have the same importance as your individual assignment, which takes up 20% of your final scores
  • We will have other in-class activities in later classes, which may have higher points
  • For this particular activity, I will only evaluate how you build the dictionary
    • If you didn’t provide variable meanings or units because you are not familiar with the abbreviation or the car functions, that is totally fine. There is no point loss on that.
    • So, if you are correct in labeling the data type (we have talked about the common data type in the class), I will give you a 5 point.
In-class quiz
  •  
  • You will get up to 4 points from attending the in-class quiz
  • The in-class quiz has the same importance as the before-class quiz, where it takes up to 10% of your final score
Week 2 Key Takeaways
  • Data is important Today
    • data is the new oil/gold
    • massive data contains massive useful insights
    • Your competitors use data to drive revenue
    • advanced tech makes deriving the insights easier
  • Big Data means
    • Volume: the amount of data has literally exploded
    • Variety: many different sources of data are combined together
    • Velocity: data can change very quickly
  • Compared with the old days, data storage and collection are convenient and cheap
  • Data Science is the generalizable extraction of knowledge from data
  • Dangers of big data analytics
    • it is easy to find what is not there
    • causality is harder to find, especially, the direction of the causality can be tricky
    • dirty data appears more often. However, it is harder to find errors
  • If I have big data, Do I still need to think
    • Yes. It is hard to find actionable insights from big data
    • Data can have a survival bias
  • The hypothesis is “educated guess”, which is also a testable prediction. Good hypothesis should 
    • be testable
    • be falsifiable
    • ground in rationale
  • Metadata: the data that describe other data. It contains
    • Variable name (a dataset column label)
    • Variable description (in a data dictionary)
    • Data type (in a data dictionary)
    • Value (the datum itself)
  • Metadata is often stored as a data dictionary attached to a dataset
  • Common data types contain
    • Integer: whole numbers; it is called Number in Excel and Number (whole) in Tableau;
    • Floating point: decimal values; it is called Number in Excel and Number (decimal) in Tableau;
    • Boolean: binary values; it is called <N/A> in Excel and Boolean in Tableau; 
    • String: Alphanumeric characters; it is called Text in Excel and String in Tableau;
    • Date/Time: Calendar date and time; it is called Data in Excel and Date or Date & time in Tableau.
  • The data type is important
    • it determines the type of values that a data field can have
    • it defines the kind of operations that can be performed on data
    • Incorrect data types can create problems in analysis and result in wrong results (e.g., the gene name errors shown in the week 2 reading materials)
  • Why does Metadata matter?
    • it brings economic costs (e.g., the Mars Climate Orbiter)
    • it can make human life in danger (e.g., Gimli glider)
  • Advantages of Metadata
    • metadata facilitate understanding and data processing
    • it makes navigating data and data-based objects easier
  • Weakness of Metadata
    • it takes some time to read it
    • creating and maintaining is laborious
    • the creator of the data just doesn’t need it, making it not created
  • In practice, we often need to reverse engineer or just guess the missing parts of metadata when analyzing datasets.

 

Exam, Assignment, Project, and Quiz Dues
  • three in-person and closed exams will happen on 9/29 (Thursday), 10/27 (Thursday), and 12/01 (Thursday)
  • two individual assignments will be due on 10/13 (Thursday) and 11/10 (Thursday)
  • the final group project will need to have 4-5 students in one group and is due on 12/06 (Tuesday)
  • quizzes are normally due before the start of the first class (Tuesday) of a week
  •  

 

Instructions to get Tableau software:
  • Download the latest version of Tableau Desktop and Tableau Prep Builder from here: https://www.tableau.com/tft/activation
  • Click on the link above and select “Download Tableau Desktop” and “Download Tableau Prep Builder” (we need to use both Tableau Desktop and Tableau Prep Builder)
  • On the form, enter your school email address for Business email and enter the name of your school for Organization.
  • Activate with the product key TCVB-B1D4-A680-1321-2667
 

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Primary Sidebar

LECTURE RECORDINGS

Recordings are as follows.

14th week 11/29/2022
12/1/2022
13th week 11/15/2022 11/17/2022
12th week 11/08/2022 11/10/2022
11th week 11/01/2022 11/03/2022
10th week 10/25/2022
10/27/2022
9th week 10/18/2022 10/20/2022
8th week 10/11/2022 10/13/2022
7th week 10/04/2022 10/06/2022
6th week 09/27/2022
09/29/2022
5th week 09/20/2022 09/22/2022
4th week 09/13/2022 09/15/2022
3rd week 09/06/2022 09/08/2022
2nd week 08/30/2022 09/01/2022
1st week 08/23/2022 08/25/2022

RECENT ANNOUNCEMENTS

  • Week 13 Summary
  • Week13 Prep and Week12 Summary
  • Week12 Prep and Week11 Summary
  • Week10 Prep and Week9 Summary
  • Week9 Prep and Week8 Summary
  • Week8 Prep and Week7 Summary
  • Week6 Exam and Week5 Summary
  • Week5 Quiz and Week4 Summary
  • Week4 Quiz and Week3 Summary
  • Week3 Quiz and Week2 Summary
  • Week2 Quiz and Week1 Summary

  • Video for part6 in-class-activity0915
  • Video recap for in-class-activity0913
  • Data visualization
  • Assessing the Trustworthiness of Data
  • Welcome to MIS0855 Data Science!
  • Teaching Team

    Guohou (Jack) Shan (Instructor)
    guohou.shan@temple.edu

    I am available to meet students on Tuesday (3:30 pm – 4:30 pm) at Speakman 208E. I will also hold virtual office hours from 1-3 pm on Monday and Friday (4 pm – 5 pm) over zoom (https://temple.zoom.us/j/2292311746).  If the time is not convenient for you, please send me an email.

    Anna M Boykis (Information Technology Assistant)
    anna.boykis@temple.edu

    Email Anna to arrange a meeting.

    GREAT DATA SITES

  • FiveThirtyEight
  • Guardian Data Blog
  • Flowing Data
  • Financial Times Data Blog
  • Pew Research Data
  • US Government Open Data
  • OpenDataPhilly
  • Copyright © 2025 · Department of Management Information Systems · Fox School of Business · Temple University