-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Just a reminder that your final exam will be on Tuesday, May 9 at 1:00pm in the same room as class. Please be on time. Students will not be permitted to enter late. Please make sure that all missing as […]
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Leave your response to the question below as a comment on this post by the beginning of class on April 27, 2017. It only needs to be three or four sentences.
What was the most important takeaway (from y […]
-
What I mostly took away from this course is how to thoroughly analyze data and take advantage of the technologies provided to me like excel and tableau. This class made me put more data and what I am analyzing into perspective. Tableau was had to learn, but in more ways than one it helps with data visualization and having your data set understood.
-
This class helped me to learn skills with how to deal with data. The class helped me learn how to look through data and clean the bad data which is an important skill. I also learned more effective techniques in excel as well as how to use tableau.
-
My most important (and favorite) takeaway from this course was learning how to analyze data sets with Tableau. I will be using Tableau for my internship this summer, so getting a solid understanding will certainly benefit me. Being a huge sports fan, the one in-class assignment from this semester where we analyzed college basketball statistics was my favorite.
-
The most important takeaway from this course in my opinion is that data is everywhere and there are so many different ways to view and display it. Using excel and tableau are great tools that we go to take away from being in the course that we can now use in our future endeavors.
-
The most important takeaway from this class is the ability to analyze data sets using Excel and Tableau. These programs are integral for those who want to perform data science. This course is an introduction to the various ways to analyze, clean, and present data correctly.
-
The most important take away from this class is using new technologies like Tableau. I enjoyed using it for projects as well as outside the classroom for my own individual projects. It was also interesting to clean data and learn best practices.
-
The most important takeaway in my opinion is learning how to use Tableau effectively. It is a very useful resource and being proficient with it ups your value. Also this class goes into depth about data and its many uses which is interesting in its own way.
-
My biggest takeaway from this class overall is how businesses use data to determine success and make intelligent decisions. Without this class, I don’t think I would have this perspective of data (because I am in a data journalism class) and it opened me up to many different uses for big data. The programs we used are also essential to getting a job in any data field these days. Putting Tableau and Excel proficiency on my resume helped me get a data journalism internship this summer at an investigative firm, and will help me get jobs in the future. Because of this class, I have a well-rounded understanding of the collection, analysis and presentation of data.
-
The most valuable skills learned in the class was learning ways to use Tableau and Excel. However, the main purpose of the course is an introduction to data, and one of the most surprising things I learned about was the need to clean data sets almost as much as entering them.
-
The most important takeaway to me from this course is how to analyze and visualize data using Tableau. This tool will probably help me a lot in my future career.
-
One of the most important things that I took away from this course was the transformation of raw data into visuals that we can understand and learn from. This class was the first time I was ever introduced to Tableau and the value that it brings to understanding data. It is a useful resource that can help me in the future.
-
My most important takeaway from Data Science is how much data mining goes around us all day. We all use Facebook, Twitter, Netflix, Youtube almost everyday, but only some will recognize how big data is a part of all those platforms. If I use Netflix, now I know that Netflix takes into account the types of shows and movies I like to watch and recommend new titles for me. This is only possible with constant data mining.
-
I feel that the most significant take-away was learning Tableau. Upon entering this course, I was proficient in using Excel but had never even heard of Tableau. I am blown away with how useful and relevant Tableau! I’ve used in it my current job and I’ve discussed it in multiple interviews purely because of this class – I would not have known of it otherwise. No matter what field an individual is, it will always be advantageous to be familiar with data management software.
-
For me, the most important takeaway was learning how to structure, interpret, and present data. I think that Excel, Tableau, and the Piktochart application were the best tools we learned about. The theme of numbers and data determining future outcomes is very important to me because I am majoring in statistics. If I had to explain to a future student about what this class is about, I would say my first sentence of this response: this class is about learning how to structure, interpret, and present data.
-
Tableau was the most important thing i took away from this calss. This is a great program that I could use in he real world and put on my resume
-
then most important take away was using tableau and excel together. Tableau gave me a software to analysis data and how to interrupt data.
-
The most important thing i learned throughout the class is to work with Tableau.This software will be very useful in the future for organizing data.I learned how to be proficient at it and how to solve problems that can occur when organizing data.
-
This course will help you understand how to use Excel and Tableau. Not only will it help you to understand those databases but it will help you understand how they work and the purpose of using them
-
I believe the most important takeaway from this course is the understanding that data can be manipulated in may ways, consciously and unconsciously, to support a hypothesis. It is important to recognise the native bias’ in these data to ensure that when you are formulating a testable hypothesis that you are looking at the correct portions and relevant portions of the useable data sets.
-
I think learning to use data visualization tool like Tableau was important take away from this course. But also getting to learn different functionality of excel such as vlookup is important for cleaning data and adding data. It was also interesting to know importance of data in profit and non profit organizations. Overall it was a good experience to learn about data and its visualization.
-
The most important take away from this course was learning about data and how to deal with specific types of data. Also making good visualizations on Tableau.
-
For me, the most important takeaway from this course was learning how to analyze data in Tableau and gaining a much better understanding of how to use Excel. This course got me interested in learning more about data and data analysis, and it was a great introduction to these topics.
-
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Here is the study guide for the third (final) exam.
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Here is the link for the driver download
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Here is the exercise.
And here is the spreadsheet you’ll need [In-Class Exercise 13.2 – VandelayOrdersAll.xlsx].
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 2 months ago
Here is the exercise
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Leave your response as a comment on this post by the beginning of class on April 20, 2017. Remember, it only needs to be three or four sentences. For these weekly questions, I’m mainly interested in your o […]
-
data driven service- snapchat
columns- how many people are on snapchat; reviews; how many snaps are sent a day; how many snaps are received a day
rows- individual numbers for each column -
For Instagrams it store users photos. A row stores a user name, and some columns would be the photos stored, user biography, amount of followers, amount of those the user is following, and amount of photos posted.
-
For Twitter columns could include Twitter handle, Number of followers, Number of people following, Number of tweets and Number of replies. The rows would then be the data that fits into each column.
-
when i think about a data driven service it will be
Connect mcgraw hill accounting
Columns will be:
Number of assignment , students name : registration ID
Rows: assignment grades, name of student : class in which they are enroll -
ESPN uses many different spreadsheets to display statistics. In the case of NBA player stats, each row would represent an individual player, and the columns could display values for points per game, rebounds per game, assists per game, and field point percentage.
-
For snapchat the first row could be username, snap score, and snaps per day then the columns under the first row could be the corresponding data. For example the data for my user would be column 1: tylervelez column 2: 22,262 column 3: 20 and it could continue down with more users.
-
Data-drive Service: Tumblr
Columns: reblogs, likes, and also different types of posts: text, q&a, photo, video, music
Rows: Blog usernames -
Data-driven service- Instagram
Rows- Username and bio
Columns- # Photos posted, # of followers, # of following -
We could take statistics for Twitter such as:
Columns: Followers, Following, # of Tweets, Replies, Likes, Retweets, # of Times visiting the App/Website each day
Rows: All of the rows contain quantitative figures (sums of each)
We could then use this data to find averages such as average Likes per Tweet, Retweets per Tweet, Tweets per visit, etc.
-
For Amazon:
I believe the columns would be the product id, product quantity, manufacturers, distributors, price, last on sale
The rows would contain the product id, product quantity, # of manufacturers, # of distributors, price, and the date when the product was last on sale. -
ESPN fantasy football app, displays the player’s name, position and number of points scored.
-
Blackboard is a great example.
The rows could be indiviual students
Columns could be:
-Homework Grades
-Test Grade
-Quiz Grade
-Attendence -
An example of a data-driven service would be Twitter
The columns would include:
-Followers
-Following
-Pictures
-Number of Tweets
-Likes
-Retweets
-Replies
And the rows would include all the quantitative data from each of the columns. -
For a company like Netflix, an excel spreadsheet columns would have:
The name of the TV Show/Movie
Cast
Crew
Genre
Netflix rating
How many seasons? If applicable
How many episodes? -
For Instagram:
Date of photo posting
Number of followers
Number of following
Amount of Likes
Number of comments -
An example of Data-Driven Service is Facebook.
Some rows would include – Age, number of friends, city of residence, education, -
Data Driven service: Youtube
Columns: Trending, History, Your Uploads, Best of Youtube, etc.
Rows: Video info (link) -
Data driven service: Livescore
Columns: Teams matches
Rows : results, fouls, offsides, corners, time goals were scored -
Service: MyFitnessPal
Columns: # of meals per day, calories per item, # steps per day, # of workouts
Rows: values for each item
-
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
And here is the spreadsheet you’ll need for the exercise [In-Class Exercise 12.2 – Sentiment Analysis Tools.xlsx].
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Some quick instructions:
You must complete the quiz by the start of class on April 18, 2017.
When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to […] -
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
Here is the excel spreadsheet you will need to complete this exercise [In-Class Exercise 11.2 – NCAA 2013-2014 Player Stats]
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Leave your response as a comment on this post by the beginning of class on April 13, 2017.
Leave a post about your group project:
What is the subject of your group project?
Which of your fellow […]-
I am working with Tyler Velez, Benjamin Thomas, Samantha Sederstrand
our project will be done on Walmart store data. -
1. The subject of our project focuses on variables that may affect voters during election years.
2. My group members are Ryan Tempestini, Cole Krug, Jim Dwyer, Haley Sienkiewicz, and Taylor Thomas. -
1. Our group is going to analyze the store data from 45 Wal-Mart Stores (https://www.kaggle.com/c/walmart-recruiting-store-sales-forecasting/data)
2. Benjamin Thomas, Samantha Sederstrand, Tyler Velez, Rob Ritrovato
-
Subject: Is to find the averages of nba players by position.
Group Members: Asim Ciceksay; Darien Bennett : Shefali Kakarlapudi: Virag N. Maniar -
Subject: Access to and utilization of healthcare by state.
Group: Caitlyn Cignarella, Adam Lerro, Andrew Bertz, David Moyer, Linh Tran. -
Our topic is the amount of parking tickets in Philly.
Group members: Vasil Vasilev, Artur Muzyka, Connor McLarney, Taylor Peluso
-
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Some quick instructions:
You must complete the quiz by the start of class on April 11, 2017.
When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to sign […] -
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Leave your response as a comment on this post by the beginning of class on April 6, 2017. Remember, it only needs to be three or four sentences. For these weekly questions, I’m mainly interested in your o […]
-
This article talks about how the house will allow internet providers to sell browsing information with out a user’s consent. This is a privacy issue since users will not be able to opt out of this option and the websites they visit will be logged and by ISPs to see the websites visited but not specific pages. I do not think that this is an appropriate legislation to be passed because it does not give a user the option to say no, but the decision is made for them by the government.
-
https://fivethirtyeight.com/features/is-the-home-run-back-will-the-shift-ever-die-and-8-more-burning-baseball-questions/
This article is very interesting because baseball season is starting up this week and the article has to deal with data analysis behind the sport. The article really touches all parts of baseball and the data behind it. The article uses infographics to show the data, the most interesting infographic to me was probably the one that talks about the defensive shifts that occur. Defensive shifts have become more popular over the past five years, so to show the data on that was interesting to see. I find all articles that talk about data and sports to be very interesting. -
This is a very interesting article that counters the theory that tracking time can increase productivity. In fact, this article states that tracking time can often negatively affect how a workforce perceives their work. For example, employees tend to work less hard in the final 30 minutes of their day, knowing that they will be let off soon. Many people mentally check out prior to the end of their workday.
-
About one-in-seven Americans don’t think men should be able to take any paternity leave
I find this article very interesting because nowadays it is very common to argue about equality between men and women. This article gives some data about how men of different ages think about fathers leaving work after the birth or adoption of a child or to take care of a family member or themselves because of a serious health condition.
-
https://projects.fivethirtyeight.com/trump-approval-ratings/
This article is about Donald Trump, and how popular he is. This is really interesting to me because in every poll he is more disapproved then approved. It is funny to see this because he was voted for as president but somehow he is less liked.
-
http://www.nbcnews.com/storyline/zika-virus-outbreak/new-study-shows-higher-feared-zika-risk-babies-n742571
This article is about the amount of woman effected with the Zika virus and the percentage of their babies who have some type of birth defect. Nearly 10% of women who have the Zika virus and are pregnant, result with babies with some type of birth defect. This is relevant to me because my sister-in-law is currently pregnant and being that summer is just around the corner it is going to be extremely important to keep her away from mosquitos as much as possible so this disease does not transfer to her. -
Sociologists urge use of big data to study human interaction
This article was extremely interesting especially to me as an anthropology major. The article states that sociology experts are urging sociologists and social psychologists to focus on developing online research studies with the help of big data in order to advance the theories of social interaction and structure. It’s relevant to me because studying in the field of human interaction, it’s cool to see how we could contribute to big data.
-
https://fivethirtyeight.com/features/the-four-types-of-scarlett-johansson-movies/
The article shows the four types of Scarlett Johansson Movies by plotting her films’ inflation-adjusted domestic box office revenue according to the numbers against their critic score on Rotten Tomatoes. It comes out the 4 types are Prestigious, Home Alone 3 and Peers, Her, and The Avenger. Each category is listed with several films that she was in and all help readers to make sense of her career. -
Trump is about to sign a bill letting internet providers sell personal information to advertisers
This article explains the new legislation passed by the Trump administration and Congress to allow companies to freely use personal internet data. In the past, companies needed special permissions to get this information, but not any longer. Personal information through social media and web searches can be freely used for advertising. I want to know more about this law because I thought some companies already had access to this information, and were already using it for personalized advertising – which we often see on Google searches and Facebook. -
This article is about the data that Windows 10 collects. It also discusses how it wasn’t very well received publicly and has recently been updated. I found this interesting because this OS is the first to have mandatory opt-in for data collection and has now been adjusted to opt-out of the data collection.
-
http://www.cnbc.com/2017/04/04/gambling-software-helped-amateur-make-1300-in-12-days.html
This article comes from CNBC, the article is about “Justin” an amateur gamble who, thanks to the NCAA tournament made $1300 in just 12 days. By buying access to offshore sports books he was able to study gambling patterns and learn how to make the best bet. Justin started with just $120 on the online gambling site, Bovada. This article interested me because it linked sports, gambling, and data into a cool story that others can now learn from and try to duplicate. -
https://fivethirtyeight.com/features/unc-played-ugly-enough-to-win/
This article shows how UNC had to play a different style of basketball than what is normally seen today to win the National Championship on Monday. This is a great one because it breaks down the game and shows the key components to their win.
-
Case Study: Implementing Data Governance for Data Lakes and Big Data
This article focused basically on data integration for healthcare providers. They wanted to find an easier and more useful way to collect all of a patients information from different data sources and combine them so when they go to their primary care they have all the information they need to help patients. They had to integrate their data as well as protect the patient’s information because it was crucial while collecting data from a variety of different sources. -
http://www.reuters.com/article/us-twitter-lite-idUSKBN1780GT
This article is about Twitter releasing a “Twitter Lite”, which can be accessed via web browser. It is aimed for people who only have limited data on their phones hoping to pick up this target market. Twitter is lagging behind other tech firms. They noticed that 40% of their users view Twitter through a web browser rather than the mobile application.
-
“Be Suspicious Of Online Movie Ratings, Especially Fandango’s”
https://fivethirtyeight.com/features/fandango-movies-ratings/
By exploring movie review data from IMDb, Rotten Tomatoes, and Fandango, the author found that nearly all movies on Fandango (97%) had at least a 3-star rating. He revealed that Fandango has a very flawed system — Ted 2, for example, had a 4.5 rating on Fandango, while IMDb and Rotten Tomatoes gave it 2.5 stars. The problem is in the way Fandango aggregates its users’ reviews. They tend to always round up to nearest half-star when that would not be the right mathematical thing to do (rounding up a 4.1-star rating to 4.5, instead of 4.0). As a self-proclaimed cinephile, it was interesting to learn that Fandango’s ratings are not so reliable. Although I had already noticed that their ratings tend to be higher than other websites, I had no idea that they intentionally manipulate data to make movies “look better.” -
http://money.cnn.com/2017/03/28/technology/house-internet-privacy-repeal/
The title of the article: “Congress just killed your Internet privacy protections” pretty much sums up the article. Under Obama administration it was illegal for internet service providers to collect personal information without permission and then sell it to other companies. However now that the congress repealed internet privacy protections, internet providers are free to share our internet browsing data. -
https://fivethirtyeight.com/features/new-york-now-has-more-mets-fans-than-yankees-fans/
This article compares both New York MLB teams, the Mets and the Yankees, to see which team is more popular in their home state. For the first time in over a decade, New Yorkers surveyed leaned towards the Mets. This is interesting because the Yankees are one of the most valuable sports franchises in the world with a very distinguishable logo. -
https://projects.fivethirtyeight.com/2017-nba-predictions/?ex_cid=rrpromo
This article contains a lot of data about the NBA. It’s interesting to me because I follow the NBA closely and like to stay updated on the current events. It contains descriptive data of current team records and predictive data of a projected season record. It also has playoff odds and single game odds which can be useful for betting or simply guessing who would win. -
Online Education Gets Up Close and Personal as Big Data Improves Performance
This article talks about how the use of big data is changing the landscape of education. Big data can be used to tailor the educational needs for specific students while also enhancing the educational experience. This data can be used to personalize educational needs to every individual student. -
https://fivethirtyeight.com/features/to-ease-the-student-debt-crisis-hold-colleges-responsible/
This article is about an issue that almost every college student can relate to. The idea of student debt and what should be done with it. Obviously, big data has a huge part to play in the process of identifying the problems related to the rising rate of student borrowing. Those data points can include interest rates, location of the student and college, the national economic performance, etc. The article specifically targets colleges. More specially, on how colleges need to bear for of the responsibility in trying to attack the student debt problem. In my opinion, information gathering is very important. Colleges already gather enough info about each student and their loans. However, combing more factors like dollar value and future interest predictions, colleges can alter their financial aid programs.
-
Uber’s open source data visualization tool now goes beyond maps
This article talks about uber data visualization plans and how new key features that are helping people experience pick up times and visualize drop off times.
-
https://projects.fivethirtyeight.com/trump-approval-ratings/
This article shows how popular or unpopular Donald Trump is. This is interesting because the ratings show that Donald Trump is mostly disapproved in most of the states in the country. It’s still very interesting to see that even though he is disapproved in maximum states and he still won the elections and is elected as a president. -
The recent repeal of landmark privacy protections empowers internet providers to enter the market for online advertising where they can now collect, store, share, and sell consumer’s internet behavior. This also means that ISPs are no longer needed to upgrade security guidelines that protect their customers against hackers and thieves. They are allowed to sell their customers information to the highest bidder. No consent necessary, this can all be done without customers disclosing permission.
-
https://fivethirtyeight.com/features/to-ease-the-student-debt-crisis-hold-colleges-responsible/
This article talks about how student loans affect students on their post college life.On the information they give, it is explained that most of them have a lifetime debt and it is very negative.But still college is an investment and on the long run it will give you more than what you invested on it.In my opinion going to college is necessary if you want to be competent and pursue your goals.Because yo will need the knowledge and the connections you can get throughout your college life. -
This article is about data used by businesses. I am all about entrepreneurship and that’s why I found this article really helpful and entertaining.
-
This article talks about how Trump is allowing ISPs to sell your browsing data. This can be done without your permission through internet providers like Comcast, AT&T, and Verizon. The author of the bill, a Republican representative from Tennessee, has already received $700,000 from large internet companies and lobbyists over the years. This is something we should all be outraged about. It’s ridiculous that politicians can violate OUR rights and privacy for THEIR gain. Unfortunately, this is how the current political system in the United States works, and it’s corrupt. The article advises the reader to better educate themselves on the issue and to also reach out to local representatives to express opposition.
-
This article talks about the demographics, and statistical evidence of diversity during Fashion Week in Europe. The article includes visual displays of bar graphs, and lists showcasing diversity amongst the location of the shows, as well as specific designer shows. I chose this article and topic on fashion because, I’m someone who heavily follows the fashion industry. I invest my money on the newest editions of Vogue, I follow and observe the designers works, and I focus on the models who partake in this industry. So as a whole, it interests me. But another reason I chose it, was to see the comparisons and rise of diversity that is entering the industry that has been often predominantly occupied by white models. And with the rise and revolution of plus-sized models, it’s intriguing to see how their appearance, will really make the difference within the industry.
-
https://www.healthdatamanagement.com/news/deadline-looms-for-hospitals-suppliers-to-set-emergency-plans
Data in emergency management, specifically tracking compliance with specific codes and regulations, is highly important to motivating stakeholders to spend time and resources to comply with these regulations. If data on this matter is not tracked, there’s no way for the regulating bodies to assess whether organization remain compliant. -
https://fivethirtyeight.com/features/the-u-s-job-market-is-on-a-historic-growth-streak/
This article is about the U.S job market growth. Employers added many jobs in the market which means the recovery in the U.S. job market has been remarkable for its durability and resilience. It shows monthly job growth and the average growth.
-
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Some quick instructions:
You must complete the quiz by the start of class on April 4, 2017.
When you click on the link, you may see a Google sign in screen. Use your AccessNet ID and password to sign […] -
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here are the assignment instructions. Groups MUST be 5 members. You may not do this assignment on your own or in smaller groups than 5.
The assignment is due April 25, 2017. We’ll do the pres […]
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the study guide for the second exam.
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
And here is the Excel workbook you’ll need [Pew Story Data (Jan – May 2012).xlsx]
-
Mark Sabat wrote a new post on the site MIS 0855: Data Science Spring 2017 7 years, 3 months ago
Here is the exercise.
And here are the workbooks [2012 Presidential Election Results by District.xlsx and Portrait 113th Congress.xlsx]
- Load More