A Data Science Project- Part 4: Chi-Square Test of Independence

In the last article, we have discussed ANOVA test, and it gave us insight into checking the distribution of response variable among groups of an independent variable. Today, we will learn how to check relationships between two categorical variables. http://yesthedataguy.tumblr.com/post/134133746036/how-to-perform-chi-square-test-of Advertisements

A Data Science Project- Part 3: Hypothesis testing and ANOVA

This post is in continuation of A Data Science project series. In this post, We will use ANOVA test whenever we need to check if two or more groups are different from each other or not. i.e. let’s say there are four races in a school – White, Hispanic, Black and Indian. Now school management … More A Data Science Project- Part 3: Hypothesis testing and ANOVA

A Data Science Project- Part 2: Making Sense of Data

I hope you have gone through part1 and part2, Today I will tell you not only how to explore data through visualization but also the most important part how to interpret them. Don’t miss reading summary in the end of every post. we will create bar chart, histograms, scatter-plot, box-plots etc. We will also check … More A Data Science Project- Part 2: Making Sense of Data

A Data Science Project- Part 1(b)

In the previous article (https://d4datascience.wordpress.com/2016/11/10/a-data-science-project-part-1/), we have done basic data analysis like calculating means, frequency tables, summary etc. Now we will derive new variables. Why? Derived variables will help to understand more about them. For example, We have derived variable ip(derived from incomeperperson variable) which will help us to understand how many people fall in lower … More A Data Science Project- Part 1(b)