Introduction Interactive plot version of blog is available at here. This is the part 3 of our Python for Stock Market Analysis series and here, we will explore some of popular growth rates that can be used to see how well is our value is changing over the period of time. Lets take some of […]
Statistics
Central Tendency vs Dispersion
Hello everyone, welcome back! In this blog we will again focus into the some of widely used central tendency techniques and then measure of spread in the Statistical Analysis of the EDA part. If you are looking for a brief walk-through of a Statistical Data Analysis in Data Science please refer to this blog of […]
R Exercise: Association Rule Mining in R
Association Rule Mining Association rule mining (also known as Association Rule Learning) is a typical technique for determining relationships (co-occurrence) between many variables. It is mostly used in grocery stores, e-commerce websites, and other similar establishments, in addition to massive transactional databases. Amazon knows what else you want to buy when you order something on […]
Walkthrough of Statistical Analysis in Data Science
What is Statistical Analysis in Data Science? This blog starts from definition to explaining and experimenting with different part of statistics. Introduction Statistics is very important field and there are lots of definitions and use cases available and noting few. Statistics is a systematic collection of data on measurements or observations, often related to demographic […]
K Medoids Clustering from Scratch in Python
K Medoids Clustering is a clustering algorithm, have you tried to write it from scratch in Python? But before that, if you are also looking for other algorithms from scratch then please follow along: Linear Regression from Scratch Logistic Regression from Scratch Logistic Regression with Different Loss Functions PCA from Scratch K-Means Clustering from Scratch […]
Exploratory Data Analysis in R with Tests
Exploratory Data Analysis in R Hello everyone welcome to our another blog of in R where we will perform various statistical test on Titanic dataset. Code to Read Titanic Dataset data = read.csv("E:/code/Titanic Survival Practice/train.csv") df = data.frame(data) summary(df) PassengerId Survived Pclass Name Min. : 1.0 Min. :0.0000 Min. :1.000 Length:891 1st Qu.:223.5 1st Qu.:0.0000 […]
T-Test for Statistics and Brief Introduction in R
t- test History About Student t- Distribution t-static: t-static was first discovered by Englishman W.S Gosset with his nick name student who published it in 1908 in his research paper entitled " The probable error of the sample mean". Therefore, this t-statistic is called student’s t-statics. Later on Prof. R.A Fisher developed and defined the […]