i
Exploring R
Evolution of R
Programming Features of R
R for Machine Learning
R for Data Analysis
Application of R
R vs. Python vs. SAS
R vs. Excel vs.Tableau
Install R base on Windows
Install R Studio on Windows
Install R base on Ubuntu
Install R Studio on Ubuntu
R Starter
First R Program
Working with R Packages
R Workplace and R Sessions
Manage working directory
Customize R studio
RStudio Debugger
RStudio History and Environment variables
R Syntax
R Variables
R Data Types & Structures
R Arithmetic Operators
R Logical Operators
R If Statement
R - If…Else Statement
If…else if…Else Statement
R for loop
R while loop
R repeat loop
R String Construction
R String Manipulation Functions
Creating Character Strings
R Functions
R built-in functions
Working with Vector
R Vector Indexing
R Vector Modification
R Arithmetic Vector Operations
R Lists
Access List elements (List Slicing)
List modification
R Matrix construction
Access Matrix elements
R Matrix Modification
R Matrix Operations
R Array Construction
Accessing Array Elements
Manipulating Array Elements
R Data Frames
Data Extraction
Data Frame Expansion
R Built-in Data frames
R Factors
Manage Factor levels
Factor Functions
R Contingency Tables
R Data Visualization
R – Charts and Graphs
R Density Plot
R Strip Charts
R Boxplots
R Violin Plots
R Bar Charts
R Pie Charts
R Area Plots
R Time Series
Graphics with ggplot2
Ggplot2 Structure
ggplot2 Bar Charts
ggplot2 Pie Chart
ggplot2 Area Plot
ggplot2 Histogram
ggplot2 Scatter Plot
ggplot2 Box Plot
Mean & Median
Standard Deviation
Normal Distribution
Correlation
T-Tests
Chi-Square Test
ANOVA Test
Survival Analysis
Data Pre-processing and Missing Value Analysis
Missing data treatment
Missing value analysis with mice package
Outlier Analysis
Problems with outliers
Outlier Detection
Outlier Treatment
Simple Linear Regression
Mathematical Computation
Linear Regression in R
A complete Simple Regression Analysis
Multiple Linear Regression
Mathematical Analysis
Model Interpretation
A complete Multiple Regression Analysis
Logistic Regression
Mathematical Computation in R
Logistic Regression in R
Heart Risk Analysis using LR
Support Vector Machine
Heart Risk Analysis using SVM
Decision Trees
Random Forest
K means Clustering
Big data Analytics using R-Hadoop
RHADOOP Packages:
rJava: Low-Level R to Java Interface
rhdfs: Integrate R with HDFS
rmr2: MapReduce job in R
plyrmr: Data Manipulation with MapReduce job
rhbase: Integrate HBase with R
Environment setup for RHADOOP
Getting Started with RHADOOP
Statistical Analysis:
We will now shed light on Statistical functions, where we will go through the necessary measurements and tests of statistics. We will start with basic mean and median function; then, we will follow the standard deviation and normal distribution in detail. The next section will focus on correlation and finally finish the session with statistical tests, a T-test, and chi-squared test.
Mean & Median
Mean:
Statistical Mean is the same as the average in mathematics. We can calculate it by adding all the data points and dividing by the number of data points. In R-Pro, we use the mean ( ) function to return the mean of a data set.
It is determined by taking the sum of the values and dividing them with the number of values.The function mean() is used to calculate this in R.
In mean function, we mainly use three arguments; the first one is for the data source, sometimes we use trim to drop some unwanted observations, and finally, we can remove Null values just to set na.rm=TRUE. This mean function will return the mean of the mpg data set.
Syntax:
Example:
Median
The median is the middle number. We can calculate it by ordering all the data points and picking out the one in the middle. If there are two middle numbers, we will take the mean of those two numbers.
We use the median function to find out the median of a data set, similar to mean, the syntax of median contains ra.rm = arguments to handle the null values. In a data series, the middle most value is considered the median.To calculate this value, R uses the median () function.
This is an example to find out the median from mpg data.
Syntax:
Example:
Don't miss out!