i
Exploring R
Evolution of R
Programming Features of R
R for Machine Learning
R for Data Analysis
Application of R
R vs. Python vs. SAS
R vs. Excel vs.Tableau
Install R base on Windows
Install R Studio on Windows
Install R base on Ubuntu
Install R Studio on Ubuntu
R Starter
First R Program
Working with R Packages
R Workplace and R Sessions
Manage working directory
Customize R studio
RStudio Debugger
RStudio History and Environment variables
R Syntax
R Variables
R Data Types & Structures
R Arithmetic Operators
R Logical Operators
R If Statement
R - If…Else Statement
If…else if…Else Statement
R for loop
R while loop
R repeat loop
R String Construction
R String Manipulation Functions
Creating Character Strings
R Functions
R built-in functions
Working with Vector
R Vector Indexing
R Vector Modification
R Arithmetic Vector Operations
R Lists
Access List elements (List Slicing)
List modification
R Matrix construction
Access Matrix elements
R Matrix Modification
R Matrix Operations
R Array Construction
Accessing Array Elements
Manipulating Array Elements
R Data Frames
Data Extraction
Data Frame Expansion
R Built-in Data frames
R Factors
Manage Factor levels
Factor Functions
R Contingency Tables
R Data Visualization
R – Charts and Graphs
R Density Plot
R Strip Charts
R Boxplots
R Violin Plots
R Bar Charts
R Pie Charts
R Area Plots
R Time Series
Graphics with ggplot2
Ggplot2 Structure
ggplot2 Bar Charts
ggplot2 Pie Chart
ggplot2 Area Plot
ggplot2 Histogram
ggplot2 Scatter Plot
ggplot2 Box Plot
Mean & Median
Standard Deviation
Normal Distribution
Correlation
T-Tests
Chi-Square Test
ANOVA Test
Survival Analysis
Data Pre-processing and Missing Value Analysis
Missing data treatment
Missing value analysis with mice package
Outlier Analysis
Problems with outliers
Outlier Detection
Outlier Treatment
Simple Linear Regression
Mathematical Computation
Linear Regression in R
A complete Simple Regression Analysis
Multiple Linear Regression
Mathematical Analysis
Model Interpretation
A complete Multiple Regression Analysis
Logistic Regression
Mathematical Computation in R
Logistic Regression in R
Heart Risk Analysis using LR
Support Vector Machine
Heart Risk Analysis using SVM
Decision Trees
Random Forest
K means Clustering
Big data Analytics using R-Hadoop
RHADOOP Packages:
rJava: Low-Level R to Java Interface
rhdfs: Integrate R with HDFS
rmr2: MapReduce job in R
plyrmr: Data Manipulation with MapReduce job
rhbase: Integrate HBase with R
Environment setup for RHADOOP
Getting Started with RHADOOP
In this section, we compare the three most commonly used programming tools for Data Analysis.
Feature |
R |
Python |
SAS |
Cost-Effectiveness |
R is open-source software and free to use. |
A python is open-source software and free to use. |
SAS is commercial software. For most of the professionals, it is expensive and still beyond control |
Learning Ease |
R is a low-level programming language, but it is very easy to learn and understand due to its simple structure. |
Because of its simplicity and usability, Python is very easy to learn and understand. |
SAS is one of the world's easiest languages. Anyone can learn SAS without having any programming knowledge. |
Data Management |
R is capable of handling Big data, structured and unstructured data. |
Python is suitable for structured and unstructured data. They are also a compelling language for Big Data analysis. |
SAS is also efficient for Structured or unstructured data and Big data analysis. |
Graphical Capabilities |
R has the highest graphics capabilities due to packages such as Lattice, ggplot, RGIS, etc. |
Python gives a fierce competition to R with the help of graphical packages such as VisPy, Matplotlib. |
SAS provides functional graphical functionalities. But it is purely functional. We need a thorough understanding of the SAS Graph package to configure it. |
Community Support |
As R is an open-source language, it has a good community, and they are the best among others. |
Similar to R, Python has a very good community, and they are also very active. |
SAS provides an excellent technical support experience that is not available for Python and R. It also has a great community. |
Application Advancements |
Due to the open nature of R, the development of new features and techniques are fast as compared to SAS. |
Similar to R, the development of new features and techniques are speedy. |
Compared to Python and R, it is less prone to errors, but it usually takes a long-time for a new release. |
Deep Learning |
R has introduced KerasR and Keras packages. We can effectively generate Deep Models with R. |
Python's introduction of TensorFlow and Keras has made significant advances in the area of deep learning. |
SAS has recently introduced deep learning, and it is still in the development phase. There is a long road to travel for SAS for deep learning. |
Don't miss out!