WA2711

R Programming from the Ground Up Training

Over the past few years, R has been steadily gaining popularity with business analysts, statisticians and data scientists as a tool of choice for conducting statistical analysis of data as well as supervised and unsupervised machine learning.
Course Details

Duration

2 days

Prerequisites

Participants should have the general knowledge of statistics and programming

Target Audience

  • Business Analysts
  • Technical Managers
  • Programmers

Skills Gained

  • R data structures
  • R functions
  • Statistical data analysis with R
Course Outline
  • What is R
    • What is R?
    • Positioning of R in the Data Science Space
    • The Legal Aspects
    • Microsoft R Open
    • R Integrated Development Environments
    • Running R
    • Running RStudio
    • Getting Help
    • General Notes on R Commands and Statements
    • Assignment Operators
    • R Core Data Structures
    • Assignment Example
    • R Objects and Workspace
    • Printing Objects
    • Arithmetic Operators
    • Logical Operators
    • System Date and Time
    • Operations
    • User-defined Functions
    • Control Statements
    • Conditional Execution
    • Repetitive Execution
    • Repetitive execution
    • Built-in Functions
  • Introduction to Functional Programming with R
    • What is Functional Programming (FP)?
    • Terminology: Higher-Order Functions
    • A Short List of Languages that Support FP
    • Functional Programming in R
    • Vector and Matrix Arithmetic
    • Vector Arithmetic Example
    • More Examples of FP in R
  • Managing Your Environment
    • Getting and Setting the Working Directory
    • Getting the List of Files in a Directory
    • The R Home Directory
    • Executing External R commands
    • Loading External Scripts in RStudio
    • Listing Objects in Workspace
    • Removing Objects in Workspace
    • Saving Your Workspace in R
    • Saving Your Workspace in RStudio
    • Saving Your Workspace in R GUI
    • Loading Your Workspace
    • Diverting Output to a File
    • Batch (Unattended) Processing
    • Controlling Global Options
  • R Type System and Structures
    • The R Data Types
    • System Date and Time
    • Formatting Date and Time
    • Using the mode() Function
    • R Data Structures
    • What is the Type of My Data Structure?
    • Creating Vectors
    • Logical Vectors
    • Character Vectors
    • Factorization
    • Multi-Mode Vectors
    • The Length of the Vector
    • Getting Vector Elements
    • Lists
    • A List with Element Names
    • Extracting List Elements
    • Adding to a List
    • Matrix Data Structure
    • Creating Matrices
    • Creating Matrices with cbind() and rbind()
    • Working with Data Frames
    • Matrices vs Data Frames
    • A Data Frame Sample
    • Creating a Data Frame
    • Accessing Data Cells
    • Getting Info About a Data Frame
    • Selecting Columns in Data Frames
    • Selecting Rows in Data Frames
    • Getting a Subset of a Data Frame
    • Sorting (ordering) Data in Data Frames by Attribute(s)
    • Editing Data Frames
    • The str() Function
    • Type Conversion (Coercion)
    • The summary() Function
    • Checking an Object's Type
  • Extending R
    • The Base R Packages
    • Loading Packages
    • What is the Difference between Package and Library?
    • Extending R
    • The CRAN Web Site
    • Extending R in R GUI
    • Extending R in RStudio
    • Installing and Removing Packages from Command-Line
  • Read-Write and Import-Export Operations in R
    • Reading Data from a File into a Vector
    • Example of Reading Data from a File into A Vector
    • Writing Data to a File
    • Example of Writing Data to a File
    • Reading Data into A Data Frame
    • Writing CSV Files
    • Importing Data into R
    • Exporting Data from R
  • Statistical Computing Features in R
    • Statistical Computing Features
    • Descriptive Statistics
    • Basic Statistical Functions
    • Examples of Using Basic Statistical Functions
    • Non-uniformity of a Probability Distribution
    • Writing Your Own skew and kurtosis Functions
    • Generating Normally Distributed Random Numbers
    • Generating Uniformly Distributed Random Numbers
    • Using the summary() Function
    • Math Functions Used in Data Analysis
    • Examples of Using Math Functions
    • Correlations
    • Correlation Example
    • Testing Correlation Coefficient for Significance
    • The cor.test() Function
    • The cor.test() Example
    • Regression Analysis
    • Types of Regression
    • Simple Linear Regression Model
    • Least-Squares Method (LSM)
    • LSM Assumptions
    • Fitting Linear Regression Models in R
    • Example of Using lm()
    • Confidence Intervals for Model Parameters
    • Example of Using lm() with a Data Frame
    • Regression Models in Excel
    • Multiple Regression Analysis
  • Data Manipulation and Transformation in R
    • Applying Functions to Matrices and Data Frames
    • The apply() Function
    • Using apply()
    • Using apply() with a User-Defined Function
    • apply() Variants
    • Using tapply()
    • Adding a Column to a Data Frame
    • Dropping A Column in a Data Frame
    • The attach() and detach() Functions
    • Sampling
    • Using sample() for Generating Labels
    • Set Operations
    • Example of Using Set Operations
    • The dplyr Package
    • Object Masking (Shadowing) Considerations
    • Getting More Information on dplyr in RStudio
    • The search() or searchpaths() Functions
    • Handling Large Data Sets in R with the data.table Package
    • The fread() and fwrite() functions from the data.table Package
    • Using the Data Table Structure
  • Data Visualization in R
    • Data Visualization
    • Data Visualization in R
    • The ggplot2 Data Visualization Package
    • Creating Bar Plots in R
    • Creating Horizontal Bar Plots
    • Using barplot() with Matrices
    • Using barplot() with Matrices Example
    • Customizing Plots
    • Histograms in R
    • Building Histograms with hist()
    • Example of using hist()
    • Pie Charts in R
    • Examples of using pie()
    • Generic X-Y Plotting
    • Examples of the plot() function
    • Dot Plots in R
    • Saving Your Work
    • Supported Export Options
    • Plots in RStudio
    • Saving a Plot as an Image
  • Using R Efficiently
    • Object Memory Allocation Considerations
    • Garbage Collection
    • Finding Out About Loaded Packages
    • Using the conflicts() Function
    • Getting Information About the Object Source Package with the pryr Package
    • Using the where() Function from the pryr Package
    • Timing Your Code
    • Timing Your Code with system.time()
    • Timing Your Code with System.time()
    • Sleeping a Program
    • Handling Large Data Sets in R with the data.table Package
    • Passing System-Level Parameters to R
  • Lab Exercises
    • Lab 1. Getting Started with R
    • Lab 2. Learning the R Type System and Structures
    • Lab 3. Read and Write Operations in R
    • Lab 4. Data Import and Export in R
    • Lab 5. k-Nearest Neighbors Algorithm
    • Lab 6. Creating Your Own Statistical Functions
    • Lab 7. Simple Linear Regression
    • Lab 8. Monte-Carlo Simulation (Method)
    • Lab 9. Data Processing with R
    • Lab 10. Using R Graphics Package
    • Lab 11. Using R Efficiently