homepage

courses

previous -- view my teaching portfolio here

 

 

 

Daemen University

Machine Learning -- CSC 401

 

Course Description (from the College): This course introduces several fundamental concepts and methods for machine learning. It will introduce a range of algorithms and techniques together with their applications, in addition to analyzing and handling large data sets. Several software libraries and ata sets publicly available will be used to illustrate the application of these algorithms.  (3 hours)
Prerequisite: MTH 325

 

Syllabus/Syllabus -- in Word format
Final Project Directions

Homeworks
 

Important Dates

 

see the syllabus for a more detailed calendar
 

Email List

can be gotten through Blackboard

Announcements:

Math Adjunct Office
My voicemail
My Daemen email: bmccall@daemen.edu
Office hours: see syllabus


 

Homeworks

 

Weekly Code Assignments

WCA #1
WCA #2
WCA #3
WCA #4
WCA #5
WCA #6
WCA #7
WCA #8
WCA #9
WCA #10
WCA #11
WCA #12
WCA #13

Package Comparison Assignments

PC #1
PC #2
PC #3
PC #4
PC #5
PC #6

Modeling Projects Directions

MP #1
MP #2
MP #3
MP #4
MP #5
MP #6

Final Project Suggestions (or select your own)

Notes

9/3 N R 9/5 N R
9/10 N R 9/12 N R
9/17 N R 9/19 N R
9/24 N R 9/26 N R
10/1 N R 10/3 N R
10/8 N R 10/10 N R
  10/17 N R
10/22 N R 10/24 N R
10/29 N R 10/31 N R
  11/7 N R
11/12 N R 11/14 N R
11/19 N R 11/21 N R
11/26 N R  
12/3 N R  
Datasets Examples

Datasets

Categorical Data from Kaggle
Sources

Code Examples


included in lecture notes

 

R Tutorials:

Bar Graphs (base R)
Boxplot (base R)
Dotplots (base R)
Histogram (base R)
Normal Distribution shaded between two values (base R)
Normal Probability Plots (base R)
Scatterplots with Trendlines and Residual Graphs (base R)

 

Handouts:

Answer Keys


Resources

Tutorials on Advanced Stats and Machine Learning With R
Applied Statistics with R (textbook)
Intermediate Statistics with R (textbook)
Probability and Statistics for Engineering and the Sciences, Jay. L. Devore, 8th ed. (textbook)
Introductory Statistics (textbook)
Practical Statistics for Data Scientists (textbook)
Online Statistics Book (textbook)
A Little Book of Time Series Analysis for R (textbook)
A Course in Time Series Analysis (textbook/notes)
Introduction to Probability for Data Science (textbook)
Friedman's ANOVA Test
How to Perform Friedman's Test in R
Kendall's Tau
Calculating Kendall's Rank Correlation in R
Introduction to Bootstrapping (Statistics by Jim)
Boostrapping in R
Tutorial on Permutation Tests in R
How to use Permuation Tests
Understanding AUC-ROC Curves
Some Packages for ROC Curves
Time Series Analysis in R
Getting Started with Multiple Imputation in R
Basic Statistics Using R
Learning Statistics with R
Statistics with R (Table of Contents)
Stats and R
Intro to Hypothesis Testing in R
R-Tutorial: An R Introduction to Statistics
Tidy Modeling with R
R Cheatsheets
Free Web Books for Learning (Statistics) with R
Easier ggplot with ggcharts
R Color Brewer's Palettes
Markdown Cheat Sheet
Smoothing
Cubic and Smoothing Splines in R
B-Spline Basis for Polynomial Splines
Intro to Stats in R
R Code Examples
Distance Metrics
Scaling Methods
Comparing Distance Metrics
Optimization Algorithms in Machine Learning
Matrices in R
Covariance Matrix
Normal Equation
Residual Analysis
Regression Metrics
Penalty Functions in Regression
Optimizing Penalized Regression in R
Penalized Regression in R
Gaussian Process Regression
Variable Selection Methods
Ensemble Methods
Ensemble Models with R
Curves and Splines
Basis Expansion and Splines
Distance Metrics in KNN
Tie Breaking Methods in KNN
KNN for classification or regression
Decision Tree Splitting Methods
Splitting Criteria
Decision Tree Ensemble Methods
Agglomerative Clustering
Principal Component Analysis in R
Principal Component Regression
Factor Analysis
K-Means
Spectral Clustering
Natural Language Processing
Simple Neural Network
Time Series in R

R Project
R Studio
Anaconda
Using R with Anaconda

 

Links!

PDF Graph Paper
Bad Graphs (Convention Speeches)
Visualizing Data Badly: 8 Examples
Correlation is not Causation: orginal article / handout
Presidents by State
TI-Connect Software
How much people lie on surveys
On the Hazards of Significance Testing
Exploring Correlation and Regression
Central Limit Theorem: with Bunnies and Dragons
SOCR: Statistics Online Computational Resources
How Many Ways Can You Arrange a Deck of Cards?
Free Online Math Courses
Confidence Interval for Rho
Free Courses from Coursera

Coding
R Tutorials
MTH 324/MTH 325

 

 

 
(c) 2013, 2007, 2004 by Betsy McCall, all rights reserved
To contact the webmistress, email betsy@pewtergallery.com
Last updated: 2022 May 8th