FA590 Statistical Machine Learning in Finance

Course Catalog Description

Introduction

This course offers a comprehensive overview of both classical and modern statistical learning methods with a strong emphasis on their application in finance. The classical approaches covered include linear regression, logistic regression, and k-Nearest Neighbors (k-NN), providing foundational tools for prediction and classification. The course will also explore modern methods such as decision trees, ensemble techniques (boosting and bagging), support vector machines,and neural networks, as well as advanced topics like model assessment, feature selection, and unsupervised learning techniques like clustering. Throughout the course, students will apply these methods to real-world financial datasets, gaining hands-on experience in statistical learning as it pertains to asset pricing, portfolio optimization, and other key areas in finance.

Prerequisites - A quantitative background is required, including knowledge of probability, statistics, and calculus. Prior programming experience in languages such as Python or R is highly recommended. Students without programming experience may still enroll in the course but should anticipate challenges in completing assignments and the project. Such students are expected to be highly motivated and willing to acquire the necessary programming skills as the course progresses.


Campus Fall Spring Summer
On Campus X X
Web Campus X X X

Instructors

Professor Email Office
Prof. Zonghao Yang zyang99@stevens.edu 515F, Altorfer Hall

More Information

Course Outcomes

The objective of this course is to equip students with a comprehensive understanding of various statistical learning models and their applications in finance. Through a combination of lectures,coding assignments, and applied projects using real-world financial data, students will gain both theoretical knowledge and practical experience in implementing these models.
After successful completion of this course, students will be able to:

  1. Distinguish between supervised vs. unsupervised learning and parametric vs. nonparametric models.
  2. Identify key statistical learning techniques, including classical and modern approaches
  3. Apply these techniques to analyze financial datasets, focusing on asset pricing, portfolio optimization, and market prediction.
  4. Evaluate and select the most suitable model for financial data analysis based on performance

Course Resources

Textbook

  • Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, and Jonathan Taylor. An Introduction to Statistical Learning with Applications in Python. Springer, 2023

    This book is available free from the author's web site at " https://www.statlearning.com/

Other Readings

  • Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edition. Springer, 2013
  • John C. Hull. Machine Learning in Business. Third Edition. 2021
  • Marcos Lopez de Prado. Advances in Financial Machine Learning. First Edition. Wiley. 2018
  • Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press. 2016

Grading

Grading Policies

Grades will be based on

Assignments - 30%

Final Project - 30%

Quizes - 30%

Engagement and Critical Thinking - 10%


Lecture Outline

Week Date Topic Reading Notes
Week 1 Jan 21 Introduction to Statistical Learning Chapter 1 and Chapter 2, ISL
Week 2 Jan 28 Introduction to Finance Assignment 1
Week 3 Feb 4 Regression Models: Linear Regression Chapter 3, ISL
Week 4 Feb 11 Regression Models: Applications Chapter 6, ISL Quiz 1: Linear Regression
Week 5 Feb 18 NO CLASS Monday Class
Schedule after
President’s Day
Week 6 Feb 25 Classification Models: Logistic Regression Chapter 4, ISL 1. Assignment 1 Due
2. Assignment 2
Week 7 Mar 4 Model Assessment and Selection Chapter 5, ISL Quiz 2: Logistic Regression
Week 8 Mar 11 Classification Models: Support Vector
Machines
Chapter 9, ISL 1. Quiz 3: Model Assessment and Selection
2. Assignment 2 Due 3. Assignment 3
Week 9 Mar 18 NO CLASS Spring Recess
Week 10 Mar 25 Regression and Classification Trees Chapter 8, ISL Quiz 4: SVM
Week 11 Apr 1 Bagging, Random Forests, Adaboost Chapter 10, ISL Assignment 3 Due
Week 12 Apr 8 Neural Networks Quiz 5: Decision Tree
Week 13 Apr 15 Early Default and Return Prediction using Neural Networks
Week 14 Apr 22 Unsupervised Learning Chapter 12, ISL 1. Quiz 6: Neural Networks
2. Deadline for Final Project
Week 15 Apr 29 Statistical Learning for Unstructured Data 1. Quiz 7: Unsupervised Learning
2. Announce Final Project Awards
Week 16 May 6 Project Presentation and Review Selected Projects