FA590 Statistical Machine Learning in Finance
Course Catalog Description
Introduction
This course offers a comprehensive overview of both classical and modern statistical learning methods with a strong emphasis on their application in finance. The classical approaches covered include linear regression, logistic regression, and k-Nearest Neighbors (k-NN), providing foundational tools for prediction and classification. The course will also explore modern methods such as decision trees, ensemble techniques (boosting and bagging), support vector machines,and neural networks, as well as advanced topics like model assessment, feature selection, and unsupervised learning techniques like clustering. Throughout the course, students will apply these methods to real-world financial datasets, gaining hands-on experience in statistical learning as it pertains to asset pricing, portfolio optimization, and other key areas in finance.
Prerequisites - A quantitative background is required, including knowledge of probability, statistics, and calculus. Prior programming experience in languages such as Python or R is highly recommended. Students without programming experience may still enroll in the course but should anticipate challenges in completing assignments and the project. Such students are expected to be highly motivated and willing to acquire the necessary programming skills as the course progresses.
Campus | Fall | Spring | Summer |
---|---|---|---|
On Campus | X | X | |
Web Campus | X | X | X |
Instructors
Professor | Office | |
---|---|---|
Prof. Zonghao Yang | zyang99@stevens.edu | 515F, Altorfer Hall |
More Information
Course Outcomes
The objective of this course is to equip students with a comprehensive understanding of various statistical learning models and their applications in finance. Through a combination of lectures,coding assignments, and applied projects using real-world financial data, students will gain both
theoretical knowledge and practical experience in implementing these models.
After successful completion of this course, students will be able to:
- Distinguish between supervised vs. unsupervised learning and parametric vs. nonparametric models.
- Identify key statistical learning techniques, including classical and modern approaches
- Apply these techniques to analyze financial datasets, focusing on asset pricing, portfolio optimization, and market prediction.
- Evaluate and select the most suitable model for financial data analysis based on performance
Course Resources
Textbook
-
Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, and Jonathan Taylor. An Introduction to Statistical Learning with Applications in Python. Springer, 2023
This book is available free from the author's web site at " https://www.statlearning.com/
Other Readings
- Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edition. Springer, 2013
- John C. Hull. Machine Learning in Business. Third Edition. 2021
- Marcos Lopez de Prado. Advances in Financial Machine Learning. First Edition. Wiley. 2018
- Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press. 2016
Grading
Grading Policies
Grades will be based on
Assignments - 30%
Final Project - 30%
Quizes - 30%
Engagement and Critical Thinking - 10%
Lecture Outline
Week | Date | Topic | Reading | Notes |
---|---|---|---|---|
Week 1 | Jan 21 | Introduction to Statistical Learning | Chapter 1 and Chapter 2, ISL | |
Week 2 | Jan 28 | Introduction to Finance | Assignment 1 | |
Week 3 | Feb 4 | Regression Models: Linear Regression | Chapter 3, ISL | |
Week 4 | Feb 11 | Regression Models: Applications | Chapter 6, ISL | Quiz 1: Linear Regression |
Week 5 | Feb 18 | NO CLASS | Monday Class Schedule after President’s Day |
|
Week 6 | Feb 25 | Classification Models: Logistic Regression | Chapter 4, ISL | 1. Assignment 1
Due 2. Assignment 2 |
Week 7 | Mar 4 | Model Assessment and Selection | Chapter 5, ISL | Quiz 2: Logistic Regression |
Week 8 | Mar 11 | Classification Models: Support Vector Machines |
Chapter 9, ISL | 1. Quiz 3: Model
Assessment and
Selection 2. Assignment 2 Due 3. Assignment 3 |
Week 9 | Mar 18 | NO CLASS | Spring Recess | |
Week 10 | Mar 25 | Regression and Classification Trees | Chapter 8, ISL | Quiz 4: SVM |
Week 11 | Apr 1 | Bagging, Random Forests, Adaboost | Chapter 10, ISL | Assignment 3 Due |
Week 12 | Apr 8 | Neural Networks | Quiz 5: Decision Tree | |
Week 13 | Apr 15 | Early Default and Return Prediction using Neural Networks | ||
Week 14 | Apr 22 | Unsupervised Learning | Chapter 12, ISL | 1. Quiz 6: Neural
Networks 2. Deadline for Final Project |
Week 15 | Apr 29 | Statistical Learning for Unstructured Data | 1. Quiz 7:
Unsupervised
Learning 2. Announce Final Project Awards |
|
Week 16 | May 6 | Project Presentation and Review | Selected Projects |