FE512 Database Engineering



Course Catalog Description

Introduction

This lab course introduces HFSL database resources and the tools required to use them. More generally, it introduces a variety of software tools used to interact with and manipulate data storage systems. The emphasis is on using these tools in practice to perform common data engineering tasks in the financial services industry, in both on-premises and cloud environments. Students will spend a significant amount of time programming in Python, SQL, and other languages.

Campus Fall Spring Summer
On Campus X
Web Campus

Instructors

Professor Email Office
Ionut Florescu ifloresc@stevens.edu Babbio 544
Edward Loeser eloeser@stevens.edu Babbio 109

More Information

Course Description

The course covers basic and advanced knowledge of SQL using PostgreSQL. Python and R will be used alongside SQL to interact with data systems. Additional topics will be introduced.


Course Outcomes

At the end of this course, students will be able to: evaluate a data problem and implement a simple software stack to solve it; connect to databases, call data APIs, and manipulate data using a programming language; use SQL to write DDL and DML statements.
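
As a rough illustration of the DDL and DML distinction named in the outcomes above, the sketch below creates a table (DDL) and then inserts, updates, and deletes rows (DML) from Python. It is not course material: the `trades` table, its columns, the sample rows, and the `run_demo` function are hypothetical, and Python's built-in `sqlite3` module stands in for PostgreSQL so that the example runs without a database server.

```python
import sqlite3


def run_demo():
    """Run a few DDL and DML statements against an in-memory database.

    Assumption: sqlite3 is used only as a stand-in for PostgreSQL, and the
    'trades' table and its rows are made up for illustration.
    """
    conn = sqlite3.connect(":memory:")
    try:
        cur = conn.cursor()

        # DDL: define the structure of a table.
        cur.execute(
            """
            CREATE TABLE trades (
                id INTEGER PRIMARY KEY,
                symbol TEXT NOT NULL,
                quantity INTEGER NOT NULL,
                price REAL NOT NULL
            )
            """
        )

        # DML: insert rows, using placeholders instead of string formatting.
        cur.executemany(
            "INSERT INTO trades (symbol, quantity, price) VALUES (?, ?, ?)",
            [("AAPL", 100, 190.0), ("MSFT", 50, 410.0), ("AAPL", 25, 200.0)],
        )

        # DML: modify and remove existing rows.
        cur.execute(
            "UPDATE trades SET quantity = quantity + 10 WHERE symbol = ?",
            ("MSFT",),
        )
        cur.execute("DELETE FROM trades WHERE quantity < ?", (50,))
        conn.commit()

        # Query: total quantity per symbol after the changes.
        cur.execute(
            "SELECT symbol, SUM(quantity) FROM trades "
            "GROUP BY symbol ORDER BY symbol"
        )
        return cur.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for symbol, total in run_demo():
        print(symbol, total)
```

With a PostgreSQL server the SQL would look much the same, but the connection would go through a PostgreSQL driver rather than `sqlite3`, and placeholder syntax and column types can differ between drivers and databases.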

The objective of the course is for students to:
• build fundamental skills in writing programs to manipulate data and interact with data APIs
• learn to write more complex, data-intensive programs
• understand basic database concepts
• become comfortable researching, evaluating, and implementing open-source software tools
• gain a basic understanding of the large and rapidly changing data storage technology landscape


Course Resources

Prerequisites and Textbooks

There are no formal course prerequisites, but students will need to be comfortable with Python, SQL, and text-based user interfaces. There is no required textbook. The following are reference books for the course:
• Designing Data-Intensive Applications, Martin Kleppmann
• Options, Futures, and Other Derivatives, John C. Hull
• Effective awk Programming, Arnold Robbins
• SQL in 10 Minutes, Ben Forta
• PostgreSQL Docs, The PostgreSQL Global Development Group