Foundation Models & the Future of Machine Learning

CS 839, Fall 2023
Department of Computer Sciences
University of Wisconsin–Madison


Logistics Info

Course Description

Large pretrained machine learning models, also known as foundation models, have taken the world by storm. Models like ChatGPT, Claude, and Stable Diffusion have astonishing abilities to answer questions, converse with users, and generate sophisticated art, all without any additional training. This course covers all aspects of these fascinating models. We will learn how such models are built, including data acquisition, selection of model architectures, and pretraining approaches. Next, a significant focus is on how to use and deploy foundation models, including prompting strategies, providing in-context examples, fine-tuning, integrating into existing data science pipelines, and more. We then discuss recent advances that improve foundation models, such as large-scale human feedback. Finally, we cover the potential societal impacts of these models.

Prerequisites

Familiarity with basic machine learning is assumed. We will review advanced material as needed, but experience and maturity are recommended.

Announcements

  • The Piazza code will be emailed shortly.
  • Welcome! Please see the syllabus page for details about the class and the schedule page to get a sense of the material covered in this class.