C S 4 8 9
Drag

CS489: Generative AI & Large Language Models

Mon-Wed: 2pm to 4pm

Instructors Image
About CS489 Course

Introduction to Generative AI & Large Language Models

In CS489, we dive deep into the world of Generative AI, exploring the intricacies of Large Language Models and their applications.

  • Key Technologies: Transformers & GANs

  • Exploration of Diffusion Models

  • Emphasis on NLP Techniques & Word Embeddings

  • Hands-on Experience with Transformers

  • Generative AI in Image Generation

  • Study of Models like OpenAI's DALLE-2

CS489 is dedicated to providing students with a robust understanding of Generative AI, preparing them for advanced studies and industry challenges.

CS489 Course Details

Introduction to Generative AI and Large Language Models

The CS489 course delves into the world of Generative AI, focusing on Large Language Models like GPT and their applications.

About the Course

Introduction to Generative AI and Large Language Models

The course "Introduction to Generative AI and Large Language Models" delves into Generative AI, focusing on Large Language Models like GPT. It covers key technologies such as Transformers and GANs, and explores diffusion models for data generation. The curriculum emphasizes NLP techniques, particularly word embeddings, and offers hands-on experience in building and deploying models using Transformers. The course also investigates Generative AI in image generation, highlighting models like OpenAI's DALLE-2, and interlinks GANs, Transformers, NLP, and computer vision for a comprehensive understanding.

Textbook Image
Recommended Textbooks

Essential Readings for the Course

Students are advised to refer to the following textbooks for a comprehensive understanding of the course content.

  • 01

    Transformers for Natural Language Processing
    Second Edition

    Build, train, and fine-tune deep neural network architectures for NLP with Python, Hugging Face, and OpenAI's GPT-3, ChatGPT, and GPT-4

    Author: Denis Rothman

  • 02

    Deep Learning

    By Ian Goodfellow, Yoshua Bengio, and Aaron Courville. A foundational text from MIT Press.

Course Prerequisites

What You Need to Know Before Enrolling

Students are expected to have a foundational understanding of certain topics before enrolling in CS489.

  • 01

    College Prerequisite

    CS330: Operating Systems course

  • 02

    Python Skills

    Python knowledge is essential for all assignments.

  • 03

    Math Knowledge

    Familiarity with college calculus, linear algebra, and matrix operations.

  • 04

    Statistics Basics

    Understanding of probabilities, distributions, and standard deviation.

  • 05

    Machine Learning Foundations

    Knowledge in cost functions, derivatives, and gradient descent is beneficial.

Course Learning Objectives

At the end of this course, the student will be able to:

  • 01

    Knowledge and Understanding

    CLO1: Understand principles of Generative AI and Large Language Models

  • 02

    Skills

    CLO2: Build Generative AI Models for NLP and Computer Vision

    CLO3: Apply Generative AI techniques to real-world applications

  • 03

    Value

    CLO4: Acknowledge ethical and Responsible AI considerations in AI applications

Instructors & Contacts

Meet the Course Team

The CS489 course is led by a team of experienced instructors and assistants dedicated to providing a comprehensive learning experience.


  • 01

    Lead Instructor



  • 02

    Co-Instructors



  • 03

    Assistants




Schedule

Schedule


ID Lectures Hands-on Sessions Readings
NATURAL LANGUAGE PROCESSING AND LARGE LANGUAGE MODELS
Week 1. Introduction
1 Introduction to Generative AI and Large Language Models (ChatGPT) [Slides|Lecture Notes] Hands-on: Introduction to Python Programming
[Notebook| Video]
Hands-on: Python Built-in Data Structures
[ Notebook| Video]
Hands-on: Introduction to Numpy Arrays
[Notebook| Video]
Hands-on: Introduction to Pandas Dataframes
[ Overview| Series| Dataframes| Data| Stats Video]
Gozalo-Brizuela et al. A survey of Generative AI Applications, arXiv:2306.02781, 2023
2 Generative AI Life Cycle Hands-on: Introduction to OpenAI Hands-on: Introduction to HuggingFace
[Slides|Notebook]
Week 2. Practical Large Language Models
3 Prompt Engineering Hands-on: Prompt Engineering Basics
4 Instruction Fine-Tuning of Large Language Models Hands-on: Develop a fine-tuned large language model. https://crfm.stanford.edu/2023/03/13/alpaca.html Quiz 1 on LLMs
Week 3. Conceptual Text Representation
5 Word Tokenization OpenaAI Python API Assignment 1: Fine Tuning
6 Word Embedding Introduction to LangChain
Week 4. LLM Architectural Design
7 Transformers for Sequence to Sequence Models Hands-on: Implementing Word Tokenization
8 Transformers Architecture for Language Language Models: LLAMA, GPT, PaLM, Falcon, Assignment 3
Week 5. LLM Training and its Challenges
9 LLM Pre-Training and Scaling Laws Quiz 4
10 Parameter Efficient Fine-Tuning (PEFT): LoRa and Software Prompts Assignment 4
Week 6. LLM Alignment to Human Preferences
11 Introduction to Reinforcement Learning
12 Reinforcement Learning with Human Feedback
Week 7. LLM Deployment
12 LLM Optimization and Deployment
14 Building Full-Stack LLM applications.
GENERATIVE AI FOR COMPUTER VISION APPLICATIONS
Week 8. Introduction to Computer Vision
15 Computer Vision Concepts
16 Deep Learning Principles for Computer Vision
Week 9. Extensive Hands-on Week
17
18
Week 10. GANs
19 Generative Models I: CNN, GANs Architecture and Applications
20 Vision Transformer for Generative AI Hands-on: Building a GAN
Week 11. Diffusion Model
21 Generative Models II: Diffusion Models Assignment 5
22 Vision Transformer for Generative AI
Week 12. Generative AI Risks and Ethical Consideration
23 Ethical and Social Considerations
24 Responsible AI
Week 13. Advanced Topics
25 Research Trends in Generative AI (e.g., Explainable AI, General Artificial Intelligence)
26 Practical Implementation: Project Development Hands-on: Project Development Quiz 11
Week 14. Project Presentations
27 Project Presentations
28 Project Presentations
Week 15.
29 Final Exam Final Exam

© Robotics and Internet-of-Things Lab.