2026년도 1학기 특론: 현대 기계학습의 실전적 이론 (CSED703Q-01) 강의계획서

1. 수업정보

학수번호 CSED703Q 분반 01 학점 3.00
이수구분 전공선택 강좌유형 강의실 강좌 선수과목
포스테키안 핵심역량
강의시간 화, 목 / 14:00 ~ 15:15 / 제2공학관 강의실 [107호] 성적취득 구분 G

2. 강의교수 정보

전광성 이름 전광성 학과(전공) 인공지능대학원
이메일 주소 kwangsungjun@postech.ac.kr Homepage https://kwangsungjun.github.io
연구실 전화
Office Hours

3. 강의목표

This course explores the theoretical foundations underlying modern machine learning systems, with particular emphasis on large language models. We begin by reviewing key concepts in statistical learning theory, including risk minimization, generalization bounds, and concentration inequalities. Building on this foundation, we examine theoretical perspectives on alignment via preference learning in LLMs, test-time scaling, reinforcement learning with verifiable rewards, and the role of the base model in exploration. Throughout, the course emphasizes mathematical clarity, conceptual insight, and the relationship between theory and empirical practice, enabling students to understand not only how modern systems work, but why they work.

4. 강의선수/수강필수사항

Machine Learning

5. 성적평가

중간고사 기말고사 출석 과제 프로젝트 발표/토론 실험/실습 퀴즈 기타
30 10 15 30 15 0 100
비고
Class Participation (10%), Quiz (15%), Paper critique (15%), Presentation (30%), Midterm exam (30%).
Be aware that these weights are subject to changes.

6. 강의교재

도서명 저자명 출판사 출판년도 ISBN
Shalev-Shwartz, Shai, and Shai Ben-David. Understanding machine learning: From theory to algorithms. Cambridge university press, 2014. (available online for free) 0000

7. 참고문헌 및 자료

8. 강의진도계획

1. Introduction to learning theory
2. Backgrounds: concentration inequalities
3. Backgrounds: supervised learning
4. Backgrounds: convexity
5. Interactive machine learning
6. LLM alignment: RLHF and DPO
7. Theory of RLHF and DPO
8. Unified theory
9. Test-time scaling
10. Theory of imperfect process verifiers
11. Representation-Based Exploration for Language Models
12. The Computational Role of the Base Model in Exploration
13. Student presentation
14. Student presentation
15. Student presentation
16. Student presentation
(This class is experimental; the schedule and content listed above are subject to changes)

9. 수업운영

- Lecture Type: Lecture, Student Presentation
- Academic Integrity: Penalty for cheating – Removal from course with failing (F) grade
- This course consists of two main parts: lectures and student presentations. The lecture part provides in-depth knowledge and technologies required to analyze modern machine learning algorithms and develop novel ones. In the presentation part, enrolled students present latest articles published in top system conferences, such as ICML, NeurIPS, ICLR, COLT, AISTATS, UAI, AAAI, etc. A list of recommended papers for the presentation will be given, but students are free to choose papers outside the list. For each student presentation, there will be 2-3 other students assigned to write a paper critique about the paper being presented, and the presenter must address questions raised by these critiques.

10. 학습법 소개 및 기타사항

11. 장애학생에 대한 학습지원 사항

- 수강 관련: 문자 통역(청각), 교과목 보조(발달), 노트필기(전 유형) 등

- 시험 관련: 시험시간 연장(필요시 전 유형), 시험지 확대 복사(시각) 등

- 기타 추가 요청사항 발생 시 장애학생지원센터(279-2434)로 요청