机器学习基石 - Theory of Generalization

机器学习基石上 (Machine Learning Foundations)—Mathematical Foundations
Hsuan-Tien Lin, 林轩田,副教授 (Associate Professor),资讯工程学系 (Computer Science and Information Engineering)

Theory of Generalization

Restriction of Break Points

  • growth function mH(N): max number of dichotomies
  • 漏出一线曙光的点 break point
  • break point k restricts maximum possible mH(N) a lot for N>k

Bounding Function: Basic Cases

  • B(N,k): maximum possible mH(N) when break point = k
    机器学习基石 - Theory of Generalization
  • 表格
    机器学习基石 - Theory of Generalization

Bounding Function: Inductive Cases

  • B(4,3) 的估计
    • 机器学习基石 - Theory of Generalization
    • 机器学习基石 - Theory of Generalization
  • Putting It All Together
    机器学习基石 - Theory of Generalization
  • B(N,k)i=0k1(Ni)
    • 数学归纳法
    • CN1 i+CN1 i+1=CN i+1
    • actually can be =
    • B(N,k)2B(N1,k1)+(B(N1,k)B(N1,k1))
  • can bound mH(N) by only one break point

A Pictorial Proof

机器学习基石 - Theory of Generalization

  • Step 1: Replace Eout by Ein 
    机器学习基石 - Theory of Generalization
  • Step 2: Decompose H by Kind
    机器学习基石 - Theory of Generalization
  • Step 3: Use Hoeffding without Replacement
    机器学习基石 - Theory of Generalization
  • Vapnik-Chervonenkis (VC) bound
    机器学习基石 - Theory of Generalization