w02_Fundeamental_of_ML_정보이론

Learn the most classical methods of machine learning • Rule based approach • Classical statistics approach • Information theory appraoch

Rule based machine learning • How to find the specialized and the generalized rules • Why the rules are easily broken

Decision Tree • How to create a decision tree given a training dataset • Why the tree becomes a weak learner with a new dataset

Linear Regression • How to infer a parameter set from a training dataset • Why the feature engineering has its limit

1. Rule based machine learning

통계 기법 활용 학습 방법

문제점 : 현재 데이터를 가지고는 잘 판단 하지만, 미래 데이터에 대하여서는 보장 할수 없음

The training dataset will not be a perfect sample of the

real world: Noise, Inconsistencies

이러한 한계로 선택트리가 실세계에서는 잘 사용 안됨

Random Variable이 얼마나 불확실성이 높은지/낮은지 평가 하는 지표
Higher entropy means more uncertainty
공식 :
- $\sum_x$ 의 x는 동전던지기는 F,T/ 주사위는 1~6 을 의미, Discrete한 경우
- 만일 Continuos한 경우는 $\sum \rightarrow \int$ 적분으로 변환하여 처리