Logistic regression
😄 fun fact: actually better for classification instead of regression problems
Assume there is a plane in Rd parameterized by W
P(Y=1∣x,W)P(Y=0∣x,W)=ϕ(WTx)=1−ϕ(WTx)∵ϕ(a)=1+e−a1
maximum likelihood
1−ϕ(a)=ϕ(−a)
WML=Wargmax∏P(xi,yi∣W)=Wargmax∏P(W)P(xi,yi,W)=Wargmax∏P(yi∣xi,W)P(xi)=Wargmax[∏P(xi)][∏P(yi∣xi,W)]=Wargmaxi=1∑nlog(τ(yiWTxi))
maximize the following:
i=1∑n(yilogpi+(1−yi)log(1−pi))
softmax
softmax(y)i=∑ieyieyi
where y∈Rk