3.3. The Perceptron Algorithm

Our major concern now is to compute the unknown parameters wi, i = 0,…, l, defining the decision hyperplane. In this section, we assume that the two classes ω1, ω2 are linearly separable. In other words, we assume that there exists a hyperplane, defined by w*Tx = 0, such that(3.5)The formulation above also covers the case of a hyperplane not crossing the origin, that is, w*Tx + w*0 = 0, since this can be brought into the previous formulation by defining the extended (l + 1)-dimensional vectors x′ ≡ [xT, 1]T, w′ ≡ [w*T, w*0]T. Then w*Tx + w*0 = w′Tx′.

We will approach the problem as a typical optimization task ( ...

Get Pattern Recognition, 4th Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.