It is based on using Boost to learn a classification model (ie an eye, a face, etc) based on a training set of feature vectors extracted from running different configs of Haar-like kernels over it. I know that sentence was dense with nomenclature but not sure how much you know about machine learning and/or computer vision. The idea is to take an image of an eye (or non-eye), break it down into a mathematical vector (composed of filter responses from the Haar-like kernels) and then "picking out" the most important components with some sort of machine learning algorithm (ie regression, Boost, SVM, etc).
Jack