Finding the best W: Optimization
Learning rate: the step size used for each gradient update
SGD: at every iteration, we sample a small set of training examples, called a minibatch, and use it to estimate the gradient
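A minimal sketch of minibatch SGD may help make the note above concrete. It assumes a linear model with a mean squared error loss, which is an illustrative choice and not something these notes specify. The function name `sgd_linear_regression`, the hyperparameter values, and the synthetic data are all hypothetical.

```python
import numpy as np

def sgd_linear_regression(X, y, learning_rate=0.1, batch_size=32,
                          num_iters=500, seed=0):
    """Fit weights W for the model y ~ X @ W with minibatch SGD.

    Assumption: mean squared error loss on a linear model; the notes
    do not name a specific loss, so this is only an illustration.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = np.zeros(d)
    for _ in range(num_iters):
        # Sample a minibatch: a small random subset of the training set.
        idx = rng.choice(n, size=batch_size, replace=False)
        X_batch, y_batch = X[idx], y[idx]
        # Gradient of the mean squared error, estimated on the minibatch.
        residual = X_batch @ W - y_batch
        grad = 2.0 * X_batch.T @ residual / batch_size
        # Step against the gradient; the learning rate sets the step size.
        W -= learning_rate * grad
    return W

# Synthetic, noise-free data generated from known weights (hypothetical).
data_rng = np.random.default_rng(1)
X = data_rng.normal(size=(1000, 3))
true_W = np.array([2.0, -1.0, 0.5])
y = X @ true_W

W = sgd_linear_regression(X, y)
```

Each iteration touches only `batch_size` examples rather than the full training set, so a step is cheap, at the cost of a noisier gradient estimate.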
Image Features:
To train a classifier, we compute several feature representations of the image, each capturing a different quantity related to its appearance, then concatenate these feature vectors and feed the result to the classifier.
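The feature pipeline described above could be sketched as follows. The two features used here (a per-channel color histogram and simple gradient magnitude statistics) are illustrative assumptions, since the notes do not say which features to compute. The function names and the random test image are likewise hypothetical.

```python
import numpy as np

def color_histogram(image, bins=8):
    """Normalized histogram of pixel values, computed per color channel."""
    hists = [
        np.histogram(image[..., c], bins=bins, range=(0, 256))[0]
        for c in range(3)
    ]
    hist = np.concatenate(hists).astype(np.float64)
    return hist / hist.sum()

def gradient_magnitude_stats(image):
    """Mean and standard deviation of the grayscale gradient magnitude."""
    gray = image.astype(np.float64).mean(axis=2)
    gy, gx = np.gradient(gray)  # derivatives along rows, then columns
    magnitude = np.sqrt(gx ** 2 + gy ** 2)
    return np.array([magnitude.mean(), magnitude.std()])

def extract_features(image):
    """Concatenate the individual feature vectors into one vector."""
    return np.concatenate([
        color_histogram(image),
        gradient_magnitude_stats(image),
    ])

# Hypothetical input: a random 32x32 RGB image with uint8 pixel values.
image_rng = np.random.default_rng(0)
image = image_rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)

features = extract_features(image)  # 3 * 8 histogram bins + 2 gradient stats
```

The resulting `features` vector, rather than the raw pixels, is what would be passed to the classifier.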