Classical Approaches for Image Classification

Although the SVM and its kernel extension are beautiful convex optimization frameworks devoid of spurious local minimizers, there are fundamental challenges in using these methods for image classification. In particular, the dimension of the ambient space $\mathcal{X}$ should not be too large for the SVM, because the optimization procedure is computationally expensive. Accordingly, one of the essential steps in using the SVM framework is feature engineering, which pre-processes the input images to obtain a significantly lower-dimensional vector $\boldsymbol{x} \in \mathcal{X}$ that captures the essential information of the input images. For example, a classical pipeline for the image classification task can be summarized as follows (see Fig. 2.7):

• Process the data set to extract hand-crafted features based on some knowledge of imaging physics, geometry, and other analytic tools,
• or extract features by feeding the data into a standard feature extractor such as SIFT (Scale-Invariant Feature Transform) [12] or SURF (Speeded-Up Robust Features) [13].
• Choose the kernels based on your domain expertise.
• Put the training data, composed of hand-crafted features and labels, into a kernel SVM to learn a classifier.
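
The steps above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the pipeline of Fig. 2.7: the 8×8 striped images are synthetic, the two gradient-based features are a hypothetical stand-in for SIFT or SURF, and a kernel perceptron with an RBF kernel is swapped in for the kernel SVM so that the example needs only the standard library. All function names are made up for this example.

```python
import math
import random

def make_image(label, rng, size=8):
    """Synthetic grayscale image (assumption): label +1 has vertical stripes,
    label -1 has horizontal stripes, both with additive Gaussian noise."""
    return [[((c % 2) if label == 1 else (r % 2)) + rng.gauss(0.0, 0.2)
             for c in range(size)] for r in range(size)]

def extract_features(img):
    """Step 1: hand-crafted features (hypothetical choice): mean absolute
    horizontal and vertical intensity differences, giving a 2-D vector."""
    n = len(img)
    dx = sum(abs(img[r][c + 1] - img[r][c]) for r in range(n) for c in range(n - 1))
    dy = sum(abs(img[r + 1][c] - img[r][c]) for r in range(n - 1) for c in range(n))
    count = n * (n - 1)
    return (dx / count, dy / count)

def rbf_kernel(u, v, gamma=2.0):
    """Step 2: the kernel is chosen by hand; gamma is an arbitrary assumption."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v)))

def train_kernel_perceptron(X, y, epochs=20):
    """Step 3: learn a kernel classifier. A kernel perceptron is used here as a
    simple stand-in for the kernel SVM; it is not a max-margin method."""
    alpha = [0] * len(X)  # alpha[j] counts the mistakes made on sample j
    for _ in range(epochs):
        mistakes = 0
        for i in range(len(X)):
            score = sum(alpha[j] * y[j] * rbf_kernel(X[j], X[i])
                        for j in range(len(X)) if alpha[j])
            if y[i] * score <= 0:  # misclassified (or undecided): update
                alpha[i] += 1
                mistakes += 1
        if mistakes == 0:  # a full pass without errors: stop early
            break
    return alpha

def predict(x, X, y, alpha):
    """Sign of the kernel expansion over the training samples."""
    score = sum(a * yi * rbf_kernel(xi, x) for a, yi, xi in zip(alpha, y, X) if a)
    return 1 if score > 0 else -1

rng = random.Random(0)
train_labels = [1, -1] * 20
train_feats = [extract_features(make_image(lbl, rng)) for lbl in train_labels]
alpha = train_kernel_perceptron(train_feats, train_labels)

test_labels = [1, -1] * 10
test_feats = [extract_features(make_image(lbl, rng)) for lbl in test_labels]
predictions = [predict(x, train_feats, train_labels, alpha) for x in test_feats]
accuracy = sum(p == t for p, t in zip(predictions, test_labels)) / len(test_labels)
print(f"test accuracy: {accuracy:.2f}")
```

Note that the classifier never sees the 64 raw pixels, only the two-dimensional feature vector. This is the sense in which feature engineering keeps the ambient space small, and it also shows that the quality of the result rests almost entirely on the choice of features and kernel.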

Here, the main technical innovations usually come from the feature extraction step, often based on the serendipitous discoveries of lucky graduate students. Moreover, kernel selection also requires domain expertise and was itself the subject of extensive research. We will see later that one of the main innovations of the modern deep learning approach is that hand-crafted feature engineering and kernel design are no longer required, as both are learned automatically from the training data. This simplicity may be one of the main reasons for the success of deep learning, which has led to a deluge of new deep tech companies.

So far we have mainly discussed binary classification problems. More general classifiers beyond the binary case are important in practice: for example, ImageNet has more than $20{,}000$ categories. The extension of the linear classifier to such a setup is important, but will be discussed later.

Linear, Logistic, and Kernel Regression

In machine learning, regression analysis refers to a process for estimating the relationships between dependent variables and independent variables. This method is mainly used for prediction and for finding cause-and-effect relationships between variables. For example, in linear regression, a researcher tries to find the line that best fits the data according to a certain mathematical criterion (see Fig. 3.1a). Another important regression problem is logistic regression. For example, in Fig. 3.1b, the dependent variables are binary properties, such as yes or no for a given question, and the goal is to fit the binary data using continuously varying independent variables. It is easy to see that this problem is closely related to the binary classification problem. The case of Fig. 3.1c poses a somewhat different technical issue from the other two. Here, the data cannot be fitted by a straight line. Moreover, the dependent variable is not binary, but takes continuous values. A better approach is to fit the data with a smoothly varying curve, which leads directly to a nonlinear regression problem.
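
As a rough illustration of the first two cases, the sketch below fits a line by least squares and a logistic curve by gradient descent on the cross-entropy loss. The synthetic data, the learning rate, the number of steps, and the function names are all assumptions made for this example. They are not taken from Fig. 3.1, and the formulations developed later in the text may differ.

```python
import math
import random

def fit_line(xs, ys):
    """Least-squares line y = a*x + b, using the closed-form solution."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = my - a * mx
    return a, b

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ts, lr=0.5, steps=2000):
    """Logistic regression P(t=1 | x) = sigmoid(w*x + c), fitted by plain
    gradient descent on the mean cross-entropy (lr and steps are arbitrary)."""
    w, c = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        residuals = [sigmoid(w * x + c) - t for x, t in zip(xs, ts)]
        grad_w = sum(r * x for r, x in zip(residuals, xs)) / n
        grad_c = sum(residuals) / n
        w -= lr * grad_w
        c -= lr * grad_c
    return w, c

rng = random.Random(0)
xs = [i / 10 for i in range(-20, 21)]  # 41 points in [-2, 2]

# Linear case: continuous targets scattered around the line y = 2x + 1.
ys = [2.0 * x + 1.0 + rng.gauss(0.0, 0.1) for x in xs]
a, b = fit_line(xs, ys)
print(f"fitted line: y = {a:.2f} x + {b:.2f}")

# Logistic case: binary targets that become more likely to be 1 as x grows.
ts = [1 if x + rng.gauss(0.0, 0.5) > 0 else 0 for x in xs]
w, c = fit_logistic(xs, ts)
print(f"P(t=1 | x=2) = {sigmoid(2 * w + c):.2f}, P(t=1 | x=-2) = {sigmoid(-2 * w + c):.2f}")
```

The two fits differ only in the model and the loss: the linear fit predicts the continuous value itself, while the logistic fit predicts the probability of the binary outcome, which is why it is so closely tied to binary classification.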

Although regression analysis is a classical approach that dates back to the least squares method of Legendre in 1805 and Gauss in 1809, it remains a key idea in deep learning approaches, as will be discussed later. Therefore, we revisit the classical regression approach and discuss three specific forms of regression analysis: linear regression, logistic regression, and kernel regression. Later on, this overview will prove useful in understanding modern regression approaches using deep neural networks.

