文章作者：激萌の小宅

促销：￥0

价格：￥0

配送方式：购买后立即生效（如购买异常，请联系站长）

付款之后一定要等待自动跳转结束，否则购买可能会失败

0 天

有效期
0

总销量
0

累计评价

详细说明

<h1 style="font-size: 32px; font-weight: bold; border-bottom: 2px solid rgb(204, 204, 204); padding: 0px 4px 0px 0px; text-align: left; margin: 0px 0px 10px;">SVM手写数字识别（使用SVM进行手写数据OCR）</h1>下面这段代码是 <a href="/api/system/download?file=OpenCV-Python-Tutorial-%E4%B8%AD%E6%96%87%E7%89%88.pdf" target="_blank" title="OpenCV-Python-Tutorial-中文版.pdf （P273）">OpenCV-Python-Tutorial-中文版.pdf （P273）</a>中的实现 <p style="margin-top: 0px; margin-bottom: 10px; text-wrap: wrap; padding: 0px; list-style: none; border: 0px; overflow-wrap: break-word; word-break: break-all; line-height: 1.5em; font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; color: rgb(51, 51, 51); box-sizing: border-box;">当前系列所有demo下载地址：<p style="margin-top: 0px; margin-bottom: 10px; text-wrap: wrap; padding: 0px; list-style: none; border: 0px; overflow-wrap: break-word; word-break: break-all; line-height: 1.5em; font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; color: rgb(51, 51, 51); box-sizing: border-box;"><a href="https://github.com/GaoRenBao/OpenCv4-Demo" target="_blank" style="margin: 0px; padding: 0px; list-style: none; border: 0px; color: rgb(0, 102, 0); transition-duration: 0.2s; transition-property: opacity; outline: none; opacity: 0.8;">https://github.com/GaoRenBao/OpenCv4-Demo</a><p style="margin-top: 0px; margin-bottom: 10px; text-wrap: wrap; padding: 0px; list-style: none; border: 0px; overflow-wrap: break-word; word-break: break-all; line-height: 1.5em; font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; color: rgb(51, 51, 51); box-sizing: border-box;"><a href="https://gitee.com/fuckgrb/OpenCv4-Demo" target="_blank" style="margin: 0px; padding: 0px; list-style: none; border: 0px; color: rgb(0, 102, 0); transition-duration: 0.2s; transition-property: opacity; outline: none;">https://gitee.com/fuckgrb/OpenCv4-Demo</a><p style="margin-top: 0px; margin-bottom: 10px; text-wrap: wrap; padding: 0px; list-style: none; border: 0px; overflow-wrap: break-word; word-break: break-all; line-height: 1.5em; font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; box-sizing: border-box; color: rgb(51, 51, 51);">不同编程语言对应的OpenCv版本以及开发环境信息如下： <table border="1" style="text-wrap: wrap; border-right: none; border-bottom: none; border-image: initial; border-left: 1px solid rgb(102, 102, 102); border-top: 1px solid rgb(102, 102, 102);"><tbody><tr class="firstRow"><td width="81" valign="top" style="border-color: windowtext rgb(102, 102, 102) rgb(102, 102, 102) windowtext; border-bottom-width: 1px; border-bottom-style: solid; border-right-width: 1px; border-right-style: solid; padding: 5px;">语言</td><td width="223" valign="top" style="border-color: windowtext rgb(102, 102, 102) rgb(102, 102, 102) windowtext; border-bottom-width: 1px; border-bottom-style: solid; border-right-width: 1px; border-right-style: solid; padding: 5px;">OpenCv版本</td><td width="242" valign="top" style="border-color: windowtext rgb(102, 102, 102) rgb(102, 102, 102) windowtext; border-bottom-width: 1px; border-bottom-style: solid; border-right-width: 1px; border-right-style: solid; padding: 5px;">IDE</td></tr><tr><td width="81" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">C#</td><td width="223" valign="top" style="border-top: none; border-left-color: windowtext; word-break: break-all; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">OpenCvSharp4.4.8.0.20230708</td><td width="242" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">Visual Studio 2022</td></tr><tr><td width="81" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">C++</td><td width="223" valign="top" style="border-top: none; border-left-color: windowtext; word-break: break-all; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">OpenCv-4.5.5-vc14_vc15</td><td width="242" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">Visual Studio 2022</td></tr><tr><td width="81" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">Python</td><td width="223" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">OpenCv-Python (4.6.0.66)</td><td width="242" valign="top" style="border-top: none; border-left-color: windowtext; border-bottom: 1px solid rgb(102, 102, 102); border-right: 1px solid rgb(102, 102, 102); padding: 5px;">PyCharm Community Edition 2022.1.3</td></tr></tbody></table> 备注：PDF中提供的Python代码无法运行，当前的Python代码是CSDN上找的<a href="https://blog.csdn.net/JJSXBL/article/details/127924492" target="_blank">https://blog.csdn.net/JJSXBL/article/details/127924492</a> 测试原图<img src="/upload/image/6383669113329171091730634.png" title="digits.png" alt="digits.png"/> <strong style="font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; margin: 0px; padding: 0px; list-style: none; border: 0px; color: rgb(51, 51, 51); ">C#版本代码如下：GitHub上的一个demo<a href="https://github.com/shimat/opencvsharp_samples/tree/master/SamplesLegacy/Samples/SVMSample.cs" target="_blank">https://github.com/shimat/opencvsharp_samples/tree/master/SamplesLegacy/Samples/SVMSample.cs</a><pre class="brush:c#;toolbar:false">using OpenCvSharp; using OpenCvSharp.ML; using System; using System.Collections.Generic; using System.Linq; namespace demo {     internal class Program     {         #region GitHub上的一个demo         private static double Function(double x)         {             return x + 50 * Math.Sin(x / 15.0);         }         /// <summary>         /// GitHub上的一个demo         /// https://github.com/shimat/opencvsharp_samples/tree/master/SamplesLegacy/Samples/SVMSample.cs         /// </summary>         public static void RunTest()         {             // Training data                       var points = new Point2f[500];             var responses = new int[points.Length];             var rand = new Random();             for (int i = 0; i < responses.Length; i++)             {                 float x = rand.Next(0, 300);                 float y = rand.Next(0, 300);                 points[i] = new Point2f(x, y);                 responses[i] = (y > Function(x)) ? 1 : 2;             }             // Show training data and f(x)             using (Mat pointsPlot = Mat.Zeros(300, 300, MatType.CV_8UC3))             {                 for (int i = 0; i < points.Length; i++)                 {                     int x = (int)points[i].X;                     int y = (int)(300 - points[i].Y);                     int res = responses[i];                     Scalar color = (res == 1) ? Scalar.Red : Scalar.GreenYellow;                     pointsPlot.Circle(x, y, 2, color, -1);                 }                 // f(x)                 for (int x = 1; x < 300; x++)                 {                     int y1 = (int)(300 - Function(x - 1));                     int y2 = (int)(300 - Function(x));                     pointsPlot.Line(x - 1, y1, x, y2, Scalar.LightBlue, 1);                 }                 Cv2.ImShow("pointsPlot", pointsPlot);             }             // Train             var dataMat = new Mat(points.Length, 2, MatType.CV_32FC1, points);             var resMat = new Mat(responses.Length, 1, MatType.CV_32SC1, responses);             var svm = SVM.Create();             // normalize data             dataMat /= 300.0;             // SVM parameters             svm.Type = SVM.Types.CSvc;             svm.KernelType = SVM.KernelTypes.Rbf;             svm.TermCriteria = TermCriteria.Both(1000, 0.000001);             svm.Degree = 100.0;             svm.Gamma = 100.0;             svm.Coef0 = 1.0;             svm.C = 1.0;             svm.Nu = 0.5;             svm.P = 0.1;             svm.Train(dataMat, SampleTypes.RowSample, resMat);             // Predict for each 300x300 pixel             Mat retPlot = Mat.Zeros(300, 300, MatType.CV_8UC3);             for (int x = 0; x < 300; x++)             {                 for (int y = 0; y < 300; y++)                 {                     float[] sample = { x / 300f, y / 300f };                     var sampleMat = new Mat(1, 2, MatType.CV_32FC1, sample);                     int ret = (int)svm.Predict(sampleMat);                     var plotRect = new Rect(x, 300 - y, 1, 1);                     if (ret == 1)                         retPlot.Rectangle(plotRect, Scalar.Red);                     else if (ret == 2)                         retPlot.Rectangle(plotRect, Scalar.GreenYellow);                 }             }             Cv2.ImShow("retPlot", retPlot);         }         #endregion         #region SVM手写数字识别         /// <summary>         /// 切图         /// </summary>         /// <returns></returns>         static List<List<Mat>> RoiImages(string path)         {             Mat img = Cv2.ImRead(path);             Mat gray = new Mat();             Cv2.CvtColor(img, gray, ColorConversionCodes.BGR2GRAY);             List<List<Mat>> cells = new List<List<Mat>>();             int w = gray.Width / 100; // 每行100个字符             int h = gray.Height / 50; // 共50行             Mat roi = new Mat();             for (int i = 0; i < 10; i++) // 数字0~9             {                 cells.Add(new List<Mat>());                 for (int j = 0; j < 5; j++) // 每组数字有5行                 {                     for (int n = 0; n < 100; n++) // 每行100个字符                     {                         Rect rect = new Rect()                         {                             X = n * w,                             Y = h * ((i * 5) + j),                             Width = w,                             Height = h                         };                         roi = new Mat(gray, rect);                         cells[i].Add(roi);                         //Console.WriteLine(rect);                         //Cv2.ImShow("roi", roi);                         //Cv2.WaitKey(1);                     }                 }             }             return cells;         }         /// <summary>         /// SVM手写数字识别         /// </summary>         static void demo()         {             // 切图             var cells = RoiImages("../../../images/digits.png");             // 训练数据数量，前400张图片用来训练             int train_sample_count = 400 * 10;             // 声明训练数据集合 mat，图像尺寸：20*20             Mat trainData = new Mat(train_sample_count, 20 * 20, MatType.CV_32FC1);             // 声明训练数据标签 mat             Mat trainLabel = new Mat(train_sample_count, 1, MatType.CV_32SC1);             // 组织训练数据，循环训练文件夹内所有图片。             int trainNum = 0;             for (int i = 0; i < 10; i++)             {                 var cell = cells[i].Take(400);                 foreach (Mat temp in cell)                 {                     // 转换CV_32FC1，因为下面训练函数需要这个格式                     temp.ConvertTo(temp, MatType.CV_32FC1);                     // 写入到训练数据集合的mat内，注意reshape的用法。                     /*                     reshape有两个参数：                     其中，参数：cn为新的通道数，如果cn = 0，表示通道数不会改变。                     参数rows为新的行数，如果rows = 0，表示行数不会改变。                     注意：新的行* 列必须与原来的行*列相等。就是说，如果原来是5行3列，新的行和列可以是1行15列，3行5列，5行3列，15行1列。                           设置行数后，列数会自动调整。比如此处 调整为 1行784列。                     */                     temp.Reshape(0, 1).CopyTo(trainData.Row(trainNum));                     // 写入到训练标签集合的mat内                     // 这个必须是int类型，不能用float，和knn的操作有点差异                     trainLabel.Set<int>(trainNum, i);                     trainNum++;                 }             }             // 实例化对象             SVM svm = SVM.Create();             // 设置核函数             svm.KernelType = SVM.KernelTypes.Linear;             // 设置训练类型             svm.Type = SVM.Types.CSvc;             svm.C = 2.67;             svm.Gamma = 5.383;             // 输入样本，进行训练             svm.Train(trainData, SampleTypes.RowSample, trainLabel);             // 存储训练结果             //svm.Save("svm_data.dat");             /**************  Now testing ***************/             // 测试数据数量，后100张图片用来测试             int test_sample_count = 100 * 10;             // 声明测试数据集合 mat             Mat testData = new Mat(test_sample_count, 20 * 20, MatType.CV_32FC1);             // 声明测试数据标签 mat             Mat testLabel = new Mat(test_sample_count, 1, MatType.CV_32FC1);             // 组织测试数据             int testNum = 0;             for (int i = 0; i < 10; i++)             {                 var cell = cells[i].Skip(400);                 foreach (Mat temp in cell)                 {                     temp.ConvertTo(temp, MatType.CV_32FC1);                     temp.Reshape(0, 1).CopyTo(testData.Row(testNum));                     testLabel.Set<float>(testNum, i);                     testNum++;                 }             }             Mat result = new Mat();             svm.Predict(testData, result);             int t = 0;             int f = 0;             for (int i = 0; i < test_sample_count; i++)             {                 int predict = (int)result.At<float>(i);                 int actual = (int)testLabel.At<float>(i);                 if (predict == actual)                 {                     Console.WriteLine("正确：" + predict + "-" + actual);                     t++;                 }                 else                 {                     Console.WriteLine("错误------：" + predict + "-" + actual);                     f++;                 }             }             double accuracy = (t * 1.0) / (t + f);             Console.WriteLine("准确率：" + accuracy); // 准确率：0.908             Console.Read();         }         #endregion         static void Main(string[] args)         {             //RunTest();             //Cv2.WaitKey();             demo();         }     }</pre> <strong style="font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; margin: 0px; padding: 0px; list-style: none; border: 0px; color: rgb(51, 51, 51); ">C++版本代码如下：<pre class="brush:cpp;toolbar:false">#include <opencv2/opencv.hpp> #include <opencv2/ml/ml.hpp> #include <iostream> using namespace cv; using namespace cv::ml; using namespace std; // 切图 vector<vector<Mat>> RoiImages(string path) {     Mat img = cv::imread(path);     Mat gray;     cv::cvtColor(img, gray, COLOR_BGR2GRAY);     vector<vector<Mat>> cells;     int w = gray.cols / 100; // 每行100个字符     int h = gray.rows / 50; // 共50行     for (int i = 0; i < 10; i++) // 数字0~9     {         vector<Mat> imgs;         for (int j = 0; j < 5; j++) // 每组数字有5行         {             for (int n = 0; n < 100; n++) // 每行100个字符             {                 Rect rect(n * w, h * ((i * 5) + j), w, h);                 Mat roi = gray(rect);                 imgs.push_back(roi);                 //cv::imshow("roi", roi);                 //cv::waitKey(1);             }         }         cells.push_back(imgs);     }     return cells; } int main() {     // 切图     vector<vector<Mat>> cells = RoiImages("../images/digits.png");     // 训练数据数量，前400张图片用来训练     int train_sample_count = 400 * 10;     // 声明训练数据集合 mat，图像尺寸：20*20     Mat trainData(train_sample_count, 20 * 20, CV_32FC1);     // 声明训练数据标签 mat     Mat trainLabel(train_sample_count, 1, CV_32SC1);     // 组织训练数据，循环训练文件夹内所有图片。     int trainNum = 0;     for (int i = 0; i < 10; i++)     {         for (int j = 0; j < 400; j++)         {             Mat temp = cells[i][j];             // 转换CV_32FC1，因为下面训练函数需要这个格式             temp.convertTo(temp, CV_32FC1);             // 写入到训练数据集合的mat内，注意reshape的用法。             /*             reshape有两个参数：             其中，参数：cn为新的通道数，如果cn = 0，表示通道数不会改变。             参数rows为新的行数，如果rows = 0，表示行数不会改变。             注意：新的行* 列必须与原来的行*列相等。就是说，如果原来是5行3列，新的行和列可以是1行15列，3行5列，5行3列，15行1列。                   设置行数后，列数会自动调整。比如此处 调整为 1行784列。             */             temp.reshape(0, 1).copyTo(trainData.row(trainNum));             // 写入到训练标签集合的mat内             trainLabel.at<int>(trainNum) = i;             trainNum++;         }     }     // 实例化对象     Ptr<SVM> svm = SVM::create();     // 设置核函数     svm->setKernel(SVM::LINEAR);  //核函数     // 设置训练类型     svm->setType(SVM::C_SVC);     svm->setC(2.67);     svm->setGamma(5.383);     // 输入样本，进行训练     svm->train(TrainData::create(trainData, ROW_SAMPLE, trainLabel));     // 存储训练结果     svm->save("svm_data.dat");     /**************  Now testing ***************/     // 测试数据数量，后100张图片用来测试     int test_sample_count = 100 * 10;     // 声明测试数据集合 mat     Mat testData(test_sample_count, 20 * 20, CV_32FC1);     // 声明测试数据标签 mat     Mat testLabel(test_sample_count, 1, CV_32FC1);     // 组织测试数据     int testNum = 0;     for (int i = 0; i < 10; i++)     {         for (int j = 400; j < cells[i].size(); j++)         {             Mat temp = cells[i][j];             temp.convertTo(temp, CV_32FC1);             temp.reshape(0, 1).copyTo(testData.row(testNum));             testLabel.at<float>(testNum) = i;             testNum++;         }     }     Mat result;     svm->predict(testData, result);     int t = 0;     int f = 0;     for (int i = 0; i < test_sample_count; i++)     {         int predict = (int)result.at<float>(i);         int actual = (int)testLabel.at<float>(i);         if (predict == actual)         {             cout << "正确：" << predict << "-" << actual << endl;             t++;         }         else         {             cout << "错误------：" << predict << "-" << actual << endl;             f++;         }     }     double accuracy = (t * 1.0) / (t + f);     cout << "准确率：" << accuracy << endl; // 准确率：0.908 }</pre> <strong style="font-family: "Helvetica Neue", Helvetica, "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "Noto Sans CJK SC", "WenQuanYi Micro Hei", Arial, sans-serif; margin: 0px; padding: 0px; list-style: none; border: 0px; color: rgb(51, 51, 51); ">Python版本代码如下：PDF中的原始代码我在Git上有提供，需要的，可以去下载<pre class="brush:python;toolbar:false">import cv2 as cv import numpy as np SZ = 20 bin_n = 16  # Number of bins affine_flags = cv.WARP_INVERSE_MAP | cv.INTER_LINEAR # 这个函数用来给手写体做一个变形，变成正体 def deskew(img):     m = cv.moments(img)     if abs(m['mu02']) < 1e-2:         return img.copy()     skew = m['mu11'] / m['mu02']     M = np.float32([[1, skew, -0.5 * SZ * skew], [0, 1, 0]])     img = cv.warpAffine(img, M, (SZ, SZ), flags=affine_flags)     return img # 这个函数把图片分成四部分，分别成生成方向直方图，然后返回，用来作为SVM的输入，这点可K近邻算法是不一样的 def hog(img):     gx = cv.Sobel(img, cv.CV_32F, 1, 0)     gy = cv.Sobel(img, cv.CV_32F, 0, 1)     mag, ang = cv.cartToPolar(gx, gy)     bins = np.int32(bin_n * ang / (2 * np.pi))  # quantizing binvalues in (0...16)     bin_cells = bins[:10, :10], bins[10:, :10], bins[:10, 10:], bins[10:, 10:]     mag_cells = mag[:10, :10], mag[10:, :10], mag[:10, 10:], mag[10:, 10:]     hists = [np.bincount(b.ravel(), m.ravel(), bin_n) for b, m in zip(bin_cells, mag_cells)]     hist = np.hstack(hists)  # hist is a 64 bit vector     return hist # 样本读取 img = cv.imread(cv.samples.findFile('../images/digits.png'), 0) if img is None:     raise Exception("we need the digits.png image from samples/data here !") # 把整张图片分成单独的图片，原数据是50行，100列 cells = [np.hsplit(row, 100) for row in np.vsplit(img, 50)] # First half is trainData, remaining is testData train_cells = [i[:50] for i in cells] test_cells = [i[50:] for i in cells] ######################## Now training ######################## # 使用deskew函数，把倾斜的字体矫正 deskewed = [list(map(deskew, row)) for row in train_cells] # 生成图像的HOG图 hogdata = [list(map(hog, row)) for row in deskewed] # 序列化，变成输入数据 trainData = np.float32(hogdata).reshape(-1, 64) # 把与图片对应的数字生成出来 responses = np.repeat(np.arange(10), 250)[:, np.newaxis] # 实例化对象 svm = cv.ml.SVM_create() # 设置核函数 svm.setKernel(cv.ml.SVM_LINEAR) # 设置训练类型 svm.setType(cv.ml.SVM_C_SVC) svm.setC(2.67) svm.setGamma(5.383) # 输入样本，进行训练 svm.train(trainData, cv.ml.ROW_SAMPLE, responses) # 存储训练结果 svm.save('svm_data.dat') ######     Now testing      ######################## # 对检验样本的手写体进行矫正 deskewed = [list(map(deskew, row)) for row in test_cells] # 生成检验样本的HOG图 hogdata = [list(map(hog, row)) for row in deskewed] # 检验样本序列化 testData = np.float32(hogdata).reshape(-1, bin_n * 4) # 生成识别结果 result = svm.predict(testData)[1] #######   Check Accuracy   ######################## mask = result == responses correct = np.count_nonzero(mask) print(correct * 100.0 / result.size) # 输出：93.8</pre>

SVM手写数字识别（使用SVM进行手写数据OCR）

下面这段代码是 OpenCV-Python-Tutorial-中文版.pdf （P273）中的实现

当前系列所有demo下载地址：

https://github.com/GaoRenBao/OpenCv4-Demo

https://gitee.com/fuckgrb/OpenCv4-Demo

不同编程语言对应的OpenCv版本以及开发环境信息如下：

语言	OpenCv版本	IDE
C#	OpenCvSharp4.4.8.0.20230708	Visual Studio 2022
C++	OpenCv-4.5.5-vc14_vc15	Visual Studio 2022
Python	OpenCv-Python (4.6.0.66)	PyCharm Community Edition 2022.1.3

备注：PDF中提供的Python代码无法运行，当前的Python代码是CSDN上找的

https://blog.csdn.net/JJSXBL/article/details/127924492

测试原图

C#版本代码如下：

GitHub上的一个demo

https://github.com/shimat/opencvsharp_samples/tree/master/SamplesLegacy/Samples/SVMSample.cs

using OpenCvSharp;
using OpenCvSharp.ML;
using System;
using System.Collections.Generic;
using System.Linq;

namespace demo
{
    internal class Program
    {
        #region GitHub上的一个demo
        private static double Function(double x)
        {
            return x + 50 * Math.Sin(x / 15.0);
        }

        /// <summary>
        /// GitHub上的一个demo
        /// https://github.com/shimat/opencvsharp_samples/tree/master/SamplesLegacy/Samples/SVMSample.cs
        /// </summary>
        public static void RunTest()
        {
            // Training data          
            var points = new Point2f[500];
            var responses = new int[points.Length];
            var rand = new Random();
            for (int i = 0; i < responses.Length; i++)
            {
                float x = rand.Next(0, 300);
                float y = rand.Next(0, 300);
                points[i] = new Point2f(x, y);
                responses[i] = (y > Function(x)) ? 1 : 2;
            }

            // Show training data and f(x)
            using (Mat pointsPlot = Mat.Zeros(300, 300, MatType.CV_8UC3))
            {
                for (int i = 0; i < points.Length; i++)
                {
                    int x = (int)points[i].X;
                    int y = (int)(300 - points[i].Y);
                    int res = responses[i];
                    Scalar color = (res == 1) ? Scalar.Red : Scalar.GreenYellow;
                    pointsPlot.Circle(x, y, 2, color, -1);
                }
                // f(x)
                for (int x = 1; x < 300; x++)
                {
                    int y1 = (int)(300 - Function(x - 1));
                    int y2 = (int)(300 - Function(x));
                    pointsPlot.Line(x - 1, y1, x, y2, Scalar.LightBlue, 1);
                }
                Cv2.ImShow("pointsPlot", pointsPlot);
            }

            // Train
            var dataMat = new Mat(points.Length, 2, MatType.CV_32FC1, points);
            var resMat = new Mat(responses.Length, 1, MatType.CV_32SC1, responses);
            var svm = SVM.Create();
            // normalize data
            dataMat /= 300.0;

            // SVM parameters
            svm.Type = SVM.Types.CSvc;
            svm.KernelType = SVM.KernelTypes.Rbf;
            svm.TermCriteria = TermCriteria.Both(1000, 0.000001);
            svm.Degree = 100.0;
            svm.Gamma = 100.0;
            svm.Coef0 = 1.0;
            svm.C = 1.0;
            svm.Nu = 0.5;
            svm.P = 0.1;

            svm.Train(dataMat, SampleTypes.RowSample, resMat);

            // Predict for each 300x300 pixel
            Mat retPlot = Mat.Zeros(300, 300, MatType.CV_8UC3);
            for (int x = 0; x < 300; x++)
            {
                for (int y = 0; y < 300; y++)
                {
                    float[] sample = { x / 300f, y / 300f };
                    var sampleMat = new Mat(1, 2, MatType.CV_32FC1, sample);
                    int ret = (int)svm.Predict(sampleMat);
                    var plotRect = new Rect(x, 300 - y, 1, 1);
                    if (ret == 1)
                        retPlot.Rectangle(plotRect, Scalar.Red);
                    else if (ret == 2)
                        retPlot.Rectangle(plotRect, Scalar.GreenYellow);
                }
            }
            Cv2.ImShow("retPlot", retPlot);
        }
        #endregion

        #region SVM手写数字识别
        /// <summary>
        /// 切图
        /// </summary>
        /// <returns></returns>
        static List<List<Mat>> RoiImages(string path)
        {
            Mat img = Cv2.ImRead(path);
            Mat gray = new Mat();
            Cv2.CvtColor(img, gray, ColorConversionCodes.BGR2GRAY);

            List<List<Mat>> cells = new List<List<Mat>>();
            int w = gray.Width / 100; // 每行100个字符
            int h = gray.Height / 50; // 共50行
            Mat roi = new Mat();
            for (int i = 0; i < 10; i++) // 数字0~9
            {
                cells.Add(new List<Mat>());
                for (int j = 0; j < 5; j++) // 每组数字有5行
                {
                    for (int n = 0; n < 100; n++) // 每行100个字符
                    {
                        Rect rect = new Rect()
                        {
                            X = n * w,
                            Y = h * ((i * 5) + j),
                            Width = w,
                            Height = h
                        };
                        roi = new Mat(gray, rect);
                        cells[i].Add(roi);

                        //Console.WriteLine(rect);
                        //Cv2.ImShow("roi", roi);
                        //Cv2.WaitKey(1);
                    }
                }
            }
            return cells;
        }

        /// <summary>
        /// SVM手写数字识别
        /// </summary>
        static void demo()
        {
            // 切图
            var cells = RoiImages("../../../images/digits.png");

            // 训练数据数量，前400张图片用来训练
            int train_sample_count = 400 * 10;
            // 声明训练数据集合 mat，图像尺寸：20*20
            Mat trainData = new Mat(train_sample_count, 20 * 20, MatType.CV_32FC1);
            // 声明训练数据标签 mat
            Mat trainLabel = new Mat(train_sample_count, 1, MatType.CV_32SC1);

            // 组织训练数据，循环训练文件夹内所有图片。
            int trainNum = 0;
            for (int i = 0; i < 10; i++)
            {
                var cell = cells[i].Take(400);
                foreach (Mat temp in cell)
                {
                    // 转换CV_32FC1，因为下面训练函数需要这个格式
                    temp.ConvertTo(temp, MatType.CV_32FC1);
                    // 写入到训练数据集合的mat内，注意reshape的用法。
                    /*
                    reshape有两个参数：
                    其中，参数：cn为新的通道数，如果cn = 0，表示通道数不会改变。
                    参数rows为新的行数，如果rows = 0，表示行数不会改变。
                    注意：新的行* 列必须与原来的行*列相等。就是说，如果原来是5行3列，新的行和列可以是1行15列，3行5列，5行3列，15行1列。
                          设置行数后，列数会自动调整。比如此处 调整为 1行784列。
                    */
                    temp.Reshape(0, 1).CopyTo(trainData.Row(trainNum));
                    // 写入到训练标签集合的mat内
                    // 这个必须是int类型，不能用float，和knn的操作有点差异
                    trainLabel.Set<int>(trainNum, i);
                    trainNum++;
                }
            }

            // 实例化对象
            SVM svm = SVM.Create();
            // 设置核函数
            svm.KernelType = SVM.KernelTypes.Linear;
            // 设置训练类型
            svm.Type = SVM.Types.CSvc;
            svm.C = 2.67;
            svm.Gamma = 5.383;
            // 输入样本，进行训练
            svm.Train(trainData, SampleTypes.RowSample, trainLabel);
            // 存储训练结果
            //svm.Save("svm_data.dat");

            /**************  Now testing ***************/
            // 测试数据数量，后100张图片用来测试
            int test_sample_count = 100 * 10;
            // 声明测试数据集合 mat
            Mat testData = new Mat(test_sample_count, 20 * 20, MatType.CV_32FC1);
            // 声明测试数据标签 mat
            Mat testLabel = new Mat(test_sample_count, 1, MatType.CV_32FC1);
            // 组织测试数据
            int testNum = 0;
            for (int i = 0; i < 10; i++)
            {
                var cell = cells[i].Skip(400);
                foreach (Mat temp in cell)
                {
                    temp.ConvertTo(temp, MatType.CV_32FC1);
                    temp.Reshape(0, 1).CopyTo(testData.Row(testNum));
                    testLabel.Set<float>(testNum, i);
                    testNum++;
                }
            }

            Mat result = new Mat();
            svm.Predict(testData, result);
            int t = 0;
            int f = 0;
            for (int i = 0; i < test_sample_count; i++)
            {
                int predict = (int)result.At<float>(i);
                int actual = (int)testLabel.At<float>(i);
                if (predict == actual)
                {
                    Console.WriteLine("正确：" + predict + "-" + actual);
                    t++;
                }
                else
                {
                    Console.WriteLine("错误------：" + predict + "-" + actual);
                    f++;
                }
            }

            double accuracy = (t * 1.0) / (t + f);
            Console.WriteLine("准确率：" + accuracy); // 准确率：0.908
            Console.Read();
        }
        #endregion

        static void Main(string[] args)
        {
            //RunTest();
            //Cv2.WaitKey();

            demo();
        }
    }

C++版本代码如下：

#include <opencv2/opencv.hpp>
#include <opencv2/ml/ml.hpp>
#include <iostream>

using namespace cv;
using namespace cv::ml;
using namespace std;

// 切图
vector<vector<Mat>> RoiImages(string path)
{
    Mat img = cv::imread(path);
    Mat gray;
    cv::cvtColor(img, gray, COLOR_BGR2GRAY);

    vector<vector<Mat>> cells;
    int w = gray.cols / 100; // 每行100个字符
    int h = gray.rows / 50; // 共50行
    for (int i = 0; i < 10; i++) // 数字0~9
    {
        vector<Mat> imgs;
        for (int j = 0; j < 5; j++) // 每组数字有5行
        {
            for (int n = 0; n < 100; n++) // 每行100个字符
            {
                Rect rect(n * w, h * ((i * 5) + j), w, h);
                Mat roi = gray(rect);
                imgs.push_back(roi);
                //cv::imshow("roi", roi);
                //cv::waitKey(1);
            }
        }
        cells.push_back(imgs);
    }
    return cells;
}

int main()
{
    // 切图
    vector<vector<Mat>> cells = RoiImages("../images/digits.png");

    // 训练数据数量，前400张图片用来训练
    int train_sample_count = 400 * 10;
    // 声明训练数据集合 mat，图像尺寸：20*20
    Mat trainData(train_sample_count, 20 * 20, CV_32FC1);
    // 声明训练数据标签 mat
    Mat trainLabel(train_sample_count, 1, CV_32SC1);

    // 组织训练数据，循环训练文件夹内所有图片。
    int trainNum = 0;
    for (int i = 0; i < 10; i++)
    {
        for (int j = 0; j < 400; j++)
        {
            Mat temp = cells[i][j];
            // 转换CV_32FC1，因为下面训练函数需要这个格式
            temp.convertTo(temp, CV_32FC1);
            // 写入到训练数据集合的mat内，注意reshape的用法。
            /*
            reshape有两个参数：
            其中，参数：cn为新的通道数，如果cn = 0，表示通道数不会改变。
            参数rows为新的行数，如果rows = 0，表示行数不会改变。
            注意：新的行* 列必须与原来的行*列相等。就是说，如果原来是5行3列，新的行和列可以是1行15列，3行5列，5行3列，15行1列。
                  设置行数后，列数会自动调整。比如此处 调整为 1行784列。
            */
            temp.reshape(0, 1).copyTo(trainData.row(trainNum));
            // 写入到训练标签集合的mat内
            trainLabel.at<int>(trainNum) = i;
            trainNum++;
        }
    }


    // 实例化对象
    Ptr<SVM> svm = SVM::create();
    // 设置核函数
    svm->setKernel(SVM::LINEAR);  //核函数
    // 设置训练类型
    svm->setType(SVM::C_SVC);
    svm->setC(2.67);
    svm->setGamma(5.383);
    // 输入样本，进行训练
    svm->train(TrainData::create(trainData, ROW_SAMPLE, trainLabel));
    // 存储训练结果
    svm->save("svm_data.dat");

    /**************  Now testing ***************/
    // 测试数据数量，后100张图片用来测试
    int test_sample_count = 100 * 10;
    // 声明测试数据集合 mat
    Mat testData(test_sample_count, 20 * 20, CV_32FC1);
    // 声明测试数据标签 mat
    Mat testLabel(test_sample_count, 1, CV_32FC1);
    // 组织测试数据
    int testNum = 0;
    for (int i = 0; i < 10; i++)
    {
        for (int j = 400; j < cells[i].size(); j++)
        {
            Mat temp = cells[i][j];
            temp.convertTo(temp, CV_32FC1);
            temp.reshape(0, 1).copyTo(testData.row(testNum));
            testLabel.at<float>(testNum) = i;
            testNum++;
        }
    }

    Mat result;
    svm->predict(testData, result);
    int t = 0;
    int f = 0;
    for (int i = 0; i < test_sample_count; i++)
    {
        int predict = (int)result.at<float>(i);
        int actual = (int)testLabel.at<float>(i);
        if (predict == actual)
        {
            cout << "正确：" << predict << "-" << actual << endl;
            t++;
        }
        else
        {
            cout << "错误------：" << predict << "-" << actual << endl;
            f++;
        }
    }

    double accuracy = (t * 1.0) / (t + f);
    cout << "准确率：" << accuracy << endl; // 准确率：0.908
}

Python版本代码如下：

PDF中的原始代码我在Git上有提供，需要的，可以去下载

import cv2 as cv
import numpy as np

SZ = 20
bin_n = 16  # Number of bins
affine_flags = cv.WARP_INVERSE_MAP | cv.INTER_LINEAR


# 这个函数用来给手写体做一个变形，变成正体
def deskew(img):
    m = cv.moments(img)
    if abs(m['mu02']) < 1e-2:
        return img.copy()
    skew = m['mu11'] / m['mu02']
    M = np.float32([[1, skew, -0.5 * SZ * skew], [0, 1, 0]])
    img = cv.warpAffine(img, M, (SZ, SZ), flags=affine_flags)
    return img


# 这个函数把图片分成四部分，分别成生成方向直方图，然后返回，用来作为SVM的输入，这点可K近邻算法是不一样的
def hog(img):
    gx = cv.Sobel(img, cv.CV_32F, 1, 0)
    gy = cv.Sobel(img, cv.CV_32F, 0, 1)
    mag, ang = cv.cartToPolar(gx, gy)
    bins = np.int32(bin_n * ang / (2 * np.pi))  # quantizing binvalues in (0...16)
    bin_cells = bins[:10, :10], bins[10:, :10], bins[:10, 10:], bins[10:, 10:]
    mag_cells = mag[:10, :10], mag[10:, :10], mag[:10, 10:], mag[10:, 10:]
    hists = [np.bincount(b.ravel(), m.ravel(), bin_n) for b, m in zip(bin_cells, mag_cells)]
    hist = np.hstack(hists)  # hist is a 64 bit vector
    return hist


# 样本读取
img = cv.imread(cv.samples.findFile('../images/digits.png'), 0)
if img is None:
    raise Exception("we need the digits.png image from samples/data here !")

# 把整张图片分成单独的图片，原数据是50行，100列
cells = [np.hsplit(row, 100) for row in np.vsplit(img, 50)]
# First half is trainData, remaining is testData
train_cells = [i[:50] for i in cells]
test_cells = [i[50:] for i in cells]

######################## Now training ########################

# 使用deskew函数，把倾斜的字体矫正
deskewed = [list(map(deskew, row)) for row in train_cells]
# 生成图像的HOG图
hogdata = [list(map(hog, row)) for row in deskewed]
# 序列化，变成输入数据
trainData = np.float32(hogdata).reshape(-1, 64)
# 把与图片对应的数字生成出来
responses = np.repeat(np.arange(10), 250)[:, np.newaxis]

# 实例化对象
svm = cv.ml.SVM_create()
# 设置核函数
svm.setKernel(cv.ml.SVM_LINEAR)
# 设置训练类型
svm.setType(cv.ml.SVM_C_SVC)
svm.setC(2.67)
svm.setGamma(5.383)

# 输入样本，进行训练
svm.train(trainData, cv.ml.ROW_SAMPLE, responses)
# 存储训练结果
svm.save('svm_data.dat')

######     Now testing      ########################

# 对检验样本的手写体进行矫正
deskewed = [list(map(deskew, row)) for row in test_cells]
# 生成检验样本的HOG图
hogdata = [list(map(hog, row)) for row in deskewed]
# 检验样本序列化
testData = np.float32(hogdata).reshape(-1, bin_n * 4)
# 生成识别结果
result = svm.predict(testData)[1]

#######   Check Accuracy   ########################
mask = result == responses
correct = np.count_nonzero(mask)
print(correct * 100.0 / result.size)
# 输出：93.8