Machine Learning - Accuracy Evaluation: Confusion Matrix, Mean Class Accuracy, Per-Class Accuracy, Overall Accuracy, F1-score, etc. for Classification Tasks

Python's sklearn.metrics provides evaluation metrics for many tasks: for classification, the confusion matrix, mean class accuracy, per-class accuracy, overall accuracy, F1-score, and more; it also includes built-in functions for regression, clustering, and other tasks.
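
As a quick illustration, the overall accuracy and F1-score mentioned above correspond to accuracy_score and f1_score in sklearn.metrics. A minimal sketch, using made-up label values for illustration only:

from sklearn.metrics import accuracy_score, f1_score

# hypothetical ground-truth and predicted labels, for illustration only
gt_labels = [0, 1, 2, 2, 0, 1]
pred_labels = [0, 2, 2, 2, 0, 1]

# overall classification accuracy: fraction of correctly predicted samples
print(accuracy_score(gt_labels, pred_labels))              # 0.8333... (5 of 6 correct)

# macro-averaged F1-score: F1 computed per class, then averaged over classes
print(f1_score(gt_labels, pred_labels, average="macro"))   # ≈ 0.822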

<h2>1. Classification - Confusion Matrix</h2>

sklearn.metrics.confusion_matrix

from sklearn.metrics import confusion_matrix

Computes the confusion matrix to evaluate classification accuracy.

Denote the confusion matrix by ${ C }$. Its element ${ C_{ij} }$ is the number of samples with gt_label=i and pred_label=j, where i, j are the class labels.

In binary classification, the number of true negatives is ${ C_{0,0} }$, false negatives is ${ C_{1,0} }$, true positives is ${ C_{1,1} }$, and false positives is ${ C_{0,1} }$.

Usage:

C = confusion_matrix(gt_labels, pred_labels, labels=None, sample_weight=None)
# C is the n_classes x n_classes confusion matrix
  • gt_labels - ground-truth label values
  • pred_labels - label values predicted by the classifier
  • labels - list of labels used to index the confusion matrix

Example 1:

from sklearn.metrics import confusion_matrix
gt_labels = [2, 0, 2, 2, 0, 1]
pred_labels = [0, 0, 2, 2, 0, 2]
confusion_matrix(gt_labels, pred_labels)
# array([[2, 0, 0],
#        [0, 0, 1],
#        [1, 0, 2]])
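
The per-class, mean-class, and overall accuracies can be read directly off this matrix: the diagonal holds the correctly classified counts for each class. A rough sketch using the matrix from Example 1 (plain NumPy, not a dedicated sklearn API):

import numpy as np
from sklearn.metrics import confusion_matrix

gt_labels = [2, 0, 2, 2, 0, 1]
pred_labels = [0, 0, 2, 2, 0, 2]
C = confusion_matrix(gt_labels, pred_labels)

# per-class accuracy: correct predictions of each class / samples of that class
per_class_acc = C.diagonal() / C.sum(axis=1)
# array([1.        , 0.        , 0.66666667])

# mean class accuracy: average of the per-class accuracies
mean_class_acc = per_class_acc.mean()         # ≈ 0.556

# overall classification accuracy: all correct predictions / all samples
overall_acc = C.diagonal().sum() / C.sum()    # ≈ 0.667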

Example 2:

from sklearn.metrics import confusion_matrix
gt_labels = ["cat", "ant", "cat", "cat", "ant", "bird"]
pred_labels = ["ant", "ant", "cat", "cat", "ant", "cat"]
confusion_matrix(gt_labels, pred_labels, labels=["ant", "bird", "cat"])
# array([[2, 0, 0],
#        [0, 0, 1],
#        [1, 0, 2]])

Example 3:

Binary classification case:

from sklearn.metrics import confusion_matrix
tn, fp, fn, tp = confusion_matrix([0, 1, 0, 1], [1, 1, 1, 0]).ravel()
#(tn, fp, fn, tp)
#(0, 2, 1, 1)
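
From tn, fp, fn, tp, the precision, recall, and F1-score can also be computed by hand; a rough sketch (sklearn.metrics also provides precision_score, recall_score, and f1_score, which return the same values directly):

from sklearn.metrics import confusion_matrix

tn, fp, fn, tp = confusion_matrix([0, 1, 0, 1], [1, 1, 1, 0]).ravel()

# precision: of the samples predicted positive, the fraction that are truly positive
precision = tp / (tp + fp)    # 1 / 3 ≈ 0.333

# recall: of the truly positive samples, the fraction predicted positive
recall = tp / (tp + fn)       # 1 / 2 = 0.5

# F1-score: harmonic mean of precision and recall
f1 = 2 * precision * recall / (precision + recall)    # 0.4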