Typical Deep Learning Models and Code for Human Pose Estimation

Author: AIHGF
Published: June 28, 2018
Category: Pose Estimation

Original article: Deep Learning Models and Code for Pose Estimation (https://modelzoo.co/blog/deep-learning-models-and-code-for-pose-estimation)

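Pose estimation, in its dense form, aims to map the human pixels in RGB images and video to the 3D surface of the body. It touches many computer vision tasks, such as object detection, pose estimation, and segmentation.

Its applications go beyond plain keypoint localization, for example graphics, Augmented Reality (AR), and Human-Computer Interaction (HCI), and extend to many aspects of 3D object recognition.

This post collects several open-source deep learning models and code implementations for pose estimation.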

1. DensePose

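[Home - DensePose] http://densepose.org/
[Github - DensePose] https://github.com/facebookresearch/Densepose - built on Detectron (https://github.com/facebookresearch/Detectron) and Caffe2 (https://github.com/caffe2/caffe2).
[Dataset - DensePose] https://github.com/facebookresearch/DensePose/blob/master/INSTALL.md#fetch-densepose-data
[Paper - DensePose: Dense Human Pose Estimation In The Wild - 2018] https://arxiv.org/abs/1802.00434

DensePose comes from Facebook Research (https://research.fb.com/), which has open-sourced the code, models, and dataset.
The DensePose dataset, DensePose-COCO, is a large-scale dataset of image-to-surface correspondences manually annotated on 50K COCO (http://cocodataset.org/) images.

The DensePose paper proposes DensePose-RCNN, a variant of Mask-RCNN that densely regresses part-specific UV coordinates within every human region at multiple frames per second.
DensePose builds on DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild (2016, https://arxiv.org/abs/1612.01202).
Its goal is to determine, for each pixel, its location on the body surface together with the 2D parameterization of the body part that the pixel belongs to.

DensePose adopts a Mask R-CNN architecture with an FPN (Feature Pyramid Network) backbone and RoI-Align pooling.
On top of the RoI-pooled features, it adds a fully convolutional network.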
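
To make the per-pixel output more concrete, here is a minimal sketch in plain Python. It assumes a dense prediction stored as a grid of (part index, U, V) triples, with part index 0 meaning background and U, V in [0, 1]. The function name `group_by_part` and the toy grid are hypothetical and are not part of the DensePose codebase.

```python
from collections import defaultdict

BACKGROUND = 0  # assumed convention: part index 0 means "no person here"


def group_by_part(iuv):
    """Group the pixels of a dense prediction by body-part index.

    iuv is a 2D grid (list of rows); each cell is (part, u, v), where
    (u, v) are surface coordinates inside that part's 2D chart.
    Returns {part: [(x, y, u, v), ...]} with background pixels skipped.
    """
    parts = defaultdict(list)
    for y, row in enumerate(iuv):
        for x, (part, u, v) in enumerate(row):
            if part == BACKGROUND:
                continue
            if not (0.0 <= u <= 1.0 and 0.0 <= v <= 1.0):
                raise ValueError(f"UV out of range at pixel ({x}, {y})")
            parts[part].append((x, y, u, v))
    return dict(parts)


# Toy 2x3 "image": two pixels on part 2, one pixel on part 15.
toy = [
    [(0, 0.0, 0.0), (2, 0.25, 0.5), (2, 0.5, 0.5)],
    [(0, 0.0, 0.0), (15, 0.1, 0.9), (0, 0.0, 0.0)],
]
grouped = group_by_part(toy)
```

Each entry in `grouped` ties an image pixel to a point on one body part's chart, which is the image-to-surface correspondence described above.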

See also: GitHub project - DensePose (https://www.aiuai.cn/aifarm278.html)

2. OpenPose

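[Github - OpenPose] https://github.com/CMU-Perceptual-Computing-Lab/openpose
[Dataset - OpenPose] http://domedb.perception.cs.cmu.edu/

OpenPose is a real-time multi-person keypoint detection library open-sourced by the CMU Perceptual Computing Lab (https://github.com/CMU-Perceptual-Computing-Lab).

OpenPose provides 2D and 3D multi-person keypoint detection, as well as a calibration toolbox for estimating domain-specific parameters.
It accepts many kinds of input, such as images, video, and IP cameras.
Its output can also take many forms: images with rendered keypoints (PNG, JPG, AVI), keypoints in readable formats (JSON, XML, YML), or even array classes.
The input and output parameters can be adjusted as needed.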
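
As an illustration of consuming the JSON keypoint output, the sketch below reshapes a flat list of numbers into (x, y, confidence) triples per person. The key names `people` and `pose_keypoints_2d` are assumptions about the file layout and may differ between OpenPose versions. The helper `parse_people` and the sample document are hypothetical.

```python
import json


def parse_people(json_text, key="pose_keypoints_2d"):
    """Return one list of (x, y, confidence) triples per detected person.

    Assumes each person stores keypoints as a flat list
    [x1, y1, c1, x2, y2, c2, ...] under `key` (an assumed name).
    """
    doc = json.loads(json_text)
    people = []
    for person in doc.get("people", []):
        flat = person.get(key, [])
        if len(flat) % 3 != 0:
            raise ValueError("expected a flat list of x, y, confidence triples")
        people.append([tuple(flat[i:i + 3]) for i in range(0, len(flat), 3)])
    return people


# Hand-written sample with one person and two keypoints; the second
# keypoint has confidence 0.0, treated here as "not detected".
sample = '{"people": [{"pose_keypoints_2d": [10.0, 20.0, 0.9, 30.0, 40.0, 0.0]}]}'
people = parse_people(sample)
visible = [kp for kp in people[0] if kp[2] > 0.0]
```

Filtering on the confidence value, as `visible` does, is a common first step before drawing or tracking keypoints.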

OpenPose provides a C++ API and runs on both CPU and GPU, including AMD graphics cards.

3. Realtime Multi-Person Pose Estimation

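[Github - Realtime Multi-Person Pose Estimation] https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation
[Paper - Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields - CVPR2017] https://arxiv.org/abs/1611.08050
[Convolutional Pose Machines - CVPR2016] http://arxiv.org/abs/1602.00134

The Realtime Multi-Person Pose Estimation implementation is closely related to OpenPose.
It takes a bottom-up approach to real-time multi-person pose estimation and does not need any person detector.

It uses a non-parametric representation, Part Affinity Fields (PAFs), to learn to associate body parts with the individual people in the image.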
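
The association step can be sketched as follows: a candidate limb between two detected keypoints is scored by sampling the PAF along the segment that joins them and averaging the dot product with the limb's unit direction. This is a simplified, pure-Python illustration of that idea. The function `paf_score`, the nearest-pixel sampling, and the toy field are assumptions and are not code from the repository.

```python
import math


def paf_score(paf, p1, p2, num_samples=10):
    """Average alignment between a PAF and the segment p1 -> p2.

    paf[y][x] is a 2D vector (vx, vy); p1 and p2 are (x, y) pixel
    coordinates. Returns a value in [-1, 1] for unit-length fields.
    """
    if num_samples < 2:
        raise ValueError("num_samples must be at least 2")
    dx, dy = p2[0] - p1[0], p2[1] - p1[1]
    length = math.hypot(dx, dy)
    if length == 0:
        return 0.0
    ux, uy = dx / length, dy / length  # unit direction of the limb
    total = 0.0
    for i in range(num_samples):
        t = i / (num_samples - 1)
        # Nearest-pixel lookup; real implementations may interpolate.
        x = int(round(p1[0] + t * dx))
        y = int(round(p1[1] + t * dy))
        vx, vy = paf[y][x]
        total += vx * ux + vy * uy
    return total / num_samples


# Toy 5x5 field that points in +x everywhere.
field = [[(1.0, 0.0) for _ in range(5)] for _ in range(5)]
along = paf_score(field, (0, 2), (4, 2))     # limb aligned with the field
across = paf_score(field, (2, 0), (2, 4))    # limb perpendicular to it
reverse = paf_score(field, (4, 2), (0, 2))   # limb pointing the other way
```

A high score suggests the two keypoints belong to the same limb of the same person. Scoring candidate pairs this way is what lets the bottom-up method skip a separate person detector.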

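Other implementations:

OpenPose C++ Library: https://github.com/CMU-Perceptual-Computing-Lab/openpose
TensorFlow implementation: https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation
Keras implementation one: https://modelzoo.co/model/keras-realtime-multi-person-pose-estimation and two: https://github.com/michalfaber/keras_Realtime_Multi-Person_Pose_Estimation
PyTorch implementation one: https://github.com/tensorboy/pytorch_Realtime_Multi-Person_Pose_Estimation, two: https://github.com/DavexPro/pytorch-pose-estimation, and three: https://github.com/MVIG-SJTU/AlphaPose/tree/pytorch
MXNet implementation: https://github.com/dragonfly90/mxnet_Realtime_Multi-Person_Pose_Estimation
Chainer implementation: https://github.com/DeNA/Chainer_Realtime_Multi-Person_Pose_Estimation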

4. AlphaPose

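[Home - AlphaPose] http://www.mvig.org/research/alphapose.html
[Github - AlphaPose] https://github.com/MVIG-SJTU/AlphaPose - includes TensorFlow and PyTorch implementations.
[Paper - RMPE: Regional Multi-person Pose Estimation - ICCV2017] https://arxiv.org/abs/1612.00137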

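AlphaPose is an accurate multi-person pose estimator open-sourced by Shanghai Jiao Tong University. Its authors claim it is the first open-source system to reach 70+ mAP on COCO and 80+ mAP on MPII.
AlphaPose can run pose estimation and pose tracking on images, videos, and image lists. Its outputs include keypoint-annotated images in PNG, JPEG, and AVI formats and keypoints in JSON, which suits many applications.

AlphaPose uses the regional multi-person pose estimation (RMPE) framework to improve pose estimation when human bounding boxes are inaccurate. It has three main components: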

Symmetric Spatial Transformer Network (SSTN)
Parametric Pose Non-Maximum-Suppression (NMS)
Pose-Guided Proposals Generator (PGPG)
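
To give a feel for what pose-level non-maximum suppression does, here is a deliberately simplified greedy version. It is a stand-in, not the parametric criterion from the RMPE paper: the distance used here is just the mean Euclidean distance between corresponding keypoints, and the threshold is fixed by hand. All names are hypothetical.

```python
import math


def pose_distance(a, b):
    """Mean Euclidean distance between corresponding keypoints."""
    return sum(math.hypot(p[0] - q[0], p[1] - q[1]) for p, q in zip(a, b)) / len(a)


def greedy_pose_nms(poses, scores, dist_thresh):
    """Keep the best-scoring pose of each cluster of near-duplicates.

    poses: list of poses, each a list of (x, y) keypoints in the same order.
    scores: one confidence per pose.
    Returns the indices of the kept poses, best score first.
    """
    order = sorted(range(len(poses)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Drop pose i if it is too close to a pose that was already kept.
        if all(pose_distance(poses[i], poses[j]) > dist_thresh for j in keep):
            keep.append(i)
    return keep


poses = [
    [(0, 0), (10, 0)],        # person A
    [(1, 0), (11, 0)],        # near-duplicate of person A
    [(100, 100), (110, 100)]  # person B
]
kept = greedy_pose_nms(poses, scores=[0.9, 0.8, 0.7], dist_thresh=5.0)
```

Redundant detections of the same person typically come from overlapping or inaccurate bounding boxes. That is the situation the RMPE framework is designed for.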

5. MPII Human Pose

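[Home - MPII Human Pose Models] http://pose.mpi-inf.mpg.de/
[Github - MPII Pose] https://github.com/eldar/pose-tensorflow
[Dataset - MPII] http://human-pose.mpi-inf.mpg.de/
[Paper - ArtTrack] https://arxiv.org/abs/1612.01465
[Paper - DeeperCut] https://arxiv.org/abs/1605.03170

The MPII Human Pose dataset is a large-scale dataset for articulated human pose estimation.

This open-source project is a TensorFlow implementation of a human body pose estimation algorithm, based on the ArtTrack and DeeperCut papers.
It targets articulated human pose estimation in real-world images and handles person detection and pose estimation jointly, rather than detecting people first and estimating their poses afterwards.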

6. DeepPose

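[Paper - DeepPose] https://research.google.com/pubs/archive/42237.pdf
[Github - Chainer] https://github.com/mitmul/deeppose - unofficial implementation
[Github - TensorFlow] https://github.com/asanakoy/deeppose_tf - unofficial implementation

DeepPose is a 2014 paper that was the first to apply deep neural networks to human pose estimation, formulating it as DNN-based regression of keypoint coordinates.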
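
The regression view can be illustrated with the target encoding alone. Joint coordinates are expressed relative to a person bounding box, so the network regresses values in a fixed range, and training minimizes an L2 loss on them. The sketch below assumes a box given as (center x, center y, width, height). The helper names are hypothetical, and no network is involved.

```python
def normalize(joints, box):
    """Map absolute (x, y) joints into box-relative coordinates."""
    cx, cy, w, h = box
    return [((x - cx) / w, (y - cy) / h) for x, y in joints]


def denormalize(norm_joints, box):
    """Inverse of normalize: map box-relative joints back to pixels."""
    cx, cy, w, h = box
    return [(nx * w + cx, ny * h + cy) for nx, ny in norm_joints]


def l2_loss(pred, target):
    """Sum of squared coordinate differences over all joints."""
    return sum((px - tx) ** 2 + (py - ty) ** 2
               for (px, py), (tx, ty) in zip(pred, target))


box = (50.0, 100.0, 100.0, 200.0)  # assumed layout: cx, cy, width, height
joints = [(50.0, 100.0), (100.0, 200.0), (0.0, 0.0)]
targets = normalize(joints, box)        # what the regressor would learn to output
restored = denormalize(targets, box)    # back to pixel coordinates
```

With this encoding, the box center maps to (0, 0) and the box corners map to (±0.5, ±0.5), regardless of how large the person appears in the image.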

Last modified: October 9, 2018


