Skip to content

Latest commit

 

History

History
58 lines (52 loc) · 3.03 KB

File metadata and controls

58 lines (52 loc) · 3.03 KB

Pose Estimation Similarity With Tensorflow

This repository consists of all the code required for similar 2-D pose detection in dance videos. This can used for any type of pose estimation application to find the similarity.

We will be using tensorflow for our position estimation using openpose for tensorflow.

Outputs:

For Wrong Dance:

For Right Dance:

Credits:

Ildoo Kim: https://github.com/ildoonet
GitHub Repo Link: https://github.com/ildoonet/tf-pose-estimation

Gunjan Seth: https://github.com/gsethi2409
GitHub Repo Link: https://github.com/gsethi2409/tf-pose-estimation

Ipython Notebook File for Pose Estimation Similarity:

2D-Pose-Similarity.ipynb

Requirements:

  • Python 3.7
  • Tensorflow 2.0+
  • OpenCV

Installation Steps:

1. Git Clone:

Clone this repository into your local machine.
git clone https://github.com/mohit9949/Pose-Estimation-Similarity-With-TensorFlow.git

2. Installing the requirements:

Install all the requirements provided in the requirements.txt
cd Pose-Estimation-Similarity-With-TensorFlow
pip install -r requirements.txt
If any problem with downloading pycocotools follow this link at Step 4: https://github.com/markjay4k/Mask-RCNN-series/blob/master/Mask_RCNN%20Install%20Instructions.ipynb

3. Install SWIG:

conda install swig
or
Download Link: http://www.swig.org/survey.html

4. Build C++ library for post-processing:

cd tf_pose/pafprocess
swig -python -c++ pafprocess.i && python setup.py build_ext --inplace

5. Installing tf_slim:

cd ../../
pip install git+https://github.com/adrianc-a/tf-slim.git@remove_contrib
or
git clone https://github.com/google-research/tf-slim.git
and copy the folder tf_slim in it to our repository Pose-Estimation-Similarity-With-TensorFlow

6. Download Tensorflow Graph File(pb file).

cd models/graph/cmu
bash download.sh
cd ../../..

Conclusion:

  • We have developed a pose estimation similarity pipeline to compare similarity between two poses from the given feed of videos or live cam.
    Flaws:
  • This approach fails when the trainer is far or the user is near to the camera or vise-versa. This happens because there is a scale variation between the keypoints of the image.
    Solution:
  • We can eleminate this problem by croping out the image of a person using a CNN architecture like Yolo or anything that could detect the bounding boxes of a person.
  • This image then can be fed to the openpose model to estimate keypoints for both the sources.
    Scope of improvement:
  • The accuracy of the model for keypoint prediction can be increased by taking a much powerful pretrained model architecture than mobilenet.