This repository consists of all the code required for similar 2-D pose detection in dance videos. This can used for any type of pose estimation application to find the similarity.
We will be using tensorflow for our position estimation using openpose for tensorflow.
Ildoo Kim: https://github.com/ildoonet
GitHub Repo Link: https://github.com/ildoonet/tf-pose-estimation
Gunjan Seth: https://github.com/gsethi2409
GitHub Repo Link: https://github.com/gsethi2409/tf-pose-estimation
2D-Pose-Similarity.ipynb
- Python 3.7
- Tensorflow 2.0+
- OpenCV
Clone this repository into your local machine.
git clone https://github.com/mohit9949/Pose-Estimation-Similarity-With-TensorFlow.git
Install all the requirements provided in the requirements.txt
cd Pose-Estimation-Similarity-With-TensorFlow
pip install -r requirements.txt
If any problem with downloading pycocotools follow this link at Step 4: https://github.com/markjay4k/Mask-RCNN-series/blob/master/Mask_RCNN%20Install%20Instructions.ipynb
conda install swig
or
Download Link: http://www.swig.org/survey.html
cd tf_pose/pafprocess
swig -python -c++ pafprocess.i && python setup.py build_ext --inplace
cd ../../
pip install git+https://github.com/adrianc-a/tf-slim.git@remove_contrib
or
git clone https://github.com/google-research/tf-slim.git
and copy the folder tf_slim
in it to our repository Pose-Estimation-Similarity-With-TensorFlow
cd models/graph/cmu
bash download.sh
cd ../../..
- We have developed a pose estimation similarity pipeline to compare similarity between two poses from the given feed of videos or live cam.
Flaws: - This approach fails when the trainer is far or the user is near to the camera or vise-versa. This happens because there is a scale variation between the keypoints of the image.
Solution: - We can eleminate this problem by croping out the image of a person using a CNN architecture like Yolo or anything that could detect the bounding boxes of a person.
- This image then can be fed to the openpose model to estimate keypoints for both the sources.
Scope of improvement: - The accuracy of the model for keypoint prediction can be increased by taking a much powerful pretrained model architecture than mobilenet.