MarkTechPost@AI · March 26
A Code Implementation for Advanced Human Pose Estimation Using MediaPipe, OpenCV and Matplotlib

 

This article presents a hands-on approach to human pose estimation using MediaPipe and OpenCV. After installing the required libraries, including mediapipe, opencv-python-headless, and matplotlib, and importing the relevant modules, you can build a pose-detection system. The article walks through the detect_pose function, which reads an image, detects human pose landmarks with MediaPipe, and returns the annotated image along with the landmark information. It also provides a visualize_pose function that displays the original image and the pose-annotated image side by side. Finally, the extract_keypoints function extracts each landmark's coordinates and visibility, showing how raw image data can be turned into a detailed understanding of human movement.

🧐 First, install and import the required libraries, including mediapipe, opencv-python-headless, and matplotlib. These libraries form the foundation of the pose-estimation pipeline.

💡 Next, define the detect_pose function, which reads an image and processes it with MediaPipe to detect human pose landmarks, returning the annotated image and the detected landmark information. Internally, the function reads the image with cv2.imread, converts it to RGB for MediaPipe processing, and relies on a pose model initialized with mp_pose.Pose, configured with parameters such as static image mode, model complexity, segmentation, and minimum detection confidence.

🖼️ Then, define the visualize_pose function, which displays the original image and the pose-annotated image. It uses matplotlib to show the two images side by side, making it easy to compare and inspect the pose-estimation results.

🔑 In addition, the article defines the extract_keypoints function, which converts the detected pose landmarks into a dictionary containing each keypoint's x, y, z coordinates and visibility, so the keypoint data is easy to access and reuse.

🚀 Finally, the article shows how to load an image, detect pose landmarks, visualize the results, and extract the keypoint information. With these steps, a raw image can be turned into a detailed skeletal map for analyzing human posture.

Human pose estimation is a cutting-edge computer vision technology that transforms visual data into actionable insights about human movement. By utilizing advanced machine learning models like MediaPipe’s BlazePose and powerful libraries such as OpenCV, developers can track body key points with high accuracy. In this tutorial, we explore the seamless integration of these tools, demonstrating how Python-based frameworks enable sophisticated pose detection across various domains, from sports analytics to healthcare monitoring and interactive applications.

First, we install the essential libraries:

!pip install mediapipe opencv-python-headless matplotlib

Then, we import the important libraries needed for our implementation:

import cv2
import mediapipe as mp
import matplotlib.pyplot as plt
import numpy as np

We initialize the MediaPipe Pose model in static image mode with segmentation enabled and a minimum detection confidence of 0.5. We also grab references to MediaPipe's utilities for drawing landmarks and applying drawing styles.

mp_pose = mp.solutions.pose
mp_drawing = mp.solutions.drawing_utils
mp_drawing_styles = mp.solutions.drawing_styles

pose = mp_pose.Pose(
    static_image_mode=True,
    model_complexity=1,
    enable_segmentation=True,
    min_detection_confidence=0.5
)

Here, we define the detect_pose function, which reads an image, processes it to detect human pose landmarks using MediaPipe, and returns the annotated image along with the detected landmarks. If landmarks are found, they are drawn using default styling.

def detect_pose(image_path):
    image = cv2.imread(image_path)
    image_rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    results = pose.process(image_rgb)
    annotated_image = image_rgb.copy()
    if results.pose_landmarks:
        mp_drawing.draw_landmarks(
            annotated_image,
            results.pose_landmarks,
            mp_pose.POSE_CONNECTIONS,
            landmark_drawing_spec=mp_drawing_styles.get_default_pose_landmarks_style()
        )
    return annotated_image, results.pose_landmarks

We define the visualize_pose function, which displays the original and pose-annotated images side by side using matplotlib. The extract_keypoints function converts detected pose landmarks into a dictionary of named keypoints with their x, y, z coordinates and visibility scores.

def visualize_pose(original_image, annotated_image):
    plt.figure(figsize=(16, 8))
    plt.subplot(1, 2, 1)
    plt.title('Original Image')
    plt.imshow(cv2.cvtColor(original_image, cv2.COLOR_BGR2RGB))
    plt.axis('off')
    plt.subplot(1, 2, 2)
    plt.title('Pose Estimation')
    plt.imshow(annotated_image)
    plt.axis('off')
    plt.tight_layout()
    plt.show()

def extract_keypoints(landmarks):
    if landmarks:
        keypoints = {}
        for idx, landmark in enumerate(landmarks.landmark):
            keypoints[mp_pose.PoseLandmark(idx).name] = {
                'x': landmark.x,
                'y': landmark.y,
                'z': landmark.z,
                'visibility': landmark.visibility
            }
        return keypoints
    return None
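Note that MediaPipe reports landmark x and y values normalized to the [0, 1] range relative to the image's width and height. As a small sketch (the to_pixel_coords helper below is our own addition, not part of the tutorial's code), the dictionary produced by extract_keypoints can be mapped back to pixel positions like this:

```python
def to_pixel_coords(keypoints, image_width, image_height):
    """Map normalized MediaPipe keypoints to integer pixel coordinates.

    Assumes the dictionary layout produced by extract_keypoints above:
    {name: {'x': ..., 'y': ..., 'z': ..., 'visibility': ...}}.
    """
    return {
        name: (int(kp['x'] * image_width), int(kp['y'] * image_height))
        for name, kp in keypoints.items()
    }

# Example with a synthetic keypoint on a 640x480 image:
sample = {'NOSE': {'x': 0.5, 'y': 0.25, 'z': -0.1, 'visibility': 0.99}}
print(to_pixel_coords(sample, 640, 480))  # {'NOSE': (320, 120)}
```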

Finally, we load an image from the specified path, detect and visualize human pose landmarks using MediaPipe, and then extract and print the coordinates and visibility of each detected keypoint.

image_path = '/content/Screenshot 2025-03-26 at 12.56.05 AM.png'
original_image = cv2.imread(image_path)
annotated_image, landmarks = detect_pose(image_path)
visualize_pose(original_image, annotated_image)
keypoints = extract_keypoints(landmarks)
if keypoints:
    print("Detected Keypoints:")
    for name, details in keypoints.items():
        print(f"{name}: {details}")
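Once the keypoints are extracted, they can feed simple kinematic analysis of the kind used in sports or healthcare applications. As an illustrative sketch (the joint_angle helper and the synthetic coordinates below are our own, not from the article), a joint angle such as the elbow can be computed from three keypoints:

```python
import math

def joint_angle(a, b, c):
    """Angle at point b (in degrees) formed by segments b->a and b->c.

    Each point is a dict with normalized 'x' and 'y' values, in the
    layout produced by extract_keypoints above.
    """
    v1 = (a['x'] - b['x'], a['y'] - b['y'])
    v2 = (c['x'] - b['x'], c['y'] - b['y'])
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    norm = math.hypot(*v1) * math.hypot(*v2)
    # Clamp to guard against floating-point drift outside [-1, 1]
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

# Synthetic keypoints describing a right angle at the elbow:
shoulder = {'x': 0.3, 'y': 0.2}
elbow = {'x': 0.3, 'y': 0.4}
wrist = {'x': 0.5, 'y': 0.4}
print(round(joint_angle(shoulder, elbow, wrist)))  # 90
```

With real output from the pipeline, these inputs would come from the extraction step, e.g. keypoints['LEFT_SHOULDER'], keypoints['LEFT_ELBOW'], and keypoints['LEFT_WRIST']; each landmark's visibility score should be checked before trusting the angle.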
Sample Processed Output

In this tutorial, we explored human pose estimation using MediaPipe and OpenCV, demonstrating a comprehensive approach to body keypoint detection. We implemented a robust pipeline that transforms images into detailed skeletal maps, covering key steps including library installation, pose detection function creation, visualization techniques, and keypoint extraction. Using advanced machine learning models, we showcased how developers can transform raw visual data into meaningful movement insights across various domains like sports analytics and healthcare monitoring.


Here is the Colab Notebook.
