Application of 2D human pose estimation for automated hold suggestion for visually impaired climbers

  • Anna Maschek

    Student thesis: Master's Thesis

    Abstract

    This thesis investigates the application of 2D human pose estimation models in the
    context of climbing, with the primary goal of leveraging computer vision to enable
    visually impaired climbers to train more independently.
    A novel dataset consisting of 23 videos featuring 11 different climbers ascending two
    routes is introduced. The dataset includes information about the pixel locations of holds
    in the video, the holds used by the climber in each video, and a homography matrix to
    calculate real-world distances.
    An initial evaluation of ten 2D human pose estimation models was conducted by
    overlaying their detections on the video to assess their effectiveness in identifying climbing poses. The three most accurate and reliable models, ViTPose L, YOLOv8-pose X,
    and MediaPipe H, are then used in combination with climbing hold positions to identify
    the holds currently used by the climber. The performance of the models for obtaining
    these predictions is thoroughly investigated on the custom dataset, with ViTPose L
    demonstrating the highest accuracy, albeit at the cost of taking twice as long to predict
    a single pose compared to the other two models.
    Additionally, two approaches for predicting the next hold are discussed: one rulebased and the other utilizing a database.
    Future work will focus on refining the prediction models and expanding the dataset
    to include more diverse climbing scenarios, further improving the system’s applicability
    in real-world settings.
    Date of Award2024
    Original languageEnglish (American)
    SupervisorDavid Christian Schedl (Supervisor)

    Studyprogram

    • Interactive Media

    Cite this

    '