Train Hand gesture recognition

Question

ila555 opened this issue 5 years ago · comments

ila555 commented 5 years ago

I want to train hand gesture recognition using your hand tracker, but I don't know where to start.

ila555 commented 5 years ago

thanks

Volodymyr · Answer 1 · Mon Feb 17 2020 17:04:16 GMT+0800 (China Standard Time)

Data collection:
You need a dataset of labeled video sequences, which I assume you have. If you don't, I've seen a paper by some people at MIT who did gesture recognition; hopefully they used public data.
Run the hand pose estimation algorithm and see how well it performs. Most likely you'll need some post-processing, at least to smooth the predictions over time, since the current iteration does pose estimation frame by frame.
Clean the new pose-labeled dataset you now have: pick the subsequences that the pose predictor handles reasonably well.

After all that, you should have a decent starting point for training a gesture recognition algorithm.
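
As a rough illustration of the temporal smoothing step, here is a minimal sketch using a centered moving average. It assumes you have already stacked the per-frame keypoints into one numpy array of shape (frames, keypoints, coordinates); the function name `smooth_keypoints` and the default window size are made up for this example and are not part of the repo.

```python
import numpy as np


def smooth_keypoints(sequence, window=5):
    """Centered moving average over time.

    sequence: array-like of shape (T, K, D), i.e. T frames, K keypoints,
    D coordinates per keypoint (an assumed layout, not the repo's API).
    """
    sequence = np.asarray(sequence, dtype=float)
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    # Repeat the first and last frames so the output keeps all T frames.
    padded = np.pad(sequence, ((half, half), (0, 0), (0, 0)), mode="edge")
    smoothed = np.empty_like(sequence)
    for t in range(len(sequence)):
        # padded[t : t + window] is the window centered on original frame t.
        smoothed[t] = padded[t:t + window].mean(axis=0)
    return smoothed
```

A larger window removes more jitter but also blurs fast gestures, so it is worth comparing a few values on your own sequences.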

ila555 · Answer 2 · Tue Feb 18 2020 17:09:42 GMT+0800 (China Standard Time)

What do you mean by hand pose estimation? Is it what I get by running the imported hand tracker, or do I have to write my own hand pose estimation algorithm?

Volodymyr · Answer 3 · Wed Feb 19 2020 04:54:13 GMT+0800 (China Standard Time)

I meant the hand tracker

ila555 · Answer 4 · Thu Feb 20 2020 15:22:25 GMT+0800 (China Standard Time)

Is there any way that I can do something like this with the Hand Tracker?

float pseudoFixKeyPoint = landmarkList.landmark(2).x();
if (landmarkList.landmark(3).x() < pseudoFixKeyPoint && landmarkList.landmark(4).x() < pseudoFixKeyPoint)
{
    thumbIsOpen = true;
}

pseudoFixKeyPoint = landmarkList.landmark(6).y();
if (landmarkList.landmark(7).y() < pseudoFixKeyPoint && landmarkList.landmark(8).y() < pseudoFixKeyPoint)
{
    firstFingerIsOpen = true;
}

pseudoFixKeyPoint = landmarkList.landmark(10).y();
if (landmarkList.landmark(11).y() < pseudoFixKeyPoint && landmarkList.landmark(12).y() < pseudoFixKeyPoint)
{
    secondFingerIsOpen = true;
}

pseudoFixKeyPoint = landmarkList.landmark(14).y();
if (landmarkList.landmark(15).y() < pseudoFixKeyPoint && landmarkList.landmark(16).y() < pseudoFixKeyPoint)
{
    thirdFingerIsOpen = true;
}

pseudoFixKeyPoint = landmarkList.landmark(18).y();
if (landmarkList.landmark(19).y() < pseudoFixKeyPoint && landmarkList.landmark(20).y() < pseudoFixKeyPoint)
{
    fourthFingerIsOpen = true;
}

Volodymyr · Answer 5 · Fri Feb 21 2020 03:21:10 GMT+0800 (China Standard Time)

Well, the code you've pasted is C++, so there's a lot of extra complication in reading the predicted landmarks, but generally speaking, yes, you can do the same thing with the code in this repo.
First, follow the provided example to get a prediction dict that holds all the keypoints in a numpy array, then write custom code to do the above.
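
A minimal sketch of what that custom code could look like in Python. It assumes you have pulled the keypoints out of the prediction dict into a 21 x 2 array of (x, y) image coordinates, ordered the same way as the landmark indices in the C++ snippet; I'm not naming the dict key here, so check the example for it. The names `FINGER_JOINTS` and `finger_states` are made up for illustration.

```python
# (reference joint, middle joint, fingertip) indices, copied from the
# C++ snippet above. Assumes the keypoints use the same ordering.
FINGER_JOINTS = {
    "thumb": (2, 3, 4),
    "first": (6, 7, 8),
    "second": (10, 11, 12),
    "third": (14, 15, 16),
    "fourth": (18, 19, 20),
}


def finger_states(keypoints):
    """Return {finger name: True if the finger looks open}.

    keypoints: a numpy array or nested list of shape (21, 2) holding
    (x, y) per keypoint, with y growing downward as in image coordinates.
    """
    states = {}
    for name, (ref, mid, tip) in FINGER_JOINTS.items():
        # The thumb is compared along x, the other fingers along y,
        # exactly as in the C++ snippet.
        axis = 0 if name == "thumb" else 1
        ref_value = keypoints[ref][axis]
        states[name] = bool(
            keypoints[mid][axis] < ref_value and keypoints[tip][axis] < ref_value
        )
    return states
```

Like the C++ version, this only works for an upright hand, and the thumb test along x holds for one hand orientation only, so mirror the comparison if your images are flipped.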