Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flow Elaboration & PSL Lexicon #21

Open
imsamimalik opened this issue Nov 15, 2023 · 5 comments
Open

Flow Elaboration & PSL Lexicon #21

imsamimalik opened this issue Nov 15, 2023 · 5 comments

Comments

@imsamimalik
Copy link

@AmitMY I have a little confusion understanding the flow of this process.

Why do we need to convert each text into the gloss, because, at the next step, you are using the gloss to find its relevant video? Can't we just skip the gloss part and just use text to find its video and proceed forward?

And what lexicon are you using for PSL(Pakistan Sign Language)?

Thanks!

@imsamimalik
Copy link
Author

And is it powering the https://sign.mt/ website? When I explored this repo, many implementations were missing.

@AmitMY
Copy link
Collaborator

AmitMY commented Nov 16, 2023

We have a paper describing the flow: https://arxiv.org/pdf/2305.17714.pdf

As for your question, we do search by words at first, but in reality, words are conjugated into different forms, in german for example, the verb changes depending on the subject. i "esse", you "isst", they "essen" - and the dictionary will likely only include the base form, "essen". glossifying (or in our default implementation, lemmatization) strips away this variety.

As for PSL - we use spreadthesign in https://sign.mt but ideally we would need to move away from it.

The "implementations" missing that you refer to - I understand those as missing lexicons, not code, since what is running in sign.mt is this repository, with a different lexicon. There are many lexicons one could add (for example sign-language-processing/datasets#32) but it is not our focus, and should be simple for end users to add

@imsamimalik
Copy link
Author

Oh okay, I got it. I noticed the NotImplemented placeholder in the pose-to-video module.

@imsamimalik
Copy link
Author

@AmitMY Sorry to disturb you again.

How do you extract a pose file from a video?

I have tried a few methods but they do not fulfill my needs.

  • I have tried the original OpenPose Python Library. But its model URL has not been working(the link where models are hosted is down). So eventually the absence of models makes it useless.
  • I have tried ControlNet. It can give me individual frames image files with pose detected but I have no clue how to convert it to a pose or JSON file.

Any help would be appreciated.

@AmitMY
Copy link
Collaborator

AmitMY commented Nov 25, 2023

Use MediaPipe holistic
easiest way imo is

pip install git+https://github.com/sign-language-processing/transcription
pip install mediapipe 

video_to_pose -i sign.mp4 --format mediapipe -o sign.pose

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants