Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature: Quickstart with smaller dataset #1

Open
cleong110 opened this issue Mar 26, 2024 · 5 comments
Open

feature: Quickstart with smaller dataset #1

cleong110 opened this issue Mar 26, 2024 · 5 comments

Comments

@cleong110
Copy link

cleong110 commented Mar 26, 2024

It would be real nice to have a smaller or even toy dataset for "get it running and give it a go" sort of purposes, so as to not need to download 500GB of pose data

Potentially DGS Corpus or RWTH Phoenix 2014 T?

@cleong110
Copy link
Author

cleong110 commented Mar 26, 2024

After various issues (sign-language-processing/datasets#65, sign-language-processing/datasets#66, sign-language-processing/datasets#67) I managed to get DGS Corpus running... until it crashed with a "Killed" message.

The default setting for DGS Corpus seems to download all the videos and load enough of them into memory that my system crashed.

@cleong110
Copy link
Author

https://stackoverflow.com/questions/65231843/is-it-possible-to-only-load-part-of-a-tensorflow-dataset suggests a potential solution. It may be that you can download the whole dataset, but only load a portion, by specifying something like split="train[:5%]"

@cleong110
Copy link
Author

@cleong110
Copy link
Author

OK, answered my own question here, but let me test it out at least.

@cleong110
Copy link
Author

Made an issue about it, sign-language-processing/datasets#68

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant