Adding Hugging Face Dataset Link for Palestine #7

Open
3 of 6 tasks
mlibre opened this issue Feb 6, 2025 · 4 comments
Assignees
Labels
content proposal Content related to the Israel-Palestine issue. no_tech This issue not requires any technical attentions

Comments

@mlibre

mlibre commented Feb 6, 2025

Title:

Adding Hugging Face Dataset Link for Palestine

Description:

This issue proposes adding a link to the Hugging Face dataset mlibre/palestine to the repository. The dataset includes valuable information related to Palestine, which can be useful for training and fine-tuning LLMs and other AI models.

Type of Content:

Other (Dataset Link and Documentation)

Audience:

Researchers, Data Scientists, Activists, Students, and the General Public interested in Palestine-related data.

Attachments:

Hugging Face Dataset Link

Questions:

  • Are you sure this content is not already covered in the repository? ✅
  • Do you intend to improve someone else's existing content? If so, please provide details somewhere.
  • How will your proposed content contribute to the understanding of the Israel-Palestine issue?
    • It provides structured data that can be used for research, documentation, and training or fine-tuning machine learning models.
  • Is this a time-sensitive topic or event that needs immediate attention?
  • Do you have a timeframe for creating and submitting this content?
  • Are you open to collaborating with other contributors on this content? ✅

Additional Information:

This dataset could be used for a variety of applications, including academic research, media analysis, and advocacy work. Further documentation can be added if needed to explain the dataset’s structure and potential use cases.

@mlibre mlibre added content proposal Content related to the Israel-Palestine issue. no_tech This issue not requires any technical attentions labels Feb 6, 2025
@Zain-ul-din
Owner

Zain-ul-din commented Feb 7, 2025

@mlibre Sure! Bro, go for it. Just make sure the data is in JSON, CSV, or YAML so it can be sent over HTTP. Also consider adding timestamps to the data, like updatedAt.
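
For illustration, here is a minimal Python sketch of what stamping records with an `updatedAt` field and serializing them to JSON could look like. The record fields (`title`, `text`) and the helper names are hypothetical and are not taken from the actual dataset.

```python
import json
from datetime import datetime, timezone


def add_updated_at(record, now=None):
    """Return a copy of `record` with an ISO 8601 UTC `updatedAt` field."""
    stamp = (now or datetime.now(timezone.utc)).isoformat()
    return {**record, "updatedAt": stamp}


def to_json(records, now=None):
    """Serialize records as a JSON array, stamping each one first."""
    stamped = [add_updated_at(r, now) for r in records]
    return json.dumps(stamped, ensure_ascii=False)


# Hypothetical record; the real dataset's fields may differ.
sample = [{"title": "Example entry", "text": "..."}]

# A fixed timestamp keeps the example reproducible; omit `now` to use the current time.
fixed = datetime(2025, 2, 7, tzinfo=timezone.utc)
payload = to_json(sample, now=fixed)
print(payload)
```

An ISO 8601 string in UTC is one reasonable choice here because it survives JSON, CSV, and YAML round trips as plain text.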

@mlibre
Author

mlibre commented Feb 7, 2025

Thanks for the feedback, @Zain-ul-din! I will add documentation on how to download the dataset over HTTP, and I will also add timestamps to the data.

@mlibre
Author

mlibre commented Feb 8, 2025

Hey @Zain-ul-din

The dataset has been updated with a download link and an additional field to indicate when the data was scraped. The latest combined file can now be accessed here.
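
As a rough sketch of how a consumer might use such a file, the Python below downloads a text file over HTTP, parses it, and filters on the scrape timestamp. It assumes the combined file is a JSON array of objects and that the timestamp field is called `scraped_at` holding an ISO 8601 string; both are assumptions, since the thread does not state the file layout or the field name. The URL is left as a parameter because the download link is not reproduced here.

```python
import json
from datetime import datetime, timezone
from urllib.request import urlopen


def fetch_text(url, timeout=30):
    """Download a text resource over HTTP(S) and decode it as UTF-8."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def parse_records(text):
    """Parse a JSON array of objects (assumed layout of the combined file)."""
    records = json.loads(text)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of records")
    return records


def scraped_since(records, cutoff, field="scraped_at"):
    """Keep records whose timestamp field is at or after `cutoff`.

    `field` is a hypothetical name; records without it are skipped.
    """
    return [
        r for r in records
        if field in r and datetime.fromisoformat(r[field]) >= cutoff
    ]


# Offline example with made-up data, so no network access is needed.
sample_text = (
    '[{"text": "a", "scraped_at": "2025-02-08T10:00:00+00:00"},'
    ' {"text": "b", "scraped_at": "2025-01-01T00:00:00+00:00"}]'
)
cutoff = datetime(2025, 2, 1, tzinfo=timezone.utc)
recent = scraped_since(parse_records(sample_text), cutoff)
print([r["text"] for r in recent])
```

With a real link, `parse_records(fetch_text(url))` would replace the inline sample.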

@Zain-ul-din
Owner

Hi @mlibre, thank you!

2 participants