You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have been building and testing the code using a very small set of converted articles. Because these articles are protected by copyright, I cannot include them in the repository.
It would be great if the test data could be shared. To make sure that shareable test data are representative:
use current test files, but replace words in the body and title with random data
keep the markup of search phrases
keep the capitalisation
keep length of words the same
keep punctuation
words replacements may be the same within a document, but
I have been building and testing the code using a very small set of converted articles. Because these articles are protected by copyright, I cannot include them in the repository.
It would be great if the test data could be shared. To make sure that shareable test data are representative:
Perhaps this is something for https://github.com/gchq/CyberChef?
The text was updated successfully, but these errors were encountered: