@kwilcox We just spent an hour trying to figure out why some datasets from our catalog were not getting picked up by the crawler and eventually we found the problem: the path looked like
/models/model_a/run27/output_files/catalog.ncml
and it was getting rejected by the default skips because it contains "files".

The default skips are:
Could we remove the `.*files.*` line or, if we need it for some common use case, make it more specific, like `.*files$`?

https://github.com/ioos/thredds_crawler/blob/master/thredds_crawler/crawl.py#L55
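To illustrate the difference between the two patterns, here is a minimal sketch using only Python's standard `re` module. It assumes the skip patterns are applied with `re.match` against the whole path string, which may not be exactly how the crawler evaluates them; `is_skipped` is a hypothetical helper written for this example, not part of thredds_crawler.

```python
import re

# Path from the report above.
path = "/models/model_a/run27/output_files/catalog.ncml"

# Current default skip vs. the proposed, more specific one.
broad = re.compile(r".*files.*")
narrow = re.compile(r".*files$")


def is_skipped(pattern, candidate):
    """Hypothetical helper: True if the pattern matches the candidate string.

    Assumption: patterns are tested with re.match against the full string.
    """
    return pattern.match(candidate) is not None


# "files" anywhere in the path triggers the broad pattern.
broad_hit = is_skipped(broad, path)

# The narrow pattern only fires when the string ends in "files".
narrow_hit = is_skipped(narrow, path)
narrow_hit_on_suffix = is_skipped(narrow, "/models/model_a/files")

print(broad_hit, narrow_hit, narrow_hit_on_suffix)
```

Under that assumption, the broad pattern rejects the `output_files` path while `.*files$` lets it through and still skips entries whose name ends in "files".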