issues Search Results · repo:CivicActions/edscrapers language:Python
Filter by
86 results
(74 ms)86 results
inCivicActions/edscrapers (press backspace or delete to remove)Leaving a stack trace from the server here:
Exception on /_dash-update-component [POST]
Traceback (most recent call last):
File /usr/local/lib/python3.7/site-packages/flask/app.py , line 2446, in wsgi_app ...
bug
nightsh
- 1
- Opened on Aug 11, 2020
- #210
Update the Scraping Dashboard to properly report improvements that have been done on the scrapers. In our last round of
the scraping exercise, we also included the ability to track data profiles that were ...
higorspinto
- 1
- Opened on Jul 29, 2020
- #207
During an earlier run of one of our scrapers, a new URL was discovered: rems.ed.gov. This new URL will require a
dedicated scraper (crawler and parser) in order to extract structured data profiles from ...
higorspinto
- 2
- Opened on Jul 21, 2020
- #203
Edgov harvest source has an invalid entry that broke the fetch stage.
A data profile with name: presidents-fy-2010-budget-request-for-the-u-s-department-of-education contains an invalid
HelpDesk Email ...
higorspinto
- 1
- Opened on Jul 13, 2020
- #199
Example: https://us-ed-testing.ckan.io/dataset/the-education-innovator-december-7-2010
We need to avoid harvesting data profiles that only have TXT files as resources, as they are very likely to be false ...
higorspinto
- 1
- Opened on Jul 10, 2020
- #192
Add a metadata field to identify scraped datasets. This change allows us to create statistics related to data stewards
reviewing/updating the database.
Acceptance Criteria
- [x] Have a field that identifies ...
higorspinto
- Opened on Jun 23, 2020
- #188
The default value of Bureau Code field should be 018:00
Reference: https://docs.google.com/spreadsheets/d/1hVJludofP08Usdd0LuAC-Jst3DXkcpsxZ1j5sh_rOjc
Acceptance Criteria
- [x] Have the default value ...
higorspinto
- Opened on Jun 19, 2020
- #185
Description
datasets belong to Collections, while Collections belong to Sources. We have implemented Collections and Sources for
that data portal using collections transformer and sources transformer ...
osahon-okungbowa
- Opened on Jun 15, 2020
- #181
There are two copies / backups of old transformer versions in the code base. They both came out of PR #177. Making a
note here so we won t forget to tidy up the code.
- edscrapers/transformers/datajson/transform ...
housekeeping
nightsh
- 1
- Opened on Jun 8, 2020
- #178
During phase 1 we created a functional scraper for crawling and parsing data from this office. The scraped data was
successfully ingested into the data portal.
For phase 2, we need to improve the quality ...
higorspinto
- Opened on May 27, 2020
- #169

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Press the /
key to activate the search input again and adjust your query.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Press the /
key to activate the search input again and adjust your query.