issues Search Results · repo:CivicActions/edscrapers language:Python

86 results (74 ms)

Leaving a stack trace from the server here: Exception on /_dash-update-component [POST] Traceback (most recent call last): File "/usr/local/lib/python3.7/site-packages/flask/app.py", line 2446, in wsgi_app ...
bug
  • nightsh
  • 1
  • Opened 
    on Aug 11, 2020
  • #210

Update the Scraping Dashboard to properly report the improvements made to the scrapers. In our last round of the scraping exercise, we also included the ability to track data profiles that were ...
  • higorspinto
  • 1
  • Opened 
    on Jul 29, 2020
  • #207

During an earlier run of one of our scrapers, a new URL was discovered: rems.ed.gov. This new URL will require a dedicated scraper (crawler and parser) in order to extract structured data profiles from ...
  • higorspinto
  • 2
  • Opened 
    on Jul 21, 2020
  • #203

The Edgov harvest source has an invalid entry that broke the fetch stage. The data profile named presidents-fy-2010-budget-request-for-the-u-s-department-of-education contains an invalid HelpDesk Email ...
  • higorspinto
  • 1
  • Opened 
    on Jul 13, 2020
  • #199

Example: https://us-ed-testing.ckan.io/dataset/the-education-innovator-december-7-2010 We need to avoid harvesting data profiles that only have TXT files as resources, as they are very likely to be false ...
  • higorspinto
  • 1
  • Opened 
    on Jul 10, 2020
  • #192
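
The TXT-only rule described in #192 could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the dataset layout (a dict with a `resources` list whose items carry `format` and `url` keys) and the function names `has_only_txt_resources` and `filter_harvestable` are assumptions made for the example.

```python
def has_only_txt_resources(dataset):
    """Return True when the dataset has resources and all of them are TXT files.

    Assumed (hypothetical) layout: {"resources": [{"format": ..., "url": ...}]}.
    """
    resources = dataset.get("resources") or []
    if not resources:
        # No resources at all is a different problem; do not flag it here.
        return False
    return all(
        (res.get("format") or "").strip().lower() == "txt"
        or (res.get("url") or "").lower().endswith(".txt")
        for res in resources
    )


def filter_harvestable(datasets):
    """Keep only the datasets that are not TXT-only."""
    return [d for d in datasets if not has_only_txt_resources(d)]


if __name__ == "__main__":
    sample = [
        {"name": "txt-only", "resources": [{"format": "TXT", "url": "a.txt"}]},
        {"name": "mixed", "resources": [{"format": "TXT"}, {"format": "CSV"}]},
    ]
    print([d["name"] for d in filter_harvestable(sample)])
```

Checking both the declared format and the URL suffix is a design choice for the sketch, since scraped metadata may leave either field empty.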

Add a metadata field to identify scraped datasets. This change allows us to create statistics related to data stewards reviewing/updating the database. Acceptance Criteria - [x] Have a field that identifies ...
  • higorspinto
  • Opened 
    on Jun 23, 2020
  • #188

The default value of the Bureau Code field should be 018:00. Reference: https://docs.google.com/spreadsheets/d/1hVJludofP08Usdd0LuAC-Jst3DXkcpsxZ1j5sh_rOjc Acceptance Criteria - [x] Have the default value ...
  • higorspinto
  • Opened 
    on Jun 19, 2020
  • #185

Description: Datasets belong to Collections, while Collections belong to Sources. We have implemented Collections and Sources for that data portal using collections transformer and sources transformer ...
  • osahon-okungbowa
  • Opened 
    on Jun 15, 2020
  • #181

There are two copies / backups of old transformer versions in the code base. They both came out of PR #177. Making a note here so we won't forget to tidy up the code. - edscrapers/transformers/datajson/transform ...
housekeeping
  • nightsh
  • 1
  • Opened 
    on Jun 8, 2020
  • #178

During phase 1 we created a functional scraper for crawling and parsing data from this office. The scraped data was successfully ingested into the data portal. For phase 2, we need to improve the quality ...
  • higorspinto
  • Opened 
    on May 27, 2020
  • #169