-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GitHub Pages Connector #4233
base: main
Are you sure you want to change the base?
GitHub Pages Connector #4233
Conversation
@akhilender-bongirwar is attempting to deploy a commit to the Danswer Team on Vercel. A member of the Team first needs to authorize it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PR Summary
Added a new GitHub Pages connector to index content from GitHub Pages sites, implementing direct GitHub API integration with rate limiting and error handling.
- Missing
os
import ingithub_pages/connector.py
but usingos.path
functions doc_updated_at
uses current time instead of file's last modified time in_process_file
- No validation for file size or recursion depth in
_get_all_files
traversal - Python version change from 3.11 to 3.12 in pre-commit config creates inconsistency with CI pipeline
- Connector config lacks advanced settings like branch selection or path filtering
💡 (2/5) Greptile learns from your feedback when you react with 👍/👎!
10 file(s) reviewed, 8 comment(s)
Edit PR Review Bot Settings | Greptile
backend/tests/daily/connectors/github_pages/test_github_pages_connector.py
Outdated
Show resolved
Hide resolved
Signed-off-by: Akhilender Bongirwar <[email protected]>
6e1b30b
to
a5dc6d5
Compare
@yuhongsun96 please review |
This PR adds a connector for GitHub Pages.
fixes #2282
2025-03-08.22-11-35.mp4