RAG Eval Notebook #1113

Open
hardikjshah opened this issue Feb 14, 2025 · 0 comments

🚀 Describe the new functionality needed

A notebook showcasing the end-to-end (e2e) flow for running evals on RAG agents.
It will bring together much of the recent work on the eval APIs, agent updates, RAG updates, ReAct agents, etc.

  • Start from a local file with some benchmarking data
  • Upload the file using /files
  • Make a dataset out of it
  • Make a benchmark (register this dataset via /benchmarks)
  • Run the RAG agent eval via the /eval APIs
    • This RAG agent should include all the core changes: no ad hoc RAG calling, tool_option, system_prompt, etc.
  • Change the agent prompt or give the agent an extra tool, then re-run the eval
  • [stretch] Use docling for indexing / chunking to showcase improvements
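
As a rough illustration of the "eval, change the agent, re-eval" loop in the steps above, here is a minimal, library-free sketch. It does not call the actual /files, /benchmarks, or /eval APIs. The names `EvalRow`, `load_rows`, `evaluate`, the JSONL field names, the two stub agents, and the substring-match scoring are all hypothetical stand-ins chosen for illustration, not the project's real interfaces.

```python
import json
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import Callable


@dataclass
class EvalRow:
    """One benchmark example (field names are an assumption)."""
    input_query: str
    expected_answer: str


def load_rows(path: Path) -> list[EvalRow]:
    """Read one JSON object per non-empty line from a local benchmark file."""
    rows = []
    with path.open() as f:
        for line in f:
            line = line.strip()
            if line:
                record = json.loads(line)
                rows.append(EvalRow(record["input_query"], record["expected_answer"]))
    return rows


def evaluate(agent: Callable[[str], str], rows: list[EvalRow]) -> float:
    """Return the fraction of rows whose expected answer appears in the agent's reply.

    Case-insensitive substring matching is a placeholder for a real scoring function.
    """
    if not rows:
        return 0.0
    hits = sum(
        1 for row in rows
        if row.expected_answer.lower() in agent(row.input_query).lower()
    )
    return hits / len(rows)


if __name__ == "__main__":
    # Hypothetical local benchmark file with two rows.
    data_path = Path(tempfile.mkdtemp()) / "benchmark.jsonl"
    data_path.write_text(
        '{"input_query": "What is the capital of France?", "expected_answer": "Paris"}\n'
        '{"input_query": "Which planet is known as the Red Planet?", "expected_answer": "Mars"}\n'
    )
    rows = load_rows(data_path)

    # Stub agents standing in for a RAG agent before and after a prompt or tool change.
    def baseline_agent(query: str) -> str:
        return "Paris" if "France" in query else "I am not sure."

    def revised_agent(query: str) -> str:
        return "Paris" if "France" in query else "Mars"

    print("baseline score:", evaluate(baseline_agent, rows))
    print("revised score:", evaluate(revised_agent, rows))
```

In the notebook itself, the stub agents would be replaced by the real RAG agent configurations and the local scoring by the eval APIs. The point of the sketch is only that both runs score against the same registered data, so the change in score can be attributed to the agent change.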

💡 Why is this needed? What if we don't build it?

This showcases a first use case of the end-to-end software development lifecycle (SDLC) of a RAG agent.

Other thoughts

No response

@hardikjshah hardikjshah added the enhancement New feature or request label Feb 14, 2025
@hardikjshah hardikjshah added this to the v0.1.4 milestone Feb 14, 2025
@yanxi0830 yanxi0830 self-assigned this Feb 14, 2025
@hardikjshah hardikjshah modified the milestones: v0.1.4, v0.1.5 Feb 21, 2025