Async streaming llamaindex openai agent example #71
base: main
Conversation
Hi, this is a great start! Thank you for your work and the PR :) I left some comments in the code for things to change; let me know if there are any questions.
@Isaac-Flath Hi, where did you leave the comments? I do not see code reviews for some reason.
@Kvit - I added them as inline comments in the code. Here's what it looks like when looking at the files. Let me know if you can see them! [screenshot]
@Isaac-Flath, no, those I cannot see
@Isaac-Flath perhaps you have to submit the review for me to see the comments
# add Routers https://docs.fastht.ml/ref/handlers.html#apirouter
# Markdown support for daisy ui chat https://isaac-flath.github.io/website/posts/boots/FasthtmlTutorial.html

from fasthtml.common import Div, Span, Body, P, Form, Input, Button, Script, Link, Label, Nav, Title, Template, Style, serve
We can do from fasthtml.common import * to match the coding style of the other apps on the gallery.
# =============== Router ===============
# in this example we will use the APIRouter to create a chat application and then mount it to the main app

r_chat = APIRouter()
We can define the app using app, rt = fast_app() and use @rt as the decorator. The main advantage of APIRouter is that it makes it easier to import routes from other files in multi-file apps, but since this is a single-file app we don't need APIRouter.
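A minimal sketch of what the reviewer is suggesting, assuming the app keeps the websocket extension; the route and its contents are illustrative, not the PR's actual code:

```python
from fasthtml.common import *

# fast_app returns the app plus the rt route decorator; no APIRouter needed
# for a single-file app
app, rt = fast_app(exts='ws')

@rt('/')
def get():
    return Titled("Chat", Div(id='chat'))

serve()
```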
# Init global variables for OpenAI agent
agent = None
api_key_set = False  # flag to check if API key is set
Is there any conflict with multiple users? Is it a problem if these values are set by one user and used by another? Does this need to be separated out by user or by session?
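One way to avoid the cross-user conflict the reviewer is asking about, sketched with a plain dict standing in for per-session storage; all names here are illustrative, not the PR's code:

```python
# Keep per-user state keyed by a session id instead of module-level globals,
# so one user's API key or agent never leaks to another user.
user_state = {}

def get_state(session_id):
    # Create this user's slot on first access
    return user_state.setdefault(session_id, {'agent': None, 'api_key_set': False})

def set_api_key(session_id, api_key):
    state = get_state(session_id)
    state['api_key_set'] = bool(api_key)
    return state

a = set_api_key('user-a', 'sk-test')
b = get_state('user-b')  # untouched by user-a's key
```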
Raises:
    AssertionError: If msg_idx is None.
"""
Instead of numpy style docstrings let's use fastcore's docments. Generally we want things to match the fastai style guidelines (https://docs.fast.ai/dev/style.html) and minimize some vertical space.
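For reference, the docments style puts each parameter's description in a trailing comment on its own line; a hypothetical version of this handler's signature might look like:

```python
def send_message(
    msg:str,            # The chat message to send to the client
    msg_idx:int=None,   # Index of the message in the chat history
):
    "Send a chat message; msg_idx must be provided."
    assert msg_idx is not None, "msg_idx must be provided"
    return msg, msg_idx
```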
send (callable): The function to send data back to the WebSocket client.
session (dict): The session data for the current WebSocket connection.
Workflow:
1. Checks if the OpenAI API key is set. If not, sends an error message to the client.
Can these workflow steps be put as inline comments right next to the code that performs each step, instead of in a long docstring? That way the explanation and the code stay together.
), cls='container mx-auto w-full px-4'),

# Chat messages
Div(
Can you minimize the vertical space a bit more to match the style described in the fastai style guide? https://docs.fast.ai/dev/style.html
Script(type="module", src="https://cdn.jsdelivr.net/npm/zero-md@3?register")  # zero-md for markdown rendering
)
# Set up the app, including daisyui and tailwind for the chat component
app = FastHTMLWithLiveReload(hdrs=headers, exts='ws', debug=True, htmlkw=dict(
Can you use FastHTML or fast_app instead? Live reload and debug are great when developing, but generally we want to remove them when we deploy.
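A sketch of the deployment setup the reviewer is describing, assuming the app's existing headers list; the htmlkw contents from the original line are elided here, so this is a fragment rather than the actual fix:

```python
from fasthtml.common import FastHTML

headers = ()  # placeholder: use the app's existing headers tuple

# FastHTML instead of FastHTMLWithLiveReload; no debug flag for deployment
app = FastHTML(hdrs=headers, exts='ws')
```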
    Exception: If there is an error during the initialization of the OpenAIAgent or querying the agent.
"""
global agent, api_key_set
agent = OpenAIAgent.from_tools(llm=OpenAI(model="gpt-4o", api_key=apikey))
With agent being a global variable shared by all users, does this cause issues where there could be conflicts when multiple users are using the app at the same time?
    print(f"Hello from OpenAI: {hello}")

except Exception as e:
    print(f"Error setting OpenAI API Key: {e}")
Can you catch the specific exception instead of all exceptions? This could be confusing if it fails for a different reason: it will print "Error setting OpenAI API Key" regardless of what the exception was for, and won't give a stack trace.
line_buffer = ""
async for response in response_stream.async_response_gen():
    chunk_message = response
    print(chunk_message, end='', flush=True)  # print message to console, remove for production
Can you go through and remove the print messages for the PR? I think there's a couple throughout which is great for debugging but don't want them in prod.
🤦 @Kvit How about now?
Example of async chat with an OpenAI Agent wrapped with LlamaIndex RAG
Handles input of the OpenAI API key from the on-screen entry form