Support for audio files #2037

Open
usmanm opened this issue Jan 10, 2025 · 5 comments

Comments

@usmanm

usmanm commented Jan 10, 2025

Similar to dspy.Image, it would be useful to add dspy.Audio. We've recently started using DSPy for our voice AI agent, but the lack of audio support is a blocker for many use cases.

We're happy to upstream a patch if we could get some guidance on how to get started.
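
For illustration only, here is a minimal sketch of what an audio input type might look like. Everything in it is an assumption: the `Audio` class, its `from_bytes` and `to_content_part` methods, and the field names are hypothetical and are not part of DSPy, and the message shape is modeled on the OpenAI-style `input_audio` content part, which other providers may not accept.

```python
import base64
from dataclasses import dataclass


@dataclass(frozen=True)
class Audio:
    """Hypothetical audio container; NOT an existing DSPy type."""

    data: str          # base64-encoded audio bytes
    audio_format: str  # container/codec label, e.g. "wav" or "mp3"

    @classmethod
    def from_bytes(cls, raw: bytes, audio_format: str = "wav") -> "Audio":
        # Encode raw bytes so they can be embedded in a JSON request body.
        encoded = base64.b64encode(raw).decode("ascii")
        return cls(data=encoded, audio_format=audio_format)

    def to_content_part(self) -> dict:
        # Assumed target shape: an OpenAI-style "input_audio" message part.
        return {
            "type": "input_audio",
            "input_audio": {"data": self.data, "format": self.audio_format},
        }
```

A caller would build the object with something like `Audio.from_bytes(raw_bytes, "wav")` and pass `to_content_part()` into the user message content; how that would plug into DSPy signatures is left open here.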

@okhat
Collaborator

okhat commented Jan 10, 2025

Thanks so much, @usmanm! This hasn't been a priority so far. I'm not sure it's planned, but I'll tag @isaacbmiller to see if he thinks it's worthwhile.

@isaacbmiller
Collaborator

I think it would be really cool! I would wait for #1801 to be merged, or base the patch on that branch.

I tried to design it so that other modalities would not be that difficult to implement, but I haven't used the audio API.

@ryanh-ai

We are also interested in this and would be glad to collaborate on or test a patch.

Any interest in video from this group?

@usmanm
Author

usmanm commented Jan 11, 2025

Nice, we'll wait for your change to go in, @isaacbmiller, and then take a stab at implementing audio file support.

We don't have a need for video, @ryanh-ai, but it would be cool to have more folks test out audio support.

@AriMKatz

Also interested in audio.
