
Handling ValidationError in dspy when LLM output mismatches pydantic model #2148

Open
hucorz opened this issue Jan 19, 2025 · 2 comments

hucorz commented Jan 19, 2025

I’m encountering pydantic.ValidationError in dspy because the LLM output does not conform to the structure defined in my signature.

  1. How do you typically handle cases where LLM output does not match the expected model structure?

  2. Is there a way to bypass the ValidationError and return the raw LLM output for further inspection or processing?
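Pending a built-in option, one workaround for question 2 is to wrap the call and fall back to the raw text when structured parsing fails. The sketch below does not use dspy's API; `safe_parse` and the field names are hypothetical, and it stands in for whatever validation your signature performs:

```python
import json


def safe_parse(raw_output: str, required_fields: set[str]) -> dict:
    """Try to parse raw LLM output as JSON with the expected fields.

    On any failure, return the raw text untouched so the caller can
    inspect or post-process it instead of crashing on a ValidationError.
    """
    try:
        data = json.loads(raw_output)
        missing = required_fields - data.keys()
        if missing:
            raise ValueError(f"missing fields: {missing}")
        return {"parsed": data, "raw": raw_output}
    except (json.JSONDecodeError, ValueError):
        return {"parsed": None, "raw": raw_output}


# A well-formed response parses; a malformed one falls back to the raw text.
ok = safe_parse('{"answer": "42"}', {"answer"})
bad = safe_parse("Sure! The answer is 42.", {"answer"})
```

The same try/except pattern applies if you validate with a pydantic model directly: catch `pydantic.ValidationError` and return the raw completion.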

@AbhishekRP2002

Ideally, if the LLM output fails to conform to the pydantic class structure or JSON object signature, we should have a custom parser that takes the raw LLM output and parses it into the structured format using regex or other parsing logic.

LangChain follows this approach in its `parse_json_markdown` utility:

https://python.langchain.com/api_reference/_modules/langchain_core/utils/json.html#parse_json_markdown
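The idea behind that utility can be sketched in a few lines with only the standard library (this is a simplified illustration, not LangChain's actual implementation, which also repairs partial JSON):

```python
import json
import re


def parse_json_markdown(text: str):
    """Best-effort extraction of a JSON object from an LLM response:
    first try the whole string, then look inside ``` fences."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        pass
    # LLMs often wrap JSON in a markdown code fence, optionally tagged "json".
    match = re.search(r"```(?:json)?\s*(.*?)\s*```", text, re.DOTALL)
    if match:
        return json.loads(match.group(1))
    raise ValueError("no JSON object found in model output")


result = parse_json_markdown('Here you go:\n```json\n{"score": 0.9}\n```')
```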

@chenmoneygithub
Collaborator

@hucorz Thanks for reporting the issue! Could you share reproducible code?

@AbhishekRP2002 Best-effort parsing could make sense here, but I would first like to understand what fraction of responses that fail the automatic parsing a custom parser actually recovers. Our belief is that, even if it is not the case right now, LLMs will soon produce reliably structured output given the right prompt, so a regex parser as a fallback won't be needed.
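The recovery ratio mentioned above could be measured with a small harness like the one below. This is a hypothetical sketch: `lenient_parse` stands in for whatever best-effort parser is under evaluation, and the sample responses would in practice come from logged traffic that failed strict parsing.

```python
import json


def lenient_parse(text: str) -> dict:
    """Hypothetical best-effort parser: strip markdown fences, then parse."""
    stripped = text.strip().removeprefix("```json").removesuffix("```").strip()
    return json.loads(stripped)


# Toy sample of responses that all fail strict json.loads parsing.
failures = [
    '```json\n{"a": 1}\n```',
    "not json at all",
    '```json\n{"b": 2}\n```',
]

recovered = 0
for resp in failures:
    try:
        lenient_parse(resp)
        recovered += 1
    except json.JSONDecodeError:
        pass

# Fraction of strict-parse failures that the fallback parser recovers.
ratio = recovered / len(failures)
```

A high ratio would argue for shipping the fallback; a low one would support relying on better prompting instead.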
