Structured JSON output key ordering issue causes incorrect reasoning steps #354

yuan-li · 2024-12-16T12:53:35Z

Description of the bug:

I’ve encountered a bug when using the structured output feature with the Gemini model. The JSON keys in the returned response appear sorted alphabetically rather than in the order defined by the schema I provided. Because the model generates the fields in that sorted order, this interferes with the chain-of-thought reasoning steps.

Actual vs expected behavior:

I copied the chain-of-thought example from OpenAI's structured outputs guide (https://platform.openai.com/docs/guides/structured-outputs#chain-of-thought):

import google.generativeai as genai
from pydantic import BaseModel

class Step(BaseModel):
    explanation: str
    output: str

class MathReasoning(BaseModel):
    steps: list[Step]
    final_answer: str

model = genai.GenerativeModel("gemini-1.5-flash-8b")
result = model.generate_content(
    "how can I solve 8x + 7 = -23",
    generation_config=genai.GenerationConfig(
        response_mime_type="application/json", response_schema=MathReasoning, temperature=0
    ),
)

The response is:

{'final_answer': '-5',
 'steps': [{'explanation': 'Subtract 7 from both sides of the equation to isolate the term with x.',
   'output': '8x = -30'},
  {'explanation': 'Divide both sides of the equation by 8 to solve for x.',
   'output': 'x = -30/8'},
  {'explanation': 'Simplify the fraction.', 'output': 'x = -15/4'}]}

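Here, final_answer appears before steps, so the answer is produced before the reasoning steps, and it is incorrect ('-5', while the steps themselves arrive at 'x = -15/4'). This suggests that the keys are sorted alphabetically, so one workaround is to rename the fields so that their alphabetical order matches the intended order:

import google.generativeai as genai
from pydantic import BaseModel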

class Step(BaseModel):
    explanation: str
    output: str

class MathReasoning(BaseModel):
    calculation_steps: list[Step]
    final_answer: str

model = genai.GenerativeModel("gemini-1.5-flash-8b")
result = model.generate_content(
    "how can I solve 8x + 7 = -23",
    generation_config=genai.GenerationConfig(
        response_mime_type="application/json", response_schema=MathReasoning, temperature=0
    ),
)

And the response is as expected this time:

{'calculation_steps': [{'explanation': 'Subtract 7 from both sides of the equation.',
   'output': '8x + 7 - 7 = -23 - 7'},
  {'explanation': 'Simplify both sides of the equation.',
   'output': '8x = -30'},
  {'explanation': 'Divide both sides of the equation by 8.',
   'output': '8x / 8 = -30 / 8'},
  {'explanation': 'Simplify both sides of the equation.',
   'output': 'x = -30/8'},
  {'explanation': 'Simplify the fraction.', 'output': 'x = -15/4'}],
 'final_answer': '-15/4'}
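
A quick way to see that both responses are consistent with alphabetical sorting is to compare each schema's declared field order with its sorted order. This is only a sketch: the helper name survives_alphabetical_sort is hypothetical, and it checks the hypothesis from this issue rather than any documented API behavior.

```python
def survives_alphabetical_sort(field_names):
    """Return True if the declared order equals the alphabetically sorted order."""
    return list(field_names) == sorted(field_names)


# Field names in the order they are declared in the two schemas above.
original = ["steps", "final_answer"]
workaround = ["calculation_steps", "final_answer"]
step_fields = ["explanation", "output"]

print(sorted(original))                         # order seen in the first response
print(survives_alphabetical_sort(original))     # declared order is not preserved
print(survives_alphabetical_sort(workaround))   # declared order is preserved
print(survives_alphabetical_sort(step_fields))  # nested Step fields are unaffected
```

Under this assumption, only the top-level fields of the original schema are reordered, which matches what both responses show.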

Any other information you'd like to share?

This behavior suggests that the keys may be sorted alphabetically internally, rather than following the schema order. It would be helpful if the model could preserve the original field order to maintain the intended reasoning flow.
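
Until the order is preserved, the renaming workaround could be generalized by prefixing each field name so that alphabetical order matches declaration order, then stripping the prefixes from the parsed response. The sketch below assumes the keys really are sorted alphabetically, which is not confirmed. The helpers prefixed_names and strip_prefixes are hypothetical, not part of any SDK, and the renamed keys may also affect how the model interprets the schema.

```python
import string


def prefixed_names(field_names):
    """Map each declared name to a prefixed name that sorts in declaration order."""
    letters = string.ascii_lowercase
    if len(field_names) > len(letters):
        raise ValueError("this sketch supports at most 26 fields")
    return {name: f"{letters[i]}_{name}" for i, name in enumerate(field_names)}


def strip_prefixes(response, mapping):
    """Rename prefixed keys in a parsed response back to their original names."""
    reverse = {prefixed: name for name, prefixed in mapping.items()}
    return {reverse.get(key, key): value for key, value in response.items()}


mapping = prefixed_names(["steps", "final_answer"])
print(mapping)  # the names to use when declaring the schema fields

# A parsed response that uses the prefixed names (values are illustrative only).
parsed = {"a_steps": [], "b_final_answer": "-15/4"}
print(strip_prefixes(parsed, mapping))
```

The prefixed names would be used as the field names in the Pydantic models passed as response_schema, with nested models handled the same way.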


Giom-V · 2024-12-17T08:03:37Z

Hello @yuan-li,
Thank you for the feedback. I'm routing it internally to the folks in charge of structured output.

yuan-li · 2024-12-17T09:37:26Z

Thank you @Giom-V

seniorb · 2024-12-19T10:34:49Z

I'm having this issue as well: the JSON is output with alphabetically arranged keys, not in the order specified.

Giom-V · 2025-01-09T09:15:59Z

The feature is still in the backlog, but in the meantime, have you tried the thinking model?

github-actions bot mentioned this issue on Jan 1, 2025: Monthly issue metrics report markmcd/gemini-api-cookbook#10 (Open)

Giom-V added the labels "type:feature request" (New feature request/enhancement) and "status:pending implementation" (Feature request pending implementation from the Eng team) on Jan 9, 2025.
