Unpredictable Image Understanding Results #514

@chenchaolong0

Self Checks

  • This is only for bug reports. If you would like to ask a question, please head to Discussions.
  • I have searched for existing issues, including closed ones.
  • [FOR CHINESE USERS] Please be sure to submit issues in English. Thank you! :)
  • Please do not modify this template :) and fill in all the required fields.

Steps to reproduce

The default image understanding behavior tends to generate a description of the image rather than extract its content in the strictly structured form expected. Issues include:

  • Chinese content in the image is returned as an English summary.
  • Output resembles captioning or summarization instead of OCR.
  • Users cannot adjust the output format for specific scenarios.

Proposed fix: add a prompt template interface to the Image Understanding node so that users can control the extraction prompt.
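To make the request concrete, here is a minimal sketch of what a prompt template option could look like. It is not based on the project's actual code: `ImagePromptConfig`, `build_image_prompt`, and the default template wording are all hypothetical names chosen for illustration, and the sketch only builds the prompt string (it does not call any vision model).

```python
from dataclasses import dataclass
from string import Template
from typing import Optional

# Hypothetical built-in templates; the wording is illustrative only.
DEFAULT_TEMPLATES = {
    "caption": Template("Describe the image in $language."),
    "ocr": Template(
        "Extract all text from the image verbatim. "
        "Keep the original language ($language); do not translate or summarize. "
        "Return the result as $output_format."
    ),
}


@dataclass
class ImagePromptConfig:
    """Hypothetical per-node settings exposed to the user."""
    mode: str = "ocr"                        # "ocr" or "caption"
    language: str = "the source language"    # language the output should use
    output_format: str = "plain text"        # e.g. "JSON", "Markdown table"
    custom_template: Optional[str] = None    # user-supplied template, if any


def build_image_prompt(config: ImagePromptConfig) -> str:
    """Build the prompt text that would be sent along with the image."""
    if config.custom_template is not None:
        template = Template(config.custom_template)
    elif config.mode in DEFAULT_TEMPLATES:
        template = DEFAULT_TEMPLATES[config.mode]
    else:
        raise ValueError(f"unknown mode: {config.mode!r}")
    # safe_substitute leaves unknown $placeholders untouched instead of raising.
    return template.safe_substitute(
        language=config.language,
        output_format=config.output_format,
    )


if __name__ == "__main__":
    # Strict extraction of Chinese text as JSON, with no English summary.
    print(build_image_prompt(
        ImagePromptConfig(mode="ocr", language="Chinese", output_format="JSON")
    ))
```

With something along these lines, the current captioning behavior could remain available as the `caption` mode, while OCR-style extraction and fully custom templates become opt-in per node.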

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

Metadata

Assignees: No one assigned

Labels: bug (Something isn't working)
