Self Checks
Steps to reproduce
By default, image understanding tends to generate a description of the image instead of extracting its content in the strictly structured form users expect. Issues include:
Chinese content in the image is returned as an English summary
Output resembles captioning or summarization instead of OCR
Users cannot adjust the output format for specific scenarios
Proposed fix: add a prompt template interface to the Image Understanding node so that users can control the output language and format.
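To make the request concrete, here is a minimal sketch of what a user-editable prompt template could look like. It is not the node's actual API: the names `DEFAULT_OCR_TEMPLATE` and `render_image_prompt` and the placeholders `$language` and `$output_format` are all hypothetical, and the sketch only shows how a template could be filled in before being sent to the vision model.

```python
from string import Template

# Hypothetical default template. The placeholder names ($language,
# $output_format) are illustrative assumptions, not existing node settings.
DEFAULT_OCR_TEMPLATE = Template(
    "Extract all text from the image verbatim. "
    "Keep the original language ($language); do not translate or summarize. "
    "Return the result as $output_format."
)


def render_image_prompt(template=DEFAULT_OCR_TEMPLATE, **variables):
    """Fill in the template; raises KeyError if a placeholder is left unset."""
    return template.substitute(**variables)


# Example: ask for strict OCR of Chinese text in a structured format.
prompt = render_image_prompt(
    language="Chinese",
    output_format="a JSON object with a 'lines' array",
)
print(prompt)
```

With something like this, a user could replace the default template per scenario (for example, plain OCR, table extraction, or captioning) instead of relying on a fixed built-in prompt.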
✔️ Expected Behavior
No response
❌ Actual Behavior
No response