Skip to content

VAL always response "" #21

@Zxy-MLlab

Description

@Zxy-MLlab

Can your VAL be evaluated correctly? Download I'm asking for VAL, then set the environment variable: os.environ['VAL'] =/home/zhouxueyang/agent/plan/planbench/LLMs-Planning-main/planner_tools/VAL

evaluate file:
{'instance_id': 2, 'example_instance_ids': [1], 'query': 'I am playing with a set of blocks where I need to arrange the blocks into stacks. Here are the actions I can do\n\nPick up a block\nUnstack a block from on top of another block\nPut down a block\nStack a block on top of another block\n\nI have the following restrictions on my actions:\nI can only pick up or unstack one block at a time.\nI can only pick up or unstack a block if my hand is empty.\nI can only pick up a block if the block is on the table and the block is clear. A block is clear if the block has no other blocks on top of it and if the block is not picked up.\nI can only unstack a block from on top of another block if the block I am unstacking was really on top of the other block.\nI can only unstack a block from on top of another block if the block I am unstacking is clear.\nOnce I pick up or unstack a block, I am holding the block.\nI can only put down a block that I am holding.\nI can only stack a block on top of another block if I am holding the block being stacked.\nI can only stack a block on top of another block if the block onto which I am stacking the block is clear.\nOnce I put down or stack a block, my hand becomes empty.\nOnce you stack a block on top of a second block, the second block is no longer clear.\n\n[STATEMENT]\nAs initial conditions I have that, the red block is clear, the blue block is clear, the yellow block is clear, the hand is empty, the blue block is on top of the orange block, the red block is on the table, the orange block is on the table and the yellow block is on the table.\nMy goal is to have that the orange block is on top of the blue block.\n\nMy plan is as follows:\n\n[PLAN]\nunstack the blue block from on top of the orange block\nput down the blue block\npick up the orange block\nstack the orange block on top of the blue block\n[PLAN END]\n\n[STATEMENT]\nAs initial conditions I have that, the red block is clear, the yellow block is clear, the hand is empty, the red block is on top of the blue block, the yellow block is on top of the orange block, the blue block is on the table and the orange block is on the table.\nMy goal is to have that the orange block is on top of the red block.\n\nMy plan is as follows:\n\n[PLAN]', 'ground_truth_plan': '(unstack yellow orange)\n(put-down yellow)\n(pick-up orange)\n(stack orange red)\n', 'llm_raw_response': "Based on the initial conditions and the goal, I will create a plan to achieve the goal.\n\n[PLAN]\nunstack the yellow block from on top of the orange block\nput down the yellow block\nunstack the red block from on top of the blue block\nput down the red block\npick up the orange block\nstack the orange block on top of the red block\n[PLAN END]\n\nThis plan should achieve the goal of having the orange block on top of the red block. Let me know if you'd like me to explain the reasoning behind each step!", 'extracted_llm_plan': '(unstack d c)\n(put-down d)\n(pick-up c)\n(stack c a)\n', 'llm_correct': False}

But when I evaluated the response, always returns ' '.

Is there something wrong with my deployment?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions