Skip to content

Does shifting by one position when constructing the input break the causal chain? For example: input_ids_target = input_ids[:, 1:] #335

@fanbinqi

Description

@fanbinqi

Dear Author,

When constructing the input, does shifting input_ids_target to the right break the causal dependency chain among the input tokens?

Or is the causal relationship already implicitly captured in the hidden states by default?

Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions