Request for Reproducible Script and Checkpoint of Llava-1.5-7B

Hi,

Thank you for sharing your wonderful study! The concepts of prefusion and vision compression are very intriguing. 

Despite several attempts to reproduce your work with the Llava-1.5-7B model, I have encountered failures: the training loss decreases similarly to that of the original Llava, but the downstream performance is much worse.
- My trials have included varying learning hyperparameters, conducting separate intermediate training for prefusion and compression while freezing the projector and LLM, and initializing from the Llava checkpoint.

Could you please share the exact training script used for the paper and release the model checkpoint of LLaVA-v1.5-Vicuna-7B for future research?

Thank you for checking this matter.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Request for Reproducible Script and Checkpoint of Llava-1.5-7B #21

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Request for Reproducible Script and Checkpoint of Llava-1.5-7B #21

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions