Llama 3.1 8B by suachong · Pull Request #813 · mlcommons/training

suachong · 2025-08-13T21:55:31Z

Continuing from the previous PR #799 with some cleanups.

github-actions · 2025-08-13T21:55:39Z

MLCommons CLA bot:
Thank you very much for your submission, we really appreciate it. Before we can accept your contribution, we ask that you sign the MLCommons CLA (Apache 2). Please use this [Google form] (https://forms.gle/Ew1KkBVpyeJDuRw67) to initiate authorization. If you are from an MLCommons member organization, we will request that you be added to the CLA. If you are not from a member organization, we will email you a CLA to sign. For any questions, please contact support@mlcommons.org.
4 out of 5 committers have signed the MLCommons CLA.
✅ @ZixianWangAMD
✅ @mmarcinkiewicz
✅ @hXl3s
✅ @suachong
❌ @zixian Wang
Zixian Wang seems not to be a GitHub user. You need a GitHub account after you become MLCommons member. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You can retrigger this bot by commenting recheck in this Pull Request}

ShriyaRishab · 2025-08-14T14:49:03Z

+
+<!-- # 6. Other
+
+#### Run model conversion


Can we drop this section because there is no starting checkpoint for this benchmark. Submitters should not start from the HF checkpoint and instead they need to start from randomly initialized weights

ShriyaRishab · 2025-08-14T14:50:41Z

+#     This is the checkpoint that we want to start with. 
+#     Each checkpoint should be a folder containing two sub-folders: context and weights. 
+#     And we need to pass this folder's path (the folder containing context and weights) here.  
+export MODEL_CKPT="/data/llama3_8b/model/Llama-3.1-8B_nemo"


There is no model ckpt, this should not be set.

ShriyaRishab · 2025-08-14T14:52:23Z

+export USE_CKPT=0
+# Model: Whether we are resuming from a NeMo-formatted HuggingFace checkpoint (weights only). 
+#     If set to 1, then checkpoint resuming code will not try to load the optimizer states. 
+export FROM_HF=1


Can we remove these flags since they are not relevant

ShriyaRishab · 2025-08-14T14:53:16Z

+#     This is the checkpoint that we want to start with. 
+#     Each checkpoint should be a folder containing two sub-folders: context and weights. 
+#     And we need to pass this folder's path (the folder containing context and weights) here.  
+export MODEL_CKPT="/data/llama3_8b/model/Llama-3.1-8B_nemo"


ShriyaRishab · 2025-08-14T14:53:28Z

+export USE_CKPT=0
+# Model: Whether we are resuming from a NeMo-formatted HuggingFace checkpoint (weights only). 
+#     If set to 1, then checkpoint resuming code will not try to load the optimizer states. 
+export FROM_HF=1


remove HF flags

ShriyaRishab · 2025-08-14T14:56:17Z

@@ -0,0 +1,10 @@
+if __name__ == "__main__":


This file is not necessary

ShriyaRishab · 2025-08-14T14:56:30Z

@@ -0,0 +1,23 @@
+#!/bin/bash


This file is not necessary

ShriyaRishab · 2025-08-14T14:58:00Z

There is an issue with the CLA again - this needs to be fixed #813 (comment)

suachong requested a review from a team as a code owner August 13, 2025 21:55

ShriyaRishab reviewed Aug 14, 2025

View reviewed changes

suachong closed this Aug 15, 2025

suachong force-pushed the master branch from e1a5600 to 2000892 Compare August 15, 2025 02:41

github-actions Bot locked and limited conversation to collaborators Aug 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Llama 3.1 8B #813

Llama 3.1 8B #813
suachong wants to merge 0 commit into
mlcommons:masterfrom
suachong:master

suachong commented Aug 13, 2025

Uh oh!

github-actions Bot commented Aug 13, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab Aug 14, 2025 •

edited

Loading

Uh oh!

ShriyaRishab Aug 14, 2025

Uh oh!

ShriyaRishab commented Aug 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

suachong commented Aug 13, 2025

Uh oh!

github-actions Bot commented Aug 13, 2025

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ShriyaRishab commented Aug 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ShriyaRishab Aug 14, 2025 •

edited

Loading