Is not it bug in the given line https://github.com/DAMO-NLP-SG/multilingual_analysis/blob/main/neuron_detection/transformers/models/llama/modeling_llama.py#L246
return real_output, torch.sum(torch.abs(self.up_proj(x)), dim=1).squeeze().tolist(), torch.sum(torch.abs(self.up_proj(x)), dim=1).squeeze().tolist()
Should it not be self.up_proj(x) and self.down_proj(x)?
Can authors please clarify this?
Is not it bug in the given line https://github.com/DAMO-NLP-SG/multilingual_analysis/blob/main/neuron_detection/transformers/models/llama/modeling_llama.py#L246
return real_output, torch.sum(torch.abs(self.up_proj(x)), dim=1).squeeze().tolist(), torch.sum(torch.abs(self.up_proj(x)), dim=1).squeeze().tolist()Should it not be self.up_proj(x) and self.down_proj(x)?
Can authors please clarify this?