[Bugfix] Fix TP > 1 for new granite (vllm-project#8544)
Signed-off-by: Joe Runde <[email protected]>
Signed-off-by: Alvant <[email protected]>
joerunde authored and Alvant committed Oct 26, 2024
1 parent d5f7c8b commit 128425b
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion vllm/model_executor/models/granite.py
@@ -428,7 +428,8 @@ def compute_logits(
             sampling_metadata: SamplingMetadata) -> Optional[torch.Tensor]:
         logits = self.logits_processor(self.lm_head, hidden_states,
                                        sampling_metadata)
-        logits /= self.config.logits_scaling
+        if logits is not None:
+            logits /= self.config.logits_scaling
         return logits
 
     def sample(
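
The guard matters because `compute_logits` is declared to return `Optional[torch.Tensor]`, so `self.logits_processor` may hand back `None`. With TP > 1 this presumably happens on the ranks that do not receive the gathered logits, and the unguarded in-place division then fails on `None`. The following is a minimal sketch of that behaviour, not vLLM code: `fake_logits_processor`, `scale_logits`, the rank layout, and the plain-list logits are all hypothetical stand-ins chosen so the example runs without torch.

```python
from typing import List, Optional


def fake_logits_processor(rank: int) -> Optional[List[float]]:
    """Hypothetical stand-in: assume only rank 0 receives logits."""
    return [2.0, 4.0, 8.0] if rank == 0 else None


def scale_logits(logits: Optional[List[float]],
                 logits_scaling: float) -> Optional[List[float]]:
    """Mirror the patched logic: divide only when logits are present."""
    if logits is not None:
        logits = [value / logits_scaling for value in logits]
    return logits


# Simulate two tensor-parallel ranks (assumed layout, for illustration only).
results = [scale_logits(fake_logits_processor(rank), 2.0) for rank in range(2)]

# The pre-patch behaviour: dividing None raises a TypeError.
unguarded_failed = False
try:
    missing = fake_logits_processor(1)
    missing /= 2.0  # same shape of operation as the removed line
except TypeError:
    unguarded_failed = True
```

Here the rank that holds logits gets them scaled, while the other rank passes `None` through unchanged instead of raising.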
