refactor: remove unused other_mimi MimiModel instance (~200MB memory savings) by mvanhorn · Pull Request #72 · NVIDIA/personaplex

mvanhorn · 2026-04-06T13:31:15Z

Summary

Removes the second MimiModel instance (other_mimi) from server.py and offline.py. Every call to other_mimi.encode() and other_mimi.decode() discards the result by assigning to _. The primary mimi instance produces all actual output. Removing other_mimi frees ~200MB GPU memory.

Why this matters

#46 identified that other_mimi loads a full copy of the Mimi codec weights but never uses any output. On memory-constrained deployments (consumer GPUs, Jetson), 200MB is significant.

Changes

moshi/moshi/server.py:

Removed other_mimi field from ServerState dataclass
Removed other_mimi parameter from __init__, warmup, and handle_chat
Removed other_mimi = loaders.get_mimi(...) instantiation in main()
Removed all _ = self.other_mimi.encode(chunk) and _ = self.other_mimi.decode(tokens[:, 1:9]) calls

moshi/moshi/offline.py:

Removed other_mimi parameter from warmup() and decode_tokens_to_pcm()
Removed other_mimi = loaders.get_mimi(...) instantiation in run_inference()
Removed all _ = other_mimi.encode(chunk) and _ = other_mimi.decode(tokens[:, 1:9]) calls

Net: -22 lines, +6 lines (signature adjustments).

Testing

Verified via grep -rn other_mimi --include="*.py" that zero references remain. The primary mimi encode/decode path is untouched.

No test suite exists in this repo.

Note for maintainer

If other_mimi was intended for a future streaming state feature or a separate voice prompt encoding path, please let me know and I'll revert. Based on the current code, all its outputs are discarded.

Fixes #46

This contribution was developed with AI assistance (Claude Code).

The second MimiModel instance (other_mimi) in server.py and offline.py processes the same audio as the primary mimi, but every encode/decode result is discarded (assigned to _). Removing it saves ~200MB GPU memory. Addresses NVIDIA#46

mvanhorn mentioned this pull request Apr 6, 2026

Clarification needed: Purpose of duplicate MimiModel instance (other_mimi) in server.py #46

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: remove unused other_mimi MimiModel instance (~200MB memory savings)#72

refactor: remove unused other_mimi MimiModel instance (~200MB memory savings)#72
mvanhorn wants to merge 1 commit intoNVIDIA:mainfrom
mvanhorn:refactor/46-remove-unused-other-mimi

mvanhorn commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mvanhorn commented Apr 6, 2026

Summary

Why this matters

Changes

Testing

Note for maintainer

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant