EXAMPLES/DEVICE/EP/CSRC: Removed extra params from ht functions.#1812
EXAMPLES/DEVICE/EP/CSRC: Removed extra params from ht functions.#1812rakhmets wants to merge 1 commit into
Conversation
|
👋 Hi rakhmets! Thank you for contributing to ai-dynamo/nixl. Your PR reviewers will review your contribution then trigger the CI to test your changes. 🚀 |
📝 WalkthroughWalkthroughThis change removes low-latency-mode parameters from HT dispatch and combine kernel declarations, launch paths, and call sites, and adds a host-side BF16 validation before ChangesHT dispatch and combine cleanup
Sequence Diagram(s)sequenceDiagram
participant Buffer
participant notify_dispatch
participant dispatch
Buffer->>notify_dispatch: launch without low_latency_mode
Buffer->>dispatch: launch without low_latency_mode
sequenceDiagram
participant Buffer
participant cached_notify
participant combine
Buffer->>cached_notify: enqueue barrier without low_latency_mode
Buffer->>combine: call with BF16-validated input and no type argument
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~25 minutes Poem
🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
ColinNV
left a comment
There was a problem hiding this comment.
Also hard-wiring scalar type?
The merge-base changed after approval.
Signed-off-by: Raul Akhmetshin <rakhmetshin@nvidia.com>
02df4a8 to
b5ee466
Compare
|
/build |
What?
Removed unused
low_latency_modeparameter from the functions fromhtnamespace.Removed
typeparameter fromht::combine, and moved the assert fromht::combinetoBuffer::ht_combine.Why?
Reduced hanges in #1793 to facilitate review.
Summary by CodeRabbit
Summary
Refactor
Bug Fixes