Pull requests: InternLM/lmdeploy
Pull requests list
feat: support num_experts_per_tok=10 in turbomind backend
#4665
opened Jun 9, 2026 by
irexyc
Collaborator
refactor: simplify multimodal preprocessing expansion
#4663
opened Jun 9, 2026 by
CUHKSZzxy
Collaborator
Fix client-disconnect session leaks in PyTorch MP engine
Bug:P0
#4655
opened Jun 6, 2026 by
grimoire
Collaborator
1 task done
refactor(proxy): split monolithic proxy into modular serve/proxy package
#4647
opened Jun 4, 2026 by
lvhan028
Collaborator
Improve engine health monitoring and wakeup scheduling
Bug:P0
#4645
opened Jun 4, 2026 by
lvhan028
Collaborator
support disaggregated weight update
planned feature
#4638
opened May 29, 2026 by
irexyc
Collaborator
modify save model in lite module
improvement
#4624
opened May 26, 2026 by
43758726
Contributor
feat(turbomind): support priority schedule policy
#4614
opened May 22, 2026 by
4mengy
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605
opened May 21, 2026 by
windreamer
Collaborator
1 of 4 tasks
[WIP]: Support reuse routed experts on eviction
#4599
opened May 19, 2026 by
RunningLeon
Collaborator
docs(advance): add Add a New Speculative Decoding Method guide
documentation
#4589
opened May 17, 2026 by
SuperMarioYL
4 tasks done