Skip to content

Split grouped quantize/activations and dbias for faster compilation on multicore machines#2983

Draft
ptrendx wants to merge 3 commits into
NVIDIA:mainfrom
ptrendx:pr_split_compilation
Draft

Split grouped quantize/activations and dbias for faster compilation on multicore machines#2983
ptrendx wants to merge 3 commits into
NVIDIA:mainfrom
ptrendx:pr_split_compilation

Conversation

@ptrendx
Copy link
Copy Markdown
Member

@ptrendx ptrendx commented May 12, 2026

Description

The actual right fix would be to nvRTC them, but this will at least make it slightly more manageable.

Type of change

  • Documentation change (change only to the documentation, either a fix or a new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

ptrendx added 2 commits May 12, 2026 15:55
Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
@ptrendx ptrendx force-pushed the pr_split_compilation branch from 44a7d09 to 30880ef Compare May 12, 2026 22:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant