[Quantization] [Compressed Tensors] Support Transforms, Fix Tests #42887
+105
−69
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose
Background
Transforms are extra weights added to a model which improve accuracy recovery from quantization. These extra weights are required to be shared in order to reduce memory requirements of the model.
Changes
0.11.0(to support transform features)_update_transforms_tied_weights, which leverages @Cyrilvallez 's refactoredtie_weightsfunctionality!_update_transforms_tied_weightsspecifies which transform weights are tied, andPreTrainedModel.tie_weightsingests the tied weights map and searches for the loaded weight to tied with shared weightsExample
_tied_weights_keys:Testing
CompressedTensorsTest, added an online quip-style transformed model for testingSuggested Reviewers
@SunMarc @Cyrilvallez @Rocketknight1