Add LongCat-Flash #40730
Merged
Changes from 1 commit
Commits
62 commits
21ac639
working draft for LongCat
molbap c939eb2
BC changes to deepseek_v3 for modular
molbap 2535c28
format
molbap bac973f
Merge branch 'main' into new_moe
molbap cddaba5
various modularities
molbap 67943a4
better tp plan
molbap d765b18
better init
molbap eebb41c
minor changes
molbap 414ba61
make modular better
molbap 7586dd7
clean up patterns
molbap b4584ad
Revert a couple of modular commits, because we won't convert in the end
molbap 76e4555
make things explicit.
molbap c7c5a3d
draft test
molbap 6e58487
toctree, tests and imports
molbap 8bb172d
drop
molbap 726828d
woops
molbap df11c0e
make better things
molbap fa3aacf
update test
molbap 07af563
update
molbap 927a55e
fixes
molbap 36c3dbb
style and CI
molbap d85c3e3
convert stuff
molbap 8cb4dc2
up
molbap 1343b65
ah, yes, that
molbap 275374a
enable gen tests
molbap f9d35c5
fix cache shape in test (sum of 2 things)
molbap 74d2728
fix tests
molbap 1c9b49f
comments
molbap 967259a
re-Identitise
molbap da61426
minimize changes
molbap 9ff6f95
better defaults
molbap d75311c
modular betterment
molbap 87b5687
fix configuration, add documentation
molbap e39779d
fix init
molbap c85a7ea
add integration tests
molbap 3846289
add info
molbap 1ec96f4
simplify
molbap 6778512
update slow tests
molbap 88e3114
fix
molbap 563f9e0
conflicted
molbap 67fd0d1
style
molbap ae5fcbc
Merge branch 'main' into new_moe
molbap c85afdd
Merge branch 'new_moe' of github.com:huggingface/transformers into ne…
molbap f208aa4
some additional long tests
molbap a3be847
cpu-only long test
molbap cf09a0b
Merge branch 'main' into new_moe
molbap c0f965f
fix last tests?
molbap 2a76079
Merge branch 'new_moe' of github.com:huggingface/transformers into ne…
molbap 7dafc04
urg
molbap 7910e57
cleaner tests why not
molbap 0666611
fix
molbap fd6df4f
Merge branch 'main' into new_moe
molbap a9b040e
improve slow tests, no skip
molbap b95af0a
style
molbap f0dfec7
don't upcast
molbap 8463c5b
Merge branch 'main' into new_moe
molbap 8cd2bb4
one skip
molbap 68943ca
Merge branch 'new_moe' of github.com:huggingface/transformers into ne…
molbap f0eb7af
Merge branch 'main' into new_moe
molbap c85b064
finally fix parallelism
molbap f385373
Merge branch 'new_moe' of github.com:huggingface/transformers into ne…
molbap 66b414a
Merge branch 'main' into new_moe
molbap
don't upcast
commit f0dfec7e8aea30e656add5c93424cb05c67b8e75