[kernels] refactor function kernel calling #41577
Conversation
def lazy_load_kernel(kernel_name: str, mapping: dict[str, Optional[ModuleType]]):
    if kernel_name in mapping and isinstance(mapping[kernel_name], ModuleType):
the main utility function applied to the case of causal-conv1d
run-slow: falcon_mamba, mamba

This comment contains run-slow, running the specified jobs: models: ['models/falcon_mamba', 'models/mamba']

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
ArthurZucker left a comment
nice
raise RuntimeError("register_kernel_mapping requires `kernels` to be installed. Run `pip install kernels`.")
_KERNEL_SIMPLE_MAPPING: dict[str, str] = {
why "simple" ?
causal_conv1d = lazy_load_kernel("causal-conv1d", _KERNEL_MAPPING_ACROSS_MODELS)
causal_conv1d_update, causal_conv1d_fn = (
    (causal_conv1d.causal_conv1d_update, causal_conv1d.causal_conv1d_fn)
    if causal_conv1d is not None
    else (None, None)
)
we should not need to pass the kernel mapping, `lazy_load_kernel` can handle it!
Done
[For maintainers] Suggested jobs to run (before merge): run-slow: falcon_mamba, mamba
ArthurZucker left a comment
Thanks 🤗
ALL_MASK_ATTENTION_FUNCTIONS.register(attn_implementation, ALL_MASK_ATTENTION_FUNCTIONS["flash_attention_2"])
def lazy_load_kernel(kernel_name: str, mapping: dict[str, Optional[ModuleType]] = _KERNEL_MODULE_MAPPING):
the default should be `None`: if it is `None`, fall back to the kernel mapping inside the function body. Python will cry otherwise!
causal_conv1d = lazy_load_kernel("causal-conv1d")
causal_conv1d_update, causal_conv1d_fn = (
    (causal_conv1d.causal_conv1d_update, causal_conv1d.causal_conv1d_fn)
    if causal_conv1d is not None
    else (None, None)
)
nice and simple
* refactor function kernel calling
* nit
* don't pass the mapping
* use _kernels_available
* rm import
What does this PR do?
This should simplify lazy kernel loading in Transformers.
We define a mapping between each kernel name and the repository it should be pulled from, then load it with the `lazy_load_kernel` function. This function adds the kernel to a global cache shared across all models. If the kernel isn't available, we check whether it's installed as a module for backward compatibility; otherwise, we return `None`.