Feature Request: Nemotron-3-Nano-30B-A3B model (moe on nemotron_h) #18064

@mattepiu

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Add support for NVIDIA Nemotron-3-Nano-30B-A3B and its architecture, nemotron_h_moe, which should be a MoE (mixture-of-experts) extension of the existing nemotron_h.

Motivation

Benchmark results are very good for its size (e.g., 38.8 on SWE-bench), and it is considerably faster than similarly sized Qwen3 models.

Possible Implementation

No response

Metadata

    Labels

    enhancement: New feature or request
