Pull requests: NVIDIA-NeMo/RL
docs: Add notes for FP8 recipe in docs/fp8.md
CI:docs
Run doctest
documentation
Improvements or additions to documentation
#1829
opened Jan 26, 2026 by
guyueh1
4 tasks
feat: add lora config for dpo dtensor backend
CI:L1
Run doctests, unit tests, and functional tests
#1826
opened Jan 26, 2026 by
RayenTian
4 tasks
feat: add FT launcher config and resiliency dependency [1/4]
CI:docs
Run doctest
documentation
Improvements or additions to documentation
#1824
opened Jan 23, 2026 by
yashaswikarnati
4 tasks
ci: Allow repo to self publish docs
CI
Relating to CI
#1821
opened Jan 23, 2026 by
chtruong814
4 tasks
perf: Update cudnn to 9.14
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1820
opened Jan 23, 2026 by
guyueh1
4 tasks
fix: fix statistic of probs_ratio_clamped_min/max
CI:L1
Run doctests, unit tests, and functional tests
#1818
opened Jan 23, 2026 by
yuki-97
fix: Unify custom model logits extraction across all inference methods
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1815
opened Jan 23, 2026 by
zpqiu
4 tasks
chore: cuda13 support
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1803
opened Jan 21, 2026 by
guyueh1
4 tasks
feat: Timer for the data sharding and job submission
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1802
opened Jan 21, 2026 by
guyueh1
4 tasks
feat: Support lora in dtensor grpo workflow by merging weight
CI:L1
Run doctests, unit tests, and functional tests
feat: add speculative decoding during post-training
#1785
opened Jan 15, 2026 by
isomap
2 of 4 tasks
feat: NeMo Gym GRPO on-policy fix params; Per-agent group-level rewards
CI:L1
Run doctests, unit tests, and functional tests
#1779
opened Jan 15, 2026 by
bxyu-nvidia
4 tasks
[don't merge] split train and val dataset in preference dataset
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
[docs] Document Gym + RL integration design
documentation
Improvements or additions to documentation
feat: refactor train utilities for dtensor policy v2
#1757
opened Jan 10, 2026 by
hemildesai
Draft
4 tasks
feat: Support lora in dtensor grpo workflow[3/3]: async vllm
CI:L1
Run doctests, unit tests, and functional tests
#1752
opened Jan 9, 2026 by
RayenTian
7 tasks
feat: Support lora in dtensor grpo workflow[2/3]: sync and non-colocated setup
CI:L1
Run doctests, unit tests, and functional tests
#1751
opened Jan 9, 2026 by
RayenTian
4 tasks