process-reward-model

Here are 5 public repositories matching this topic...

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

A comprehensive collection of process reward models.

r1 o1 large-language-model process-reward-model

Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.

Library for training process reward models

prm process-reward-model

Add a description, image, and links to the process-reward-model topic page so that developers can more easily learn about it.

To associate your repository with the process-reward-model topic, visit your repo's landing page and select "manage topics."