rl llm
Rejection Sampling in β¦
Understanding Rejection Sampling: From Basics to DeepSeek-R1 When dealing with complex probability distributions, directly sampling from them can be challenging. Rejection sampling is a clever statistical technique that allows us to overcome this hurdle. This method is not only foundational in β¦