Created on October 21, 2023
2023
We released Safe RLHF: Safe Reinforcement Learning from Human Feedback.
AK's Daily Papers