Created in May 16, 2023
2023
We released Safe-RLHF: Constrained Value Alignment for LLMs.
机器之心报道:国内首个可复现的RLHF基准,北大团队开源 PKU-Beaver