about
publications

Safe Rlhf

Created in May 16, 2023

2023

We released Safe-RLHF: Constrained Value Alignment for LLMs.

机器之心报道：国内首个可复现的RLHF基准，北大团队开源 PKU-Beaver

© Copyright 2024 Jiaming Ji. Last updated: June 09, 2024.