RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation 01/09/2023 Alignment
Accurate Modeling Of Human-like And Interestingness Of Generated Text: MAUVE Accurate Modeling Of Human-like And Interestingness Of Generated Text: MAUVE 25/02/2022 Natural Language Processing