Difference between revisions of "Reinforced Learning with Human Feedback"
Jump to navigation
Jump to search
(Created page with "RLHF is also known as Reinforced Learning with Human Feedback.") |
|||
Line 1: | Line 1: | ||
[[RLHF]] is also known as [[Reinforced Learning with Human Feedback]]. | [[RLHF]] is also known as [[Reinforced Learning with Human Feedback]]. | ||
<noinclude> | |||
{{PagePostfix | |||
|category_csd=SFT,RLHF,AI,GPT,Machine Learning | |||
}} | |||
</noinclude> |