Reinforced Learning with Human Feedback
RLHF stands for Reinforcement Learning from Human Feedback, also written Reinforced Learning with Human Feedback. One way to scale RLHF is to treat all human annotation and data-editing history as feedback. This is particularly feasible with a web-based interface that captures human input on portable networked devices.
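As a minimal sketch of the idea, an edit history could be turned into preference data by treating each human edit as a signal that the edited text is preferred over the text it replaced. The `EditEvent` record, its field names, and the `edits_to_preference_pairs` helper are hypothetical names chosen for illustration; they do not come from any particular library or from this page.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class EditEvent:
    """One captured human edit (hypothetical schema for illustration)."""
    item_id: str   # identifier of the edited data item
    editor: str    # who made the edit
    before: str    # text before the edit
    after: str     # text after the edit


def edits_to_preference_pairs(history: List[EditEvent]) -> List[Dict[str, str]]:
    """Assume the post-edit text is preferred over the pre-edit text."""
    pairs = []
    for event in history:
        # Skip edits that changed nothing; they carry no preference signal.
        if event.before.strip() == event.after.strip():
            continue
        pairs.append(
            {"item_id": event.item_id, "chosen": event.after, "rejected": event.before}
        )
    return pairs


# Example history, as a web interface might log it (made-up data).
history = [
    EditEvent("q1", "alice", "Paris is in Germany.", "Paris is in France."),
    EditEvent("q2", "bob", "2 + 2 = 4", "2 + 2 = 4"),
]
pairs = edits_to_preference_pairs(history)
```

Pairs in this chosen/rejected form are the kind of input a reward model is commonly trained on. Whether a given edit really reflects a preference is an assumption that would need checking in practice.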