Revision as of 06:37, 27 May 2023

RLHF is also known as Reinforced Learning with Human Feedback (more commonly expanded as Reinforcement Learning from Human Feedback). One way to scale RLHF is to treat all human annotation and data-editing history as feedback for RLHF. A web-based interface makes this practical, because it can capture human input from portable networked devices.
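As a rough illustration of how editing history could feed RLHF, the sketch below converts captured edits into preference pairs, treating the edited text as preferred over the text it replaced. This is only one possible reading of the idea above. The `EditEvent` and `PreferencePair` schemas, the function name, and the "edited version is preferred" rule are all assumptions made for the example and do not come from this page.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EditEvent:
    """One human edit captured by a web interface (hypothetical schema)."""
    item_id: str   # identifier of the edited item
    editor: str    # who made the edit
    before: str    # text before the edit
    after: str     # text after the edit


@dataclass
class PreferencePair:
    """A (chosen, rejected) pair of the kind used to train a reward model."""
    item_id: str
    chosen: str
    rejected: str


def edits_to_preference_pairs(history: List[EditEvent]) -> List[PreferencePair]:
    """Turn an edit history into preference pairs.

    Assumption: the editor's revision is preferred over the text it replaced.
    Edits that change only surrounding whitespace are skipped.
    """
    pairs = []
    for event in history:
        if event.before.strip() == event.after.strip():
            continue  # nothing meaningful changed, so there is no preference signal
        pairs.append(
            PreferencePair(event.item_id, chosen=event.after, rejected=event.before)
        )
    return pairs


# Illustrative data only; not taken from any real edit log.
history = [
    EditEvent("q1", "alice", "RLHF is a optimizer.", "RLHF is a training method."),
    EditEvent("q1", "bob", "same text", "same text "),
]
pairs = edits_to_preference_pairs(history)
```

In practice an edit can also make text worse, so a real pipeline would likely need review or filtering before such pairs are used for training.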

References

Related Pages

@@ Line 1: / Line 1: @@
-[[RLHF]] is also known as [[Reinforced Learning with Human Feedback]].
+[[RLHF]] is also known as [[Reinforced Learning with Human Feedback]]. One way to think of [[RLHF]] at scale is to allow all human annotation and data editing history as a part of [[RLHF]]. This is particularly possible with a web-based interface that captures human inputs on portable networked devices.
 <noinclude>
 {{PagePostfix

Difference between revisions of "Reinforced Learning with Human Feedback"
