Difference between revisions of "Supervised finetuning"
Jump to navigation
Jump to search
Line 1: | Line 1: | ||
{{WikiEntry|key=Supervised learning|qCode=118129371}} is a type of ML technique for model improvement. The term [[supervised finetuning]] is also abbreviated as [[SFT]]. | {{WikiEntry|key=Supervised learning|qCode=118129371}} is a type of ML technique for model improvement. The term [[supervised finetuning]] is also abbreviated as [[SFT]]. This is an approach that is "easier" than [[RLHF]] according to [[Andrej Karpathy]]<ref>{{:Video/State of GPT - BRK216HFS}}</ref>. | ||
<noinclude> | <noinclude> | ||
{{PagePostfix | {{PagePostfix |
Latest revision as of 06:55, 27 May 2023
Supervised learning(Q118129371) is a type of ML technique for model improvement. The term supervised finetuning is also abbreviated as SFT. This is an approach that is "easier" than RLHF according to Andrej Karpathy[1].
References
- ↑ Karpathy, Andrej (May 25, 2023). Microsoft Developer, ed. State of GPT - BRK216HFS. local page: Microsoft Developer.
Related Pages