Difference between revisions of "Supervised finetuning"

From PKC
Jump to navigation Jump to search
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
{{WikiEntry|key=Supervised learning|qCode=118129371}} is a type of ML technique for model improvement. The term [[supervised finetuning]] is also abbreviated as [[SFT]].
{{WikiEntry|key=Supervised learning|qCode=118129371}} is a type of ML technique for model improvement. The term [[supervised finetuning]] is also abbreviated as [[SFT]]. This is an approach that is "easier" than [[RLHF]] according to [[Andrej Karpathy]]<ref>{{:Video/State of GPT - BRK216HFS}}</ref>.
<noinclude>
<noinclude>
{{PagePostfix
{{PagePostfix
|category_csd=SFT,RLHF,AI,GPT,Machine Learning
|category_csd=SFT,RLHF,AI,GPT,Machine Learning,Social Meaning of Data
}}
}}
</noinclude>
</noinclude>

Latest revision as of 06:55, 27 May 2023

Supervised learning(Q118129371) is a type of ML technique for model improvement. The term supervised finetuning is also abbreviated as SFT. This is an approach that is "easier" than RLHF according to Andrej Karpathy[1].

References

  1. Karpathy, Andrej (May 25, 2023). Microsoft Developer, ed. State of GPT - BRK216HFS. local page: Microsoft Developer. 

Related Pages