[ad_1]
We skilled “critique-writing” fashions to explain flaws in summaries. Human evaluators discover flaws in summaries way more typically when proven our mannequin’s critiques. Larger fashions are higher at self-critiquing, with scale enhancing critique-writing greater than summary-writing. This reveals promise for utilizing AI techniques to help human supervision of AI techniques on tough duties.
