Shepherd: A Critic for Language Model Generation (from Meta) (performance) - AI -Artificial Intelligence - City-Data Forum

LechM · 08-10-2023, 07:36 PM

https://arxiv.org/pdf/2308.04592.pdf
https://github.com/facebookresearch/Shepherd

Simple methods can enhance the performance of LLMs (such as GPT-4) using iterative refinement. One approach involves having an LLM critique generated text and then self-improve based on that feedback. In this study, the authors developed a small (7B, thus fast and cheap to run) model which surpasses other competing models, including ChatGPT.