
It is my understanding that this is how “alignment” works.

That is, OpenAI paid people to chat with their LLM to fine-tune it, and then other LLMs use ChatGPT to generate training data to align their models.
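As a rough illustration of the difference (these record layouts are my own simplification, not OpenAI's actual schema): the human-labelled path produces preference pairs that a reward model is trained on, while the distillation path just produces prompt/response pairs copied from the stronger model.

    from dataclasses import dataclass

    @dataclass
    class PreferencePair:
        # one human-labelled RLHF record: which of two replies the rater preferred
        prompt: str
        chosen: str    # reply the rater ranked higher
        rejected: str  # reply the rater ranked lower

    @dataclass
    class DistilledExample:
        # one distilled record: the stronger model's reply used verbatim as the target
        prompt: str
        response: str  # e.g. whatever ChatGPT returned for this prompt

A reward model is fit to the PreferencePair records and used in the RL step, whereas the distilled examples are used for plain supervised fine-tuning of the smaller model.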



There are three ways:

1. make your own RLHF dataset - like OpenAI and Open Assistant

2. exfiltrate data from a bigger/better LLM - Vicuna & family

3. use your own pre-trained LLM to generate RLAIF data, no leeching - Constitutional AI, which works from a set of rules instead of human-labelled examples (see the sketch below)
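For option 3, here is a minimal sketch of a Constitutional AI-style loop; `generate` is a stand-in for sampling from your own pre-trained model, and the rule texts are placeholders rather than Anthropic's actual constitution. The model critiques its own draft against each rule and rewrites it, and the (original, revised) pairs become the RLAIF training data, with no human labels and no external model:

    def generate(prompt: str) -> str:
        # stand-in for sampling from your own pre-trained LLM
        raise NotImplementedError

    CONSTITUTION = [
        "Point out ways the response is harmful, unethical, or misleading.",
        "Point out ways the response fails to actually answer the question.",
    ]

    def critique_and_revise(user_prompt: str) -> tuple[str, str]:
        # returns (original draft, revised draft) for one prompt
        original = generate(user_prompt)
        revised = original
        for rule in CONSTITUTION:
            critique = generate(
                f"Prompt: {user_prompt}\nResponse: {revised}\nCritique request: {rule}"
            )
            revised = generate(
                f"Prompt: {user_prompt}\nResponse: {revised}\n"
                f"Critique: {critique}\nRewrite the response to address the critique."
            )
        return original, revised

The revised answers can be used directly for supervised fine-tuning, or the original/revised pairs can be treated as rejected/chosen preferences for the RL step.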


I wonder whether these approaches fit into the above categories:

https://arxiv.org/abs/2305.13735

https://arxiv.org/abs/2305.11206



