RT @_akhaliq: Language models can explain neurons in language models
use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. Release a dataset of these (imperfect) explanations and scores for every neuron in GPT-2… https://t.co/bQ4lE0V5jp https://t.co/5dEvRlDsr5