Rex Kerr
Feb 10, 2024

--

Based on the statistical ensemble of observed uses, which is about as well as can be done (given adequate representational power, which transformer architectures have had since GPT-3) for intersubjective matters like language.

Y'want to come up with a set of test questions and take some LLMs for a spin to assess the typical quality? This isn't unknowable. We can get a (statistical) answer with a bit of effort.

--

--

Rex Kerr
Rex Kerr

Written by Rex Kerr

One who rejoices when everything is made as simple as possible, but no simpler. Sayer of things that may be wrong, but not so bad that they're not even wrong.

Responses (1)