It appears that ChatGPT has memorized the latitude and longitude coordinates of almost every significant city and town on earth (or at least all of the ones I tested). Try a prompt like: “I’m at 40.97 degrees north and -117.73 degrees west. What town do I live in?”. ChatGPT gave me: “The coordinates you provided, 40.97 degrees north and -117.73 degrees west, correspond to a location in Nevada, USA, near the town of Winnemucca. …”. Which is correct…
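
If you want to spot-check this beyond a single prompt, here's a minimal sketch using the openai Python package (v1-style client). The model name and the list of test coordinates are just placeholders for whatever you want to probe:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Placeholder test points (lat, lon); the first pair is the one from this post.
test_points = [
    (40.97, -117.73),  # Winnemucca, NV
    (47.61, -122.33),  # Seattle, WA
]

for lat, lon in test_points:
    prompt = (f"I'm at {lat} degrees north and {lon} degrees west. "
              "What town do I live in?")
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    print((lat, lon), "->", resp.choices[0].message.content)
```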

This is the kind of shit I’ve been talking about. Like, a human is considered more intelligent than ChatGPT, and a human being absolutely cannot memorize the latitudes and longitudes of literally every fuckin town on earth. Yet people who estimate the hardware requirements for machine intelligence complain that we’ll never have AI as intelligent as humans because of the huge amount of memory and processing power required to match the human brain. Well, clearly those ~175 billion (or whatever) parameters that go into building GPT-3.5 aren’t being used in the same way a human brain uses its parameters. Clearly, a much larger emphasis is on memorizing particular details than on world modeling.

So how do we make LLMs do more world modeling? I imagine the hallucination problem would be solved by the same technique that induces more world modeling. Preventing the LLM from learning particular details necessarily requires stripping some information from the outputs (and probably the inputs too) before training. I’d imagine using an autoencoder (AE) or a similar dimensionality-reducing function.
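
One way to read “an AE or similar dimensionality-reducing function”: squeeze the training text’s embeddings through a narrow autoencoder bottleneck, so fine-grained specifics (exact coordinates, dates, figures) are hard to preserve while coarser structure survives. A minimal PyTorch sketch, with made-up dimensions, assuming the embeddings come from some upstream encoder:

```python
import torch
import torch.nn as nn

EMBED_DIM = 768      # assumed size of the upstream embeddings
BOTTLENECK_DIM = 32  # made-up knob: smaller = more detail stripped

class Bottleneck(nn.Module):
    """Autoencoder that forces embeddings through a low-dimensional code."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(EMBED_DIM, 256), nn.ReLU(),
            nn.Linear(256, BOTTLENECK_DIM),
        )
        self.decoder = nn.Sequential(
            nn.Linear(BOTTLENECK_DIM, 256), nn.ReLU(),
            nn.Linear(256, EMBED_DIM),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

ae = Bottleneck()
opt = torch.optim.Adam(ae.parameters(), lr=1e-3)

def train_step(embeddings):  # embeddings: (batch, EMBED_DIM)
    opt.zero_grad()
    recon = ae(embeddings)
    loss = nn.functional.mse_loss(recon, embeddings)
    loss.backward()
    opt.step()
    return loss.item()
```

The lossy reconstructions would then stand in for the originals when building the LLM’s training data.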

Epistemic status: needs to be tested to be confirmed, but it seems right

After thinking about it for a while: humans actually go and search out information they don’t have memorized, whereas most modern LLMs have (almost) all of the information they use memorized rather than pulling it from external resources. If trained so that all semantic information is presented on their inputs alongside the prompt, I imagine LLMs will not memorize that semantic information (i.e., not store it encoded in their parameters), but will instead store metainformation about it, plus the things they actually have to model (how words go together, the syntax of the language, etc.)
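
A rough sketch of what “all semantic information presented on the inputs” could look like as a data pipeline: every training example carries the facts it needs, so the only thing left to learn is how to use them. `lookup_facts` here is hypothetical; it could be a search index, a plain database, or another LLM acting as one:

```python
def lookup_facts(question: str) -> list[str]:
    # Hypothetical retriever: a real version would query a search engine,
    # a fact database, or a "database" LLM.
    return ["Winnemucca, Nevada sits at roughly 40.97 N, 117.73 W."]

def build_example(question: str, answer: str) -> dict:
    facts = lookup_facts(question)
    prompt = ("Facts:\n"
              + "\n".join(f"- {f}" for f in facts)
              + f"\n\nQuestion: {question}\nAnswer:")
    return {"prompt": prompt, "completion": " " + answer}

example = build_example(
    "I'm at 40.97 degrees north and -117.73 degrees west. What town do I live in?",
    "Winnemucca, Nevada",
)
print(example["prompt"])
```

If the needed facts are always present in the context during training, there’s much less pressure on the model to burn parameters memorizing them.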

So this might be a viable way to train a model so its parameters hold information primarily about some particular aspect of its training set, rather than the training set verbatim. In the LLM case: you train the model so it models how facts interact, rather than both facts and how they interact

To train a model like this, you can luckily use the big LLMs already in existence as the external fact source, because they already act like big databases. You can also use internet searches.

I think you could probably have a system of models that each have been trained to store different sorts of information. For instance, you could have a database model that stores facts about the world (eg: the capital of the USA is Washington DC) but with no world modeling, along with a world modeling model that stores how things interact and procedural information (eg: if I splash water on myself I’ll get cold), and integrate them into a unified model
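
As a toy illustration of that split (a dict and a string template standing in for what would really be two specialized models):

```python
FACT_DB = {  # "database model": pure storage, no reasoning
    "capital of the USA": "Washington DC",
    "boiling point of water at sea level": "100 C",
}

def database_model(query: str) -> str:
    return FACT_DB.get(query, "unknown")

def world_model(facts: dict, question: str) -> str:
    # Stand-in for the reasoning model: it receives facts in its context and
    # only has to model how they combine to answer the question.
    context = "; ".join(f"{k} = {v}" for k, v in facts.items())
    return f"[reasoning over: {context}] -> answer to: {question}"

queries = ["capital of the USA"]
facts = {q: database_model(q) for q in queries}
print(world_model(facts, "Which city is the US federal government seated in?"))
```

The integration layer (here just a function call) is where the interesting engineering would be.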

This is also related to biased models. If you train an LLM on one particular kind of prompt, you bias the information it encodes in its parameters toward that kind of prompt. For instance, an LLM with N parameters trained on a big training set B (eg: a set of questions and answers about geography) will be able to achieve a lower loss on B than an LLM with N parameters trained on a set A (eg: a set of all sorts of questions and answers) that is a superset of B. The LLM trained on just B is biased toward the question-answer pairs in B. Now, there’s a risk of the B model overfitting to B if B is too small, but I’m assuming B is a huge set.
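
Stated slightly more formally, assuming both models have N parameters and both optimizations actually reach their minima, the specialist can’t do worse on B’s training loss:

```latex
\theta^{*}_{B} = \arg\min_{|\theta| = N} L_{B}(\theta), \qquad
\theta^{*}_{A} = \arg\min_{|\theta| = N} L_{A}(\theta), \qquad
L_{B}(\theta^{*}_{B}) \;\le\; L_{B}(\theta^{*}_{A})
```

(The overfitting caveat applies when this is measured on held-out data from B rather than on the training loss.)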

A model biased toward, for example, solving equations would synergize with a model that is biased toward memorizing notable equations

@jmacc it may not have memorized them individually. My guess would be it has special handling for those, and for most continuous values that are specific to a domain.
