**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 13:28

**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 13:28

Nevkontakte @me@m.nevkontakte.com

May 22, 2025, 13:28

One thing to remember about #ml (and, by extension, #ai) is that it is, at the end of the day, a technique for complex function approximation. No more, no less. Think back to Stone–Weierstrass theorem from the mathematical analysis course, just on a different scale.

It is hard to imagine writing down an analytical definition for the "human speech" function, but, amazingly, we can computationally arrive at something that is behaving very similarly, and we call our latest take at it "Large Language Models". The impressive thing about this is how unimpressive it really is for what it does.

When looking through that lens, it feels kind of silly to ascribe real intelligence to such models, since it's merely an imitation of the original phenomenon. But it does provoke some reflection on what the existence of such approximation tells us about the original.

I think it also indicates the limitations of the current generation of AI techniques: they can achieve great (perhaps arbitrarily great) accuracy when interpolating, that is, when we are working within the information space well-represented in the training dataset.

However, it's much harder to make assertions about extrapolation accuracy the ideas and knowledge not seen by the model before, never mind the ideas completely novel to the humanity entirely. To me this is a hint as to why AI is actually pretty bad at creativity. It's not so much because it's bad at creativity, it's because its extrapolation is rather unlikely to match what humans consider creative.

Does this make #AI useless for any art, or novel research, or other forms of innovation? Not at all, I don't think. For one, all innovation consists of 1% of actually new ideas and 99% of hard and boring implementation/testing/experimental work, and any help with those 99% could still be a massive help. And even within 1%, random flailing of AI models can inspire humans into actually useful ideas :)

All of that it say, AI is just a better brush and it's silly to pretend it doesn't exist.

**l'empathie mécanique** @dpwiz@qoto.org · May 22, 2025, 14:21

**l'empathie mécanique** @dpwiz@qoto.org · May 22, 2025, 14:21

May 22, 2025, 14:21

l'empathie mécanique @dpwiz@qoto.org

@me I don't buy this.

SWT appears to only claim that an LLM *can* do interpolation. But even if I'm wrong here and interpolation is the only thing LLM does this doesn't matter as they are capable of systematically using learned patterns to perform in-context learning and then to produce solutions for unseen tasks. And this is a hallmark of intelligence.
Yes, novelty is hard. No, LLMs aren't just replicating old distributions.

**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 18:06

**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 18:06

May 22, 2025, 18:06

Nevkontakte @me@m.nevkontakte.com

@dpwiz@qoto.org nothing you've said seems to contradict to what I've said, no? :)

The really interesting question (and the one I am not smart enough to formally answer) is in what space does it do its interpolation. My layman understanding that all the recent advancements are thanks to the fact that the new architectures are able to coax the math to learn in a higher-level space than just the examples seen. So yeah, it does apply learned patterns to examples that fit them.

Problems begin when there is no known pattern that fits the task, which is exactly what innovation and creativity usually deal with :)

**l'empathie mécanique** @dpwiz@qoto.org · May 22, 2025, 18:48

**l'empathie mécanique** @dpwiz@qoto.org · May 22, 2025, 18:48

May 22, 2025, 18:48

l'empathie mécanique @dpwiz@qoto.org

@me There is one, thanks for focusing on it in the reply ((=

My claim is that the model training induces meta-learning...

> That was the goal all along - even before LLMs were a thing. OpenAI and DeepMind were on the hunt for making a thing that can learn on the go and adapt. And looks like we've got this by now.

... and that makes the exact content of its pre-training corpus irrelevant. As long as it can pick up knowledge and skills on the go it is intelligent. And the notion of "interpolation" (even in an insanely high-dimensional space) is irrelevant.

Can we please collectively shut up about stochastic parrots, just regurgitating the data, following the training distribution, interpolation, etc etc?

**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 20:23

**Nevkontakte** @me@m.nevkontakte.com · May 22, 2025, 20:23

May 22, 2025, 20:23

Nevkontakte @me@m.nevkontakte.com

@dpwiz@qoto.org I think we are talking past each other a bit.

Any machine learning model is, by construction, an approximation of some other function. This isn't a moral judgement, condemnation or dismissal of what it can achieve. In fact, it's pretty darn amazing what it can achieve without us, humans, not even being able to properly define the function it is learning (what is "intelligence"?).

I even agree that, from what I personally observe, it does seem to construct some sort of knowledge about the world from the data it gets to train on. That's kind of cool on its own, but it also tells us something what "knowledge" actually is, in a way we can dissect and study. Before LLMs, it was entirely not obvious that "knowledge" can really have mathematical representation that is compact enough for us to play with.

So when I talk about interpolation, I am talking about LLM's ability to apply that knowledge to a variety of tasks, as long as those tasks are within the scope of what we, humans, normally do. Which, again, is not a dismissal. Sadly, very few people in the world get to do new things, and even those who do only spend a small fraction of time doing it. Most of the rest of their time is dedicated to the boring chores that are necessary to do the fun bits.

Where I don't expect AI to succeed, at least not in its current form, is creating new knowledge (which is different from extracting existing knowledge). LLM is not going to deliver us cold fusion. LLM is not going to terraform Mars for us. LLM won't merge relativity and quantum physics into a single, unified theory of everything. LLM is not going to solve world hunger. It won't even find a way to make Republicans best buddies with Democrats. Simply because there is no pattern to apply here, it would be "the first ever" kind of thing. This is what I, personally, define as creativity and innovation. This is the area where extrapolation is required.

It's not a given that humans will succeed in any of these particular tasks, which, again, is essential to innovation. But even if LLMs can't do any of those things, they could help us get there faster by dealing with more of the chores. So I'll take it any day.

And if one day we will be able to train a model that somehow infers to principles of the universe on such a deep level so that we can query it and get answers to all of the questions... That would be cool, although I feel like we'd need a compute the size of the universe to compute it :)

**l'empathie mécanique** @dpwiz@qoto.org · 2025-05-22T20:58:36Z

l'empathie mécanique @dpwiz@qoto.org

@me > what is "intelligence"?

Intelligence is the ability to 1) learn new skills and 2) pick a fitting skill from your repertoire to solve a task.

Rocks don't have this. Thermostats don't have this. Cats have a little. Humans have this. AIs starting to have it. ASIs would have it in spades.

May 22, 2025, 20:58 · · · ·

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…