@mapto

I just commented the following on your blog where you discuss levels of linguistic competence:

This is quite strange, since it is so evidently not true. All of these behaviours are well within the capabilities of ChatGPT. I'll give you two examples that were claimed as failures.

-----------
Prompt:
The mother of Michael has four children. Three of them are Marcy, Millie, and Moses. Please tell me the name of the fourth child.

ChatGPT:
The name of the fourth child is likely "Michael," as the statement at the beginning of the question says "the mother of Michael has four children," and then goes on to name three of them.

Prompt:
Please complete the following sentence reasonably: "If tigers would like cereals, they would be cheaper to keep. Families could feed their tigers with "

ChatGPT:
If tigers would like cereals, they would be cheaper to keep. Families could feed their tigers with a variety of inexpensive grain-based products such as corn, wheat, and rice, rather than having to provide costly meat-based diets. However, it is important to note that tigers are obligate carnivores and have specific nutritional needs that cannot be met by a purely plant-based diet, so feeding them only cereals would not be a healthy or appropriate long-term solution.
------------

So I am genuinely curious what drives such profoundly different experiences.

The proposed tests and competences are interesting; I would hope that they would be put to better use to map out the quite unexpected emergent abilities of the AI, to put it to good use, and to prepare ourselves (and our institutions) for ChatGPT's even more capable descendants.

I invite you to head over to sentientsyllabus.substack.com for some in-depth analysis.

🙂

@boris_steipe interesting that for the same questions, different people get conceptually different responses. Would you comment on this? I would, but I wanted to hear your interpretation.

@boris_steipe actually I experimented with family relations and it handles them quite well. Notice that the example screenshot from my article appears, judging by the logo, to be from an official ChatGPT channel. I don't know if this was a teaser to mislead people or whether the model was trained on this example in the meantime.

@boris_steipe however, I thought I'd come up with another test that hopefully is not in its training set already. Here it goes:

mapto:
Are there any animals that go around in schools, other than human children?

ChatGPT:
It is uncommon for animals to go around in schools, but there are a few exceptions:

Service animals: Some students with disabilities may use service animals to assist them with mobility, communication, or other tasks. These animals are trained to work in public settings and are allowed to accompany their handlers in school.

Therapy animals: Some schools may invite therapy animals, such as dogs or cats, to visit classrooms as part of an emotional support program. These animals can provide comfort and support for students and may be used to help reduce stress and anxiety.

Classroom pets: Some teachers may keep small animals, such as fish, hamsters, or reptiles, as classroom pets to provide students with opportunities to learn about animal care and behavior.

It's important to note that the presence of animals in schools is typically regulated by local and state laws, and schools may have policies and procedures in place to ensure the safety and well-being of both the animals and the students.

mapto:
Are there school populations that are not formed of human beings?

ChatGPT:
While schools are typically designed for human students, there are a few types of schools that cater to non-human populations. These include:

Service animal training schools: These schools train animals, such as guide dogs, to assist people with disabilities.

Veterinary schools: These schools train students to become veterinarians or veterinary technicians, and often have animal patients on campus.

Zoos and wildlife centers: Some zoos and wildlife centers offer educational programs for school groups, where students can learn about animals and their habitats.

Research facilities: Some research facilities may house animals for scientific study, and may offer educational programs for students interested in pursuing careers in animal research or medicine.

It's worth noting that while non-human populations may be present in some educational settings, these populations are not typically considered "students" and do not participate in the same types of learning experiences as human students.

mapto:
Are there animals that go around in schools without any human involvement in the process?

ChatGPT:
It is highly unlikely for animals to go around in schools without any human involvement in the process. Schools are designed for human students and typically have measures in place to prevent animals from entering and causing disruptions. Additionally, most animals require some level of human care and attention, such as food, water, and shelter, which is unlikely to be provided within a school setting without human involvement.

However, it's possible that some wild animals may enter school grounds if the school is located in an area with a high wildlife population. In these cases, the school may have procedures in place to safely and humanely remove the animals from the area. It's important to remember that wild animals can pose a danger to humans and should be treated with caution and respect.

mapto:
What about fish?

ChatGPT:
Fish are aquatic animals and are unlikely to be found inside schools unless they are kept as classroom pets or as part of a science project. In these cases, the fish would require human care and involvement in their upkeep and would not be able to move around the school without human assistance.

If fish are kept as classroom pets, it's important to ensure that their tank or aquarium is properly maintained and cleaned to provide a healthy environment for the fish. Students may be responsible for feeding and caring for the fish under the supervision of a teacher or staff member.

It's worth noting that some schools may also have outdoor ponds or aquatic habitats that contain fish, but these would typically be located in designated areas and would not be accessible to students without human involvement and supervision.

mapto:
How is a big group of fish called?

ChatGPT:
A big group of fish is typically referred to as a "school" or a "shoal" of fish. The term "school" refers to a large group of fish that swim together in a coordinated manner, while the term "shoal" refers to a group of fish that swim together but do not necessarily exhibit coordinated behavior. Both terms are commonly used interchangeably to refer to a large group of fish.

mapto:
So it turns out fish do go around in schools, doesn't it?

ChatGPT:
Yes, you are correct! Fish do go around in schools, which is a term used to describe a large group of fish swimming together in a coordinated manner. Fish often swim in schools for a variety of reasons, including to protect themselves from predators, to conserve energy, and to enhance their ability to find food. In some cases, schools of fish can contain thousands or even millions of individual fish.

@mapto

Yes, your example is a question about resolving homonyms – typically it will go with the most probable interpretation first. But you can ask it more directly:

--------------------
Human:
I am confused by a sentence: "They go around in schools." What could it mean?

ChatGPT:
The phrase "They go around in schools" is commonly used to describe the behavior of certain animals, particularly fish, who move in groups called "schools." In this context, it means that the animals in question are swimming or moving together in a group or formation, often for safety or to increase their chances of finding food.

For example, you might say, "I saw a group of dolphins in the bay today. They were going around in schools and jumping out of the water." In this case, the speaker is describing the behavior of the dolphins as they swim together in a group, often leaping out of the water in unison.

In other contexts, the phrase "go around in schools" could also refer to people moving in groups or organizations that are referred to as "schools." For example, you might say, "Many of the graduates from that university go on to work in schools." In this case, the speaker is referring to educational institutions, which are commonly referred to as "schools."

----------------------

Another example is the old "Time flies like an arrow; fruit flies like a banana." – I tried a number of those patterns and it usually handles them very well.

@mapto

I think it is often overlooked that ChatGPT is not an AI but a language model. To get non-trivial responses, one has to think about how to phrase a request so that it is part of a dialogue. Many abilities then become apparent; but if the request first has to pass through a level of abstraction that the model was not trained for, it often gets confused.

That's really the essence of it: express your request as language.

@boris_steipe does this mean that you claim that a language model can handle performative knowledge (know-how) or proactive knowledge (we're limiting the discussion to the examples of riddles and counterfactuals)? I'm very confused about what you're trying to say with your first comment here

@mapto

Yes, that's what I mean. If you can give me an example of each that would satisfy your definition, I'll be happy to demonstrate.

@boris_steipe but if so how is it not general intelligence? Do you mean that mastery of language is sufficient for reasoning, decision making, and conditionality? I still feel lost about this conversation, sorry

@mapto

Do I mean that "mastery of language is sufficient for reasoning, decision making, and conditionality?"

Yes, I'll go out on a limb and say that I do – with some reservations. One is that ChatGPT has not "mastered" language, but it has become very good at it – there is certainly scope for improvement. The other is that terms like "reasoning", "compositionality" etc. are themselves concepts, labels we apply to organize the large space of possible abstractions of language. They mean different things to different people in different contexts. A discussion of these terms may already be biased by its premises. A productive discussion is better off looking at actual examples, and then discussing whether or not the example fairly maps to the respective concepts.

So let's take a look at compositionality, simply because it has been named as one of the aspects of thought that LLMs lack and that are deemed to be critical on the path to AGI.

Let's work off a recent definition cited by Frankland & Greene (2020, DOI: 10.1146/annurev-psych-122216-011829): "The principle of compositionality holds that the meaning of a complex expression is a function of the meaning of its parts and the way in which they are combined" (according to Frege); this gives rise to the idea of a compositional "language of thought".
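
To make Frege's principle concrete, here is a toy sketch in Python (my own illustration with a made-up lexicon, not from the paper): the meaning of a complex expression is computed from the meanings of its parts plus the rule that combines them.

-----------------
# Toy model of compositionality: meaning(whole) is a function of the
# meanings of the parts and their mode of combination. Illustrative only.

lexicon = {
    "two": 2,
    "three": 3,
    "plus": lambda a, b: a + b,
    "times": lambda a, b: a * b,
}

def meaning(expr):
    """Meaning of an expression = f(meanings of the parts, combination)."""
    if isinstance(expr, str):        # atomic part: look up its meaning
        return lexicon[expr]
    left, op, right = expr           # complex part: combine part-meanings
    return lexicon[op](meaning(left), meaning(right))

# "two plus (three times two)" – substituting any part changes the whole
# in a predictable way, which is the decomposition/recomposition test.
print(meaning(("two", "plus", ("three", "times", "two"))))  # -> 8
-----------------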

In order to establish such compositionality, we can ask: is ChatGPT able to infer the consequences of a novel utterance from its components? We can also ask whether it can substitute a component, since that requires decomposition and recomposition. Let's give that a try.

----------------------------------------------
Human:
I want you to act as an Understander. As an Understander you examine a sentence, interpret it, answer questions, and change it to explore its meaning and changes of meaning. Here is the sentence:

"The bells rang out like laughter as the frog and the goat marched down the aisle – was there a note of ridicule in the sound?"

ChatGPT:
As an Understander, I interpret the sentence to mean that there were bells ringing loudly, resembling the sound of laughter, while a frog and a goat walked down an aisle. The sentence then asks whether there was a hint of mockery or contempt in the ringing of the bells.

Here are some possible questions that could be asked about this sentence:

What is the sound of the bells compared to?
Answer: The sound of the bells is compared to laughter.

Who is walking down the aisle?
Answer: A frog and a goat are walking down the aisle.

What is the tone or mood of the sentence?
Answer: The tone of the sentence could be interpreted as whimsical or surreal, but there is also an element of uncertainty or suspicion as the question at the end suggests that the sound of the bells may have had a negative connotation.

What might the bells represent?
Answer: The bells could be symbolic of celebration or joy, but they could also be symbolic of ridicule or mockery, depending on the context and interpretation of the sentence.

As an Understander, I can also change the sentence to explore its meaning and changes of meaning. Here are some possible variations:

"The bells rang out like thunder as the bride and groom walked down the aisle - did they feel intimidated by the sound?"
This sentence replaces the frog and goat with a bride and groom and changes the comparison of the sound of the bells to thunder. The question at the end suggests a different tone, one of concern or apprehension.

"The bells rang out like church bells as the procession made its way to the altar - did they signify a joyful occasion?"
This sentence replaces the comparison of the sound of the bells with a more specific reference to church bells and suggests that the occasion is a happy one.

"The bells rang out like a taunting chant as the opposing team took the field - were they meant to intimidate?"
This sentence changes the context entirely, replacing the aisle and procession with a sports game and suggesting that the sound of the bells is meant to taunt or intimidate the opposing team.
-----------------------------------------

This answer certainly checks my boxes on compositionality. It was asked in a fresh thread, i.e. the prompt formed the entire content of the context window. And the example is _not_ in the training data; I just made it up on the spot.

The bottom line is: apparently we do not need to encode higher-order aspects of thought explicitly; they emerge implicitly in large language models. But this is to be expected: language is not _what_ we think, language is _how_ we think. If the training data is large enough to provide a significant subset of human language, and the model is large enough, you get the emergence of mind-like abilities. How could you not?

This is actually a lesson that could have been learned in 2017, when AlphaZero (an algorithm that trained itself) surpassed the chess-playing abilities of DeepBlue (the last of the large expert systems). You can't really analyze how such abilities are represented in the network – they are distributed, and they have no need to partition neatly along our conceptual boundaries. And it would be silly to require the abilities to map neatly into our favourite conceptual space before we are even willing to acknowledge that the ability exists.

In this sense it is truly unfortunate that the current perception focusses so much on misunderstanding what ChatGPT is (an LLM, not an AGI), how it works ("Schrödinger Facts", not necessarily truths), what it cannot do (e.g. counting characters and scanning meter – typically tasks that would be trivial for an AI trained on them), and how it does not align with this or that aspect of human thought.

I would really hope to see more discussion that recognizes such surprising emergent abilities (e.g. spatial abstractions like "on top of"), and through that contributes to something important and constructive: an exploration of what the limits of emergence in Large Language Models really are.

This is not an academic question.

It determines the horizon on which to expect actual AGI.

@boris_steipe there's too much happening in your post, so I'll try to respond in pieces. First, I hope you don't mind me asking, but why and how (to me the interesting part is almost exclusively in the why and how) did you come up with your example? I hope you would agree if I define it as nonsensical. I mean it in the sense that to me (and I agree that in that regard I might be comparable to an LLM, certainly losing in terms of common knowledge) the different parts of your sentence do not relate, so trying to find relationships is a blue-sky task, i.e. one can invent interpretations, but due to the absence of a ground truth, they would be indistinguishable from hallucinations. Then of course the model relates the stochastically most probable objects to the ones already mentioned. Finally, nonsensical text is a genre of its own (Wodehouse comes to mind), so having it as a test case is ok. I just don't see how it could be insightful

@boris_steipe then your idea of compositionality is very syntactical. I wouldn't agree that the whole is the sum of the parts, but would think that there's additional meaning in the fact that these parts are put together. When I read about a frog and a goat together, I don't only think of the two animals, but also of what might bring them together and what might make one stand out next to the other. To ground it in some theory, I can relate it to phenomenography, where, to understand a phenomenon, people rely not only on contrast (presence or absence of a feature – an object in your phrase, say), but also on separation, generalisation and fusion, i.e. how it interacts with its context. doi.org/10.1207/s15327809jls15

@boris_steipe here's another example: ChatGPT got me confused about my family. It's in the counting and the "greats" that rote learning (en.wikipedia.org/wiki/Rote_lea) fails. See below:


mapto:
Polihron is the grandfather of Rusko. Rusko has a grandson who is also called Rusko. This child is my grandfather. What is my relationship to Polihron?

ChatGPT:
If Rusko (the grandson) is your grandfather, then Polihron must be your great-great-grandfather.

Here is the reasoning:

Polihron is the grandfather of Rusko (the first generation).
Rusko (the first generation) has a son who has a son who is also named Rusko (the third generation, your grandfather).
Therefore, Polihron is the great-grandfather of Rusko (the third generation and your grandfather).
Since Polihron is the great-grandfather of your grandfather, that makes Polihron your great-great-grandfather.

mapto:
And what is my son Lazar to Polihron?

ChatGPT:
If Polihron is your great-great-grandfather, then your son Lazar would be Polihron's great-great-great-grandson.

Here is the reasoning:

Polihron is your great-great-grandfather (the first generation).
You are one generation away from Polihron (the second generation).
Your son Lazar is two generations away from you (the third generation).
Therefore, Lazar is three generations away from Polihron, making him Polihron's great-great-great-grandson.

@mapto

Your question maps to similar questions about abstract counting abilities. Correct answers are usually elicited with a variant of "step-by-step" prompts. I changed your input slightly because the duplicate name was confusing me more than it confused ChatGPT (actually, ChatGPT several times anticipated that and referred to them as Rusko Sr. and Rusko Jr.). Here is an example of the correct reasoning:

-----------------
Prompt:
Abel is the grandfather of Bertram. Bertram has a grandson who is called Cyrus. This child is my grandfather. How many generations is Abel removed from me?

Response:
If Cyrus is your grandfather, that means you are two generations below Cyrus (Cyrus -> your parent -> you). Since Bertram is the grandfather of Cyrus, there are two generations between Bertram and Cyrus (Bertram -> Cyrus's parent -> Cyrus). Abel is the grandfather of Bertram, so there are two generations between Abel and Bertram (Abel -> Bertram's parent -> Bertram).

To find out how many generations Abel is removed from you, we can add up the generations between each person:

Abel -> Bertram (2 generations) + Bertram -> Cyrus (2 generations) + Cyrus -> you (2 generations) = 6 generations

So, Abel is removed from you by 6 generations.

[I believe this is correct. Now we need abstract counting. Here: how to repeat something six times.]

Prompt:
If a male ancestor is removed from me by 6 generations, how is this relationship expressed in English? Consider this step by step.

Response:
In English, relationships are described based on the number of generations separating two individuals. When there is a 6-generation gap between you and a male ancestor, you can describe this relationship step by step as follows:

Your parent - This is one generation above you, so your male ancestor at this level would be your father.
Your grandparent - Two generations above you, your male ancestor would be your grandfather.
Your great-grandparent - Three generations above you, your male ancestor would be your great-grandfather.
Your great-great-grandparent - Four generations above you, your male ancestor would be your great-great-grandfather.
Your great-great-great-grandparent - Five generations above you, your male ancestor would be your great-great-great-grandfather.
Your great-great-great-great-grandparent - Six generations above you, your male ancestor would be your great-great-great-great-grandfather.
So, a male ancestor removed from you by 6 generations would be called your great-great-great-great-grandfather.

[Again, I think this is correct. The reasoning is systematic, takes the special cases of father and grandfather correctly into account, and iterates the required number of times.]
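
For what it's worth, the rule ChatGPT iterates here can be written down in a few lines of Python (a minimal sketch under my reading of the convention: one generation up is "father", two is "grandfather", and each further generation adds one "great-"):

-----------------
def ancestor_label(generations: int) -> str:
    """English label for a male ancestor N generations above you."""
    if generations == 1:
        return "father"
    if generations == 2:
        return "grandfather"
    # Each generation beyond the grandfather adds one "great-" prefix.
    return "great-" * (generations - 2) + "grandfather"

# Abel -> Bertram: 2 generations; Bertram -> Cyrus: 2; Cyrus -> me: 2.
print(ancestor_label(2 + 2 + 2))  # -> great-great-great-great-grandfather
-----------------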

Do you agree that this is a correct solution and that it illustrates the ability of the LLM to reason about generations and their labels?

(ChatGPT-4, 2023-03-14)

@boris_steipe you've simplified the syntax to strip away the semantics and you got the correct answer. Yes, I confirm this is the case.

@boris_steipe sorry, let me try to explain better. This is all about intent. Is the objective that we use technology to explain things to us, or do we want to be able to explain things to technology in a way that it can reproduce basic questions whose responses have no added value and arguably are reduced to fact-checking? I'm interested in the former; the latter is what I see in your guiding simplifications. Maybe you meant it differently?

Remember from school that facts are the most basic type of knowledge. The more valuable kinds are functional and others that are widely called conditional but, as I hinted in my article, are very diverse.

@mapto

I respectfully disagree. Your article claimed that there were fundamental limitations to the performance of LLMs. I noted that we are in fact not sure about the limitations, because they are difficult to disentangle from their semantic presentation. In particular, I had observed abilities that in my understanding would cover many if not all of the higher-order processing abilities you had listed. They can be elicited.

First, it is surprising that such emergent abilities appear at all, especially given how these algorithms work: their "world" is based entirely on relationships between abstract symbols and has no grounding in experience at all, and their training consists merely in predicting the next token in a sequence. That we see higher-order "understanding" emerge under these circumstances is profound. It strongly supports the idea that language and thought are deeply linked: language is not what we think, it is how we think.

Second, what must not be conflated are the ability to reason in principle and the ability to parse human language, with all of its ambiguities, unstated assumptions, and discursive conventions. (1)

When you say "explain things to technology", you are referring to the latter: "parsing human language". This is a different question from the ability to "reason". Now, the current dialogical abilities of LLMs are already a radical departure. Remember: previously, this process of "explaining things" was called programming.

Of course there are limitations. Where I disagree with you is: these limitations of communicating intent are not the same thing as a lack of reasoning.

The distinction is important because it can lead to significantly improved interactions. Jason Wei of Google has done groundbreaking work on "chain-of-thought prompting", and similar work from Google Brain on "Least-to-most prompting" shows that there are probably many additional strategies to be discovered.
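
To make the contrast concrete, here is schematically what such prompting looks like (the phrasing is my own paraphrase, not taken from Wei et al.):

-----------------
# Direct prompt vs. chain-of-thought prompt for the task above.
# Schematic sketch only; the wording is mine, not Wei et al.'s.

question = (
    "Abel is the grandfather of Bertram. Bertram has a grandson who is "
    "called Cyrus. This child is my grandfather. How many generations "
    "is Abel removed from me?"
)

# One-shot request: the model may jump straight to a (possibly wrong) label.
direct_prompt = question

# Chain-of-thought variant: ask for the intermediate steps explicitly.
chain_of_thought_prompt = (
    question
    + "\n\nLet's think step by step: count the generations between each "
    "pair of people, then add them up."
)
-----------------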

All the best.

---
(1) One of the more surprising aspects of ChatGPT is that it embodies Paul Grice's "cooperative principle" – the well-known maxims of quantity, quality, relation, and manner.

@boris_steipe it's clear we'll not agree, but I struggle to comprehend what thought process would lead to the apparent confidence of saying "ChatGPT embodies Paul Grice's 'cooperative principle'". Is there some sort of widely agreed verification process that it has passed, or is this based on personal observations?
