CatComConM2023: 5. Harnad, S. (2003) The Symbol Grounding Problem

Monday, August 28, 2023

5. Harnad, S. (2003) The Symbol Grounding Problem

Harnad, S. (2003) The Symbol Grounding Problem. Encylopedia of Cognitive Science. Nature Publishing Group. Macmillan.

or:

Harnad, S. (1990). The symbol grounding problem. Physica D: Nonlinear Phenomena, 42(1), 335-346.

or:

https://en.wikipedia.org/wiki/Symbol_grounding

The Symbol Grounding Problem is related to the problem of how words get their meanings, and of what meanings are. The problem of meaning is in turn related to the problem of consciousness, or how it is that mental states are meaningful.

If you can't think of anything to skywrite, this might give you some ideas:

Taddeo, M., & Floridi, L. (2005). Solving the symbol grounding problem: a critical review of fifteen years of research. Journal of Experimental & Theoretical Artificial Intelligence, 17(4), 419-445.
Steels, L. (2008) The Symbol Grounding Problem Has Been Solved. So What's Next?

In M. de Vega (Ed.), Symbols and Embodiment: Debates on Meaning and Cognition. Oxford University Press.
Barsalou, L. W. (2010). Grounded cognition: past, present, and future. Topics in Cognitive Science, 2(4), 716-724.
Bringsjord, S. (2014) The Symbol Grounding Problem... Remains Unsolved. Journal of Experimental & Theoretical Artificial Intelligence (in press)

288 comments:

CATCOMCONSeptember 18, 2023 at 7:30 AM
NOTE TO EVERYONE: Before posting, please always read the other commentaries in the thread (and especially my replies) so you don't just repeat the same thing.
ReplyDelete
Replies
Aashiha BabuSeptember 26, 2023 at 2:44 PM
I found this reading to be very kid-Sib friendly and I enjoyed how it succinctly summarized the material we have discussed in the past month. I was unsure about what the Symbol-Grounding Problem is during the first week and reading the Wikipedia article only confused me further. Reading the Scholarpedia article today had a different impact; I think I have a basic understanding of this problem! Put simply, it explores the relationship between a symbol representing a thing, someone interpreting the symbol and understanding that the symbol refers to the thing in question. The article also explored the idea of consciousness (i.e. our capacity to ground the symbols to their referent).

In Linguistics, we say that the meaning of complex expressions arise from the meaning of the constituent expressions. I suppose that grounding is what allows us to get the meaning in the first place, but I find it interesting that the Symbol Grounding Problem lightly touches on how we connect syntax and semantics (my favourite part of Ling).
ReplyDelete
Replies
Jenny CarbertSeptember 29, 2023 at 12:29 PM
The dictionary-go-round example was really useful for me to further understand symbol grounding as it makes clear exactly how we must use symbol grounding to connect words and referents together. Specifically, the second version of this example, describes how attempting to learn Chinese as a first language from only the use of a Chinese/Chinese dictionary would prove to be essentially impossible because you have no way to assign any meaning to the symbols as you wouldn’t know the other symbols used to describe it. Thus, resulting in an endless cycle of trying to understand the language without any way to know what any of it means. This is because you have no possible way of grounding these symbols in meaning from previous knowledge or experience.
ReplyDelete
Replies
Kaitlin PearsonSeptember 29, 2023 at 2:01 PM
Is there a way to maybe pin this comment?
ReplyDelete
Replies
Kaitlin PearsonSeptember 29, 2023 at 3:41 PM
Side note: am I able to use "refer" or "reference" under this context?
ReplyDelete
Replies
Kaitlin PearsonSeptember 29, 2023 at 4:10 PM
As for Harnad's 2003 paper, I concur that this paper is much more straightforward than the Wikipedia page.
This also gave me a lot more clarity on the nature of ChatGPT and what we were talking about with “statistical parrots”. While it is able to use statistical analysis to retrieve and summarize the vast amount of information it has been fed, it still lacks many capabilities because it cannot understand what these things mean or interact with them in a real life context. The question then is how to implement grounding in a non-human system. To do so, we would need to establish feature-detecting mechanisms in the system that can detect the sensorimotor features of the environment. Then, we would need something to entrain the system on how to select proper referents. To do so, we would first need to establish what the correct way to do that is, but I believe that is feasible within the realm of cognitive science. The other property essential to symbol grounding discussed by Harnard is consciousness, where we then once again run into the hard problem. However, speaking in terms of a T3-Turing test, it may be enough for the system to be able to do what we do, without knowing how.
ReplyDelete
Replies
Shona KellySeptember 29, 2023 at 6:00 PM
Professor Harnad’s article “The Symbol Grounding Problem” touches on a question I have had throughout the duration of this course—if cognition is not just computation, what is that additional ‘thing’ that is going on inside our heads that makes it so we feel like we understand? Searle’s CRA highlights that simple computation may result in indistinguishable output, such that it appears one understands, but it lacks this elusive quality of feeling like one understands. Through the CRA Searle shows that computation alone does not result in our feeling of understanding—so what does? This article puts forward symbol grounding as part of the answer.
In this article Professor Harnad clarifies that a symbol system (“a set of symbols and syntactic rules for manipulating them on the basis of their shapes”) on its own does not have the capacity for grounding (the ability for a symbol to pick out their referents). Grounding therefore is not purely computational, and is instead “a dynamical (implementation-dependent) property.” Grounding is necessary for meaning, as it allows one to “detect, categorize, identify, and act upon the things that words and sentences refer to.” Thus, the idea of grounded symbols allows us to better understand how “meaningless strings of squiggles become meaningful thoughts” in our brain, as it explains how the symbol system in our mind can be connected to the referents we interact with, and derive meaning from, in the sensory world.
ReplyDelete
Replies
Adrien ZwaansSeptember 29, 2023 at 10:31 PM
This article establishes a distinction between meaning of words and symbol manipulation under the light of the computationalism hypothesis. As it’s said in the first paragraph, « computation in turn is just symbol manipulation » but if computationalism assumes that brain is computation, then thinking is reduced to a form of symbol manipulation, which brings us back to the importance of the Turing Test in delimiting what is thinking from just answering questions with the most probable word following each other as would ChatGPT do.
If I understood the aim of this article correctly, to distinguish between symbols or shapes and meanings, we need to understand what is the referent of the word. But, since this process is implementation-dependent and must be associated with sensorimotor properties, I don't understand how we could assume that computationalism is true when it’s said to be implementation - independent.
ReplyDelete
Replies
Melika YadmelatSeptember 29, 2023 at 10:59 PM
The Harnad 2003 reading opened my eyes to the symbol grounding problem and the reasons why meaning and reference are not the same thing. From my understanding, a word's referent is the thing it refers to, whereas meaning relates to the word’s sense. To better understand this, let’s look at an example where "red" and "flower" are characteristics of roses. The term "rose" can apply to any "red flower." These fall under categories, each of which has a name. As a result, the definition only provides you a new name provided you already know what those names and their categories relate to. However, without previous knowledge of what "red" refers, even looking it up in the a dictionary will not get you far, as it would only lead to a cycle of endless meaningless definitions and here is where the symbol grounding problem comes in. In such a case, looking for meaning would be ungrounded. Here, we see that meaning and reference are not synonymous, and are in fact two very different things.
ReplyDelete
Replies
Melika YadmelatSeptember 29, 2023 at 11:16 PM
The Harnad 1990 reading highlighted one of the most interesting notions of the symbol grounding problem in regards to the idea that learning Chinese as a first language requires a great deal more than just using a monolingual Chinese dictionary in order to really understand this language. In my child development class, we saw that babies start learning their native language as early as in the womb! In fact, exposure to the speech of their mother allows the malleable and developing brain of babies to start the learning process of their mother tongue. And as they grow up, they learn to form categories of the world around them, such as forming categories of “objects”, “animals”, etc. With time, these categories become more complex and have subcategories within them. This reading provided me with a deeper understand of this learning process, by highlighting that before even learning what things certain words refer to, categorization of these things from sensorimotor interactions with them comes first and is thus fundamental.
ReplyDelete
Replies
Josie BrooksSeptember 30, 2023 at 7:59 AM
I particularly enjoyed the section 1.3 "Connectionist Systems" as a theme that keeps coming up gets explained in a little more detail here. The theme being, do we fully need to understand the brain to understand cognition. This section highlights again that we know very little(although slightly more now) about the brain's vegetative functions, let alone about the brain's higher level functioning. However, the article suggests that this doesn't mean we can't learn more about cognition, and that taking the symbolic model of the mind and/or connectionism will be promising tools in better understanding cognition.

To me, this again begs the question: Do we actually need T4, or does T3 or even T2 suffice?
ReplyDelete
Replies
Selin Ida OmurSeptember 30, 2023 at 11:50 AM
When I was reading the Symbol Grounding Problem, I was confused a little bit until Searle’s Chinese Room Argument. What I thought about when reading that part was that symbol grounding comes with meaning, as Searle was able to manipulate the symbols without understanding the meaning behind the Chinese words, but the symbols were not grounded(?). It is mentioned that Searle wouldn’t have a conscious understanding, which points to our unconscious mind, however, if Searle had never been exposed to Chinese before, how would there be an unconscious understanding?
ReplyDelete
Replies
InstructorSeptember 30, 2023 at 12:12 PM
Yes, "refer" and "referent" (sic) are the right words; but "association" is a weasel-word.

The connection between the symbol and its referent (i.e., between the word "apple" and the apples in the world that it refers to) is made through the sensorimotor capacity to recognize and identify apples and to do with apples what you do with apples but not with axes or axioms. That is sensorimotor grounding.

"Intentionality" and "intrinsic" are weasel-words. ("Intrinsic" is just referring to the fact that grounding is a connection between a word in a speaker's head and the referent of the word: that connection must be direct, not indirect, through a second speaker's head -- but the connection can be made through a definition in words given to the first speaker by the second speaker, as long as the defining words are already grounded for both speakers. This is indirect grounding through language; this will become much clearer across the next few weeks.)

To do be able to connect words to their referents, you need sensorimotor capacities: (1) to move, manipulate, see, touch, smell, and taste; (2) to learn what apples are by learning to detect the sensorimotor features (size, shape, color, taste) that distinguish apples from axes; and (3) to learn what's the right thing to do with apples (pick them, eat them, not use them to try to chop wood).

[What is information and categorization?]

[Can you think of any other ways to ground "apple" in its referent?]

To ground the word "axiom" you would need more: indirect grounding through (grounded) language. (We'll get to that, once we get to language.)

What is arbitrary is the shape of the symbol that we all agree to use to refer to apples: "apple" does not look like an apple. (Neither does « pomme » or "alma" or 蘋果.)

I'm not sure where you got the "cow/moo" story, but children have to learn the difference between the features of cows, calves, bulls and bison, as well as their sounds, before they can call a cow a "moo." (Where did you get that example?) (Imitating a "moo" would have been a good example for a non-language way to call someone's attention to a cow -- if they knew the look and sound of a cow, but "moo" is not an arbitrary symbol: why not?)

"Consciousness" (even though I too am guilty of having used it in the past) is also a weasel-word. The non-weasel version is: "a state that it FEELS LIKE SOMETHING to be in." The latinate form is "sentience."

Understanding is T3-grounding + the feeling of understanding.
ReplyDelete
Replies
miriamhotterSeptember 30, 2023 at 3:14 PM
I'm not sure I'm convinced that the symbol grounding problem has been solved. Although Steels' robots are autonomous, there are many organisms that are autonomous but lack genuine cognition (ex:plants, bacteria, etc). Professor Harnad also states that symbol grounding requires more than autonomy; it requires analog and dynamic sensorimotor processes, which Steels' robots lack. I think that resolving the problem using robots that ground colours through sensorimotor perceptions seems restricted and it's unclear how this process extends to more intricate tasks. I just don't see how his robots are anything more than t0 and am not convinced that they are valuable in solving the symbol grounding problem...(But I'm open to having my mind changed if I haven't fully understood)
ReplyDelete
Replies
Garance BarnoinSeptember 30, 2023 at 3:34 PM
Once again in the second reading Harnard, S. (2003), in the formal symbols section – I must disagree with the “how” we came up with the symbols. Right now it is true that the symbols we are using do not have a clear and ostentatious link with the objects they refer too. However, there was a time when some symbols did have a clear link. For example, 2 did relate to two-ness, it used to be written in a way where the symbols had 2 angles hence 2. Though, I get that there are still gaps between two-ness, the number 2, the fact that we call it two and two objects. I also agree with how powerful the conversion to meaning in our heads is extremely powerful.
ReplyDelete
Replies
InstructorSeptember 30, 2023 at 4:59 PM
1. What it a Turing Machine? What can and can't it do?

2. That is a T3-passing robot? What can and can't it do?

You've answered your own question.
ReplyDelete
Replies
miriamhotterSeptember 30, 2023 at 6:19 PM
Hi Megan! Like you said, a Turing Machine is only computational, as it's only capable of executing algorithmic processes. Since it can only operate based on predefined rules to manipulate symbols on an infinite tape, it lacks sensorimotor abilities and is ,therefore, uncapbale of symbol grounding. On the other hand, a T3 robot possesses sensorimotor abilities which allows it to connect symbols with real-world meaning.
ReplyDelete
Replies
Tina XueOctober 1, 2023 at 12:24 AM
After reading the two -pedias, I realize more and more that symbol grounding is also a discussed topic in philosophy. For example, the process of picking referents has similar notions in the study of phenomenology, whose focus is to study conscious experiences. Phenomenology says that each cognize-capable creature has its own "world," and that only the things it has interacted with are in its world. Not only the world has physical environments, but it also includes personal perceptions and understanding. The symbols are connected to our living experiences, thus we can derive meanings from them.
What I've noticed so far is that, cognitive science and some branch of philosophy are really studying the same thing, yet the terminology barriers prevents disciplines to communicate effectively and combine their efforts. However, the differece is that cog sci also considers scientific investigation, in which doing experiments to ground our conceptual understanding of consciousness.

I am no experts on phenomenology nor philosophy in general, I consult with ChatGPT (:D) to have a better grasp of them, do correct me if I'm wrong please.
ReplyDelete
Replies
InstructorOctober 1, 2023 at 7:46 AM
That's it.
ReplyDelete
Replies
Shona KellyOctober 1, 2023 at 1:13 PM
The articles on symbol grounding assigned this week connect to the distinction between T2/T3 machines that have been discussed in class—especially the question of why we cannot merely equip a T2 with a camera, such that we can say the T2 takes in ‘sensory’ stimuli, and say that it is equivalent to a T3.
In the scholarpedia article, Harnad argues that the way to ground symbols is through the addition of “nonsymbolic, sensorimotor capacities” that facilitate autonomous interaction with the “objects, events, actions, properties, and states that its symbols are systematically interpretable. . .as referring to.” Is this autonomous interaction with the environment, which is possible with a T3 machine, what separates a T3 from a T2?
ReplyDelete
Replies
Lillian WilliamsOctober 1, 2023 at 4:38 PM
I found the “Symbol Grounding Problem” Scholarpedia reading to be very straightforward in explaining a difficult concept. In this problem, a symbol is an object within a symbol system, which are sets of symbols and manipulation rules. Symbols are manipulated according to their shape, and importantly not their meaning. The symbols can be interpreted as having meaning and referents, but the shape of the symbols themselves don’t have direct relation to their meanings and referents. If I understand correctly (borrowing an example from a children’s book) a pen could just as easily have been called a “frindle” without the object changing. The choice of the letters p-e-n, don’t have any resemblance to the actual object, we just interpret the sequence of letters to refer to a pen. Within the symbol grounding problem, grounding refers to the ability of symbols within a brain to pick out their referents. This process is implementation-dependent, and isn’t seen in symbols written on the page or in computer code. In order to achieve groundedness, a symbol system would have to have sensorimotor abilities, and be able to interact with its surroundings, ie, perceive and use a pen in accordance with its own understanding of a pen’s meaning. Professor Harnad argues that this necessity for groundedness means that the T2 Turing Test involving only symbols is insufficient, and must instead be replaced by a hybrid symbolic/sensorimotor test. Expanding on this, in order to achieve groundedness the hybrid robot must be able to categorize. For example, it must be able to put a “pen” in the category of “writing tools” and interact with it accordingly.
ReplyDelete
Replies
Jessica LauOctober 1, 2023 at 9:49 PM
While reading the section in The Symbol Grounding Problem (Harnard, 2003) about how we use categorical representation to identify/name things, I started thinking about how our boundaries for what constitutes a member of a category (in other words, the invariant features of that category) change depending on the language that we speak. For example, I remember learning in a linguistics class that in Russian they have a different word for light versus dark blues, thus affecting how they discriminate colours. This relates to the Whorf Hypothesis which we discussed in class, and also shows that the invariant features that establish categories, which in turn affects how our symbols are grounded, are arbitrary to some extent.
ReplyDelete
Replies
Ethan ChurchillOctober 1, 2023 at 9:52 PM
I was wondering if I understood the section “A complementary Role for Connectionism” from the 1990 article as it was intended. In section 4 on connectionism, it states that combining the symbolic and connectionist approach (described throughout the article) is the closest we can currently get to solving the symbol grounding problem. Associating a symbol to its referent is inherent in the human brain (implementation-dependent), and we do that through categorization and discrimination based on our lived experience (a connectionist approach). Once this connection is made and we ground elementary symbols this way, the rest of the symbol strings if a natural language will inherit the grounding of these elementary grounded symbols that they are composed of (the more symbolic approach).

I was also wondering how supervised learning fits into all of this. From my understanding, I form categories as I observe others interact with the environment around them as well as being explicitly told what categories x object belongs with. Is supervised learning how connections initially form, that allow me to ground symbols in my brain?
ReplyDelete
Replies
Aashiha BabuOctober 2, 2023 at 11:02 AM
If I understand this correctly, in a nutshell:

T2 + symbol-grounding ability (which includes sensorimotor abilities) = T3

...Is this correct?
ReplyDelete
Replies
Megan TierneyOctober 2, 2023 at 11:07 AM
I think that an interesting question to think about is what is the smallest amount of symbol grounded language that would be necessary to be able to understand and interact with every object. The example of needing to know what a zebra is without ever learning the word zebra but already knowing stripe and horse is what made me consider this. I think young school-aged children are sort of an example of this idea, because they have enough language to understand most of the world, but their vocabulary is still limited as compared to an educated adult. I don’t think that this would necessarily be helpful in reverse-engineering the mind, but it is an interesting thought.
ReplyDelete
Replies
elliot durkeeOctober 2, 2023 at 11:41 AM
At first, it was a bit hard for me to understand why sensorimotor abilities would avoid an infinite regress and would allow a system to be grounded, but I think I've got it. First I had to understand that the function of sensorimotor abilities must be more than just data-collection about the world, otherwise the Big Gulp would have done the trick, and we wouldn't need a T3. So then what's the point of sensorimotor abilities in the meaning grounding problem if it's not just data-collection?

From what I understand, the importance of sensorimotor abilities is data-collection in a particular (probably active) way, AND about the ability to ACT ON the world, that allows us to ground meaning in experience, and not just in data (but this does still seem a bit circular in that you need to be able to feel in order to experience, and you probably need to have grounded the world in meaning to feel, so maybe I am super wrong on this last bit). The analogy that comes to mind is that of Active vs. Passive Kittens.

I think this understanding of the role of sensorimotor abilities is what leads us to Categorization as vitally important for doing the things that a thinking thing does.
ReplyDelete
Replies
Valentina MartinezOctober 2, 2023 at 12:18 PM
I found it a little bit hard to understand what the symbol grounding problem is and the proposed solution to it, but based on what I think I understood from the reading (Harnad 1990), an evident solution for the symbol grounding problem has not been found yet, however, according to the reading, the closest we can get to solving this problem is with a hybrid system that combines symbolism and connectionism. In fact, connectionism can explain how exposure to sensory projections and feedback allows us to create a link and connect objects that we see to symbols. The symbolic system then shows that once we ground these elementary symbols, the rest of the symbol strings of a natural language can be generated by symbol composition and they will inherit the grounding of the elementary symbols that they are composed of. So basically, using the Zebra = horse + stripes example, this hybrid system would begin by connecting objects such as a “horse” and what it identifies as “stripes” to symbols and then, if it encounters the image of a “Zebra”, the symbols for “horse” and “stripes” would combine and lead the system to find a symbol for zebra that would be grounded in the same way that horse and stripes are grounded.
ReplyDelete
Replies
Csenge MagyarOctober 2, 2023 at 1:20 PM
I first read the scholarpedia text on Symbol grounding. I think it helped me finally distinguish the easy problem from the hard problem. The easy problem is how we DO things - doing the right thing with the right kind of thing, categorization being “grounded” in its functional state. Solving the symbol problem is the solution to the easy problem. But how we FEEL things is still not explained by a symbol grounding. There seems to be something there that cognitive science might never be able to answer, and this is the hard problem.
ReplyDelete
Replies
Adrienne VandenbergOctober 2, 2023 at 7:09 PM
To add to this, one of the things that distinguishes T2 and T3 capacities that is highlighted in the article is the ability to categorize. A T2 machine can describe an apple with information it has been fed, but to be able to categorize (do the right thing with the right kind of thing), it must be able to interact with an apple using sensorimotor capabilities. This is why I think T2 is not sufficient for reverse-engineering cognitive capacity, it cannot interact with the world around it in order to mean something when it says "apple", "baseball", or "house", it can only use the words as ungrounded symbols.
ReplyDelete
Replies
Joann GuyoncourOctober 2, 2023 at 7:28 PM
First comment: Why are motor capacities required for grounding?
Sensorimotor capacities are presented as the solution to the symbol grounding problem in the Scholarpedia article: “So ultimately, grounding has to be sensorimotor”. However, I’m wondering why motor abilities are required for grounding and not just sensory capacities. I see how interacting with the environment can be useful in grounding language by allowing us to learn causal relations between objects, actions, properties, and events. But this would imply that people lacking motor capacities since birth don’t have a sense of grounding. However, that feels wrong to me since I work with physically disabled people who sometimes have no capacity to physically interact with their environment, yet they do have a sense of grounding, i.e., they know what the word “apple” means. One could argue that these people are not totally lacking motor capacities since some of them are necessary for basic functions like eating, breathing, etc., and for communication (how could I know that this person knows what an apple is otherwise) but these are really low-level interactions with the environment. Moreover, we could think of a computer program that has access to a realtime camera and which is able to analyze the images and recognize objects. This program would be able to identify what the word “apple” refers to. Yes, it won’t have a sense of what it tastes, smells, or feels like, but it does have a sense of what it looks like. One might respond that it is just analyzing a numerical image composed of pixels, but isn’t this also what the brain is doing minus the subjective experience? While it seems that such a program doesn’t have a sense of what the word “apple” means (because it seems to require some level of consciousness according to this article), it still knows what the word “apple” refers to - which is grounding (at least in the visual sense). So, why couldn’t sensory abilities be enough for grounding?
ReplyDelete
Replies
Thomas LemoineOctober 2, 2023 at 9:47 PM
The Scholarpedia article serves as a fascinating exploration of analytical thinking under conditions of limited empirical evidence, a viewpoint that, in my opinion, is partially contradicted by developments in modern machine learning.

Within the article, we are presented with some theoretical framework in which to analyse words and their meaning. The word, or symbol, points to some referent or reference class within the world, and also points to the word’s meaning, which is the brain's way of identifying a referent. One could think of meaning as either a generating rule or a compressed method for mapping "object-space" to referents that are distinguished by a specific trait. Then, words that cannot be selectively applied to some objects over others lack meaning. For instance, the term 'blurgle' is meaningless, as one couldn't realistically identify an object that qualifies or disqualifies for 'blurgle' status. Then comes the question: How does the brain, and in fact any thinking machine, match the symbols to their meaning, without first being given a minimum amount of words to use to define the rest? It is simple enough for a young child who knows English to learn new words to add to their vocabulary, but how do they initially match symbols to words? This is the Symbol-Grounding Problem.

It appears that modern machine learning algorithms, among other learning mechanisms, can indeed acquire languages like Chinese without external guidance, utilizing a sort of 'Chinese/Chinese Dictionary-Go-Round' mechanism. Gradient Descent, the learning algorithm behind modern machine learning, creates Universal Function Approximators (UFAs), meaning that, in the limit (size, training data), Artificial Neural Networks (ANNs) can match any functional input to any functional output, and as they grow the approximation improves until the limit of possibility. However, there are stronger theoretical reasons to think computation can learn Chinese, and the grounded meaning of symbols, from a mere string of “symbols”, whose simplest form is of course the bits 0 and 1.

Not only can large language models (LLMs) learn Chinese from Chinese text, but they can also learn the relations between words, whose structures match the structure of the outside world. In information theory, there's a concept known as Solomonoff Induction, a way to optimally predict future observations based on existing data. It weighs hypotheses based on how well they fit the data and how complex they are—simpler is usually better. Full disclosure: GPT-4 told me a bunch of things about Solomonoff induction, but I had heard of the concept before.

This is relevant to the Symbol-Grounding problem. Imagine a machine that learns Chinese symbols (words) and their meanings without prior knowledge. Using Solomonoff Induction, the machine starts with a range of hypotheses about the structure and meaning of these symbols. As it collects more data, incorrect hypotheses get ruled out or are given less weight, while correct ones gain in probability.

Over time, the machine learns not just to recognize characters but also understands the structure and grammar of the language, effectively grounding the symbols to their real-world referents. In essence, Solomonoff Induction suggests that computational agents can solve the Symbol-Grounding problem autonomously, moving beyond mere function approximation to achieve a deeper, semantic understanding. They can talk about their understarnding, they can show you by moving a robot’s wheels or displaying text, they can make predictions about the data, and invent theorems to investigate implications of their knowledge. What more could understanding be?
ReplyDelete
Replies
Aya AmerOctober 2, 2023 at 10:16 PM
I think the ability of chatGPT and similar models to manipulate language meaningfully (i.e. in a way that is meaningful to a human reader. I don't mean to suggest that chatGPT is capable of understanding) is evidence that using language does NOT require grounding, instead of evidence that grounding can be achieved purely by relational grounding or syntax. Searle’s Chinese Room argument showed that you can manipulate language in a manner similar to chatGPT without grounding (or understanding either). It also makes sense when you consider that language is a symbol system as defined in the Scholarpedia article, which means that it is a set of rules (syntax) for the manipulation of arbitrary symbols. Since the symbols, or words in this case, are arbitrary, you can manipulate them without issue even if they don’t have referents assigned. I believe this is part of what chatGPT does. Of course, the application of syntactic rules to words doesn’t explain chatGPT’s ability to give semantically appropriate responses to a prompt, only why these responses are grammatical.
ReplyDelete
Replies
Lillian WilliamsOctober 2, 2023 at 10:25 PM
In the 1990 article on the Symbol Grounding Problem, Professor Harnad explains the symbol grounding problem, the question of how a system can attach meaning to symbols such that the meanings are an inherent property of the system, rather than relying on outside interpreters. The reading then explains that a solution to the problem should include certain human capacities, like discrimination and identification. Discrimination meaning the ability to discern if two objects are the same and different, and identification meaning the ability to correctly identify the object. According to Professor Harnad, in this model we must be able to form iconic representations of the objects or inputs in question. The article references specifically the visual representation of a horse projected on our retinas that we can then compare to the next horse we see. I was just wondering how this process would work with other sensory modalities; I don’t doubt the model holds true, I'm just curious how it would apply. It’s easy to imagine matching an auditory stimulus with an iconic representation. For example, you could hear a fire alarm and recognize the pitch, rhythm, etc. Would this work the same for someone who was blind, and operating by touch? For example if they were to pet a sheep and a dog, how would the iconic representation be formed, and how would they discriminate between them? Or would the representation here just be fur, that is possibly combined with an auditory representation like “baa” or a “bark” to make a judgment?
ReplyDelete
Replies
Michelle ChenOctober 3, 2023 at 11:47 AM
In “The Symbol Ground Problem” (2003) by Professor Harnad, it was discussed how the rule-based manipulation of symbols that computation involves lacks a critical component: meaning. To achieve meaning would require establishing a relationship between these symbols and the entities they are linked to in the real world. As highlighted in the reading, accomplishing this specific connection is called being “grounded,” and can be implemented by incorporating sensorimotor experience into the system.
This article covered aspects of “grounding” that really showcased the remarkable abilities the mind has, which is why I found this particular theme in the reading to be the most fascinating. More specifically, it is the idea that there seems to exist a process in the mind that is naturally capable of forming this link between words and meanings that strikes me as most impressive— it leads me to ask more questions related to the field of Cognitive Neuroscience. I’m interested in delving deeper into this: how exactly does the brain create a link between this gap? Is it the physiological processes that Cognitive Neuroscience researchers focus on, and what variables would they use to study this? Are common techniques in modern-day neuroscience research effective enough to study this? For example, would tracking cell activity or recording areas of the brain be the best way to approach these areas of research— or, as discussed in last week’s Fodor reading, would correlational studies only limit us in their lack of explaining the “how” of neural processes?
ReplyDelete
Replies
Rosalie ZhangOctober 3, 2023 at 12:09 PM
The Symbol Grounding Problem readings got me thinking about the semiotic triangle, where a symbol stands for a referent and symbolizes a thought/concept, and that thought/concept refers to the referent. It's possible for a referent to have multiple symbols, and vice versa. This flexibility is a key aspect of language and symbolism, where symbols can represent various meanings and concepts, and one symbol can refer to different things in different contexts. It's fascinating how people across the world use various symbols, i.e., languages, to refer to the same things.

When we learn second, third, or more languages, we often relate back to the symbols in our first language as a reference point. It's like building a set of symbols in our minds to connect with different languages.

However, it's quite a challenge to imagine someone with no language at all. Even if you were to give them a dictionary, it would be impossible for them to learn and understand words and connect them to the real world because the words in the dictionary are “ungrounded" for them. The meaning of the words in someone’s head, those they do understand, are “grounded”, which implies that in the absence of comprehension and understanding, symbols are meaningless. A language is more than just a set of symbols; it's a complex system that involves shared meaning and understanding.
ReplyDelete
Replies
Thomas LemoineOctober 3, 2023 at 1:42 PM
Could "Understanding" be a weasel word? We ought to distinguish "phenomenological understanding", which is the feeling of having understood, and which may be, fundamentally, an illusion of some kind; and computational understanding, which is the mere ability to compute and infer different causal interactions between the referent and other stuff in the world; like knowing that letting go of an apple will make it fall if it's in a gravitational field, etc. It seems clear that GPT has computational understanding, but saying it "doesn't have understanding" in the first sense is pointless; how could we ever hope to know?
ReplyDelete
Replies
Fiona Wright-JonesOctober 3, 2023 at 2:49 PM
Here’s my summary. The meaning of a word on a page is ‘ungrounded’, but the meaning of a word in someone’s head, that they understand, is grounded because it’s connected in some way to the referent. Searle’s Chinese symbols would not be grounded because he didn’t understand them and therefore did not make a connection between the symbols and the things that the symbols are referring to (the referents). What our brain has which an arbitrary symbol system doesn’t is this ability to pick out referents, and is this simply due to our ability to interact with the world?
Now I'm going to digress. I find the idea of the robotic Turing Test viable in that the robot would have the ability to interact with the world, but I argue that the robot would need to have flexibility; something comparable to plasticity. Meaning for us humans seems to change with experience as we understand things in our environments in different ways, and although the robot may be able to go for a while pretending to be a human, it would have to be flexible enough to have these meanings change over time in order to continue being a decoy human.
ReplyDelete
Replies
Aya AmerOctober 3, 2023 at 4:22 PM
Here is my attempt at a summary of the Encyclopedia of Cognitive Science entry: the two leading approaches to developing a model for cognition are symbolism and connectivism. Symbolism is basically computationalism, since it holds that all cognition is the manipulation of a symbol system, or set of arbitrary symbols that can be combined according to symbolically represented rules to create interpretable statements. Its issue as a model of cognition is that it cannot explain how we can connect these symbols to their real-world referents. Connectionism says that cognition is the result of interactions between nodes, through weighted connections. The weights of these connections can be modified in response to feedback, to make the desired output more likely in the future. However, it doesn’t capture the compositional aspects of cognition. We can come up with a better model if we combine the connectionist and symbolist approaches. First we build up a library of terms that are iconically and categorically grounded. Iconic grounding is the idea that we have analog, nonsymbolic representations of sensory information in our heads. Categorical grounding is achieved through a connectionist system that takes icons as inputs and outputs their most defining features, which are then used to classify the icons as members or nonmembers of a given category. Then we are able to treat these category labels as symbols in a symbol system, thereby gaining the ability to combine and recombine them to produce new propositions. These propositions retain their groundedness because they can be eventually decomposed into grounded icons or categories.
ReplyDelete
Replies
MadeleineOctober 3, 2023 at 4:35 PM
I found these readings about the symbol grounding problem very interesting, as it grounded (no pun intended) my own understanding of how we think about meaning. The part of the scholarpedia page that mentioned the importance of a sensorimotor component to the one perceivinginterested me especially – I am curious specifically about whether this necessity is predicated more so on the ability to sense characteristics of external stimuli to facilitate accurate categorization or the ability to exert some effect on these stimuli and provide appropriate motor responses. Or, whether one must be able to do both the individual sensory and motor component to be able to completely understand and categorize objects. It makes sense to me that we need some capacity to interact with the world around us, otherwise there would be nothing in which we could ground our understanding of symbols, but it raised the question for me of whether these distinct aspects of sensing and reacting are equally important to contextualizing one’s understanding of words and their meaning.
ReplyDelete
Replies
Can KökOctober 3, 2023 at 4:46 PM
According to my understanding of the Harnad (2003) paper, symbol grounding problem explains the relationship between symbols (e.g., words) and their referents, in which a symbol system by itself does not have the capacity to “pick out their referents”. This is when grounding takes place, in which a symbolic system “would have to be augmented with nonsymbolic, sensorimotor capacities”. Thus, grounding necessitates sensorimotor capabilities. It can either be direct (sensorimotor) or indirect (linguistically). Does the symbol grounding problem suggests that categorical understanding is necessary for grounding? From my understanding, one needs to have sensorimotor abilities to interact with external objects to be able to categorize things, which inherently links the symbol to its referent? If categorization is the necessary condition in which a word (or any symbol) gets grounded, then how can meaning be achieved? Is it only left to the hard problem (like Searle’s CRA)? Meaning can only be achieved through a FEELING of understanding combined with sensorimotor capabilities that allows symbol grounding. Thus, even if we have a perfect T3 with sensorimotor capabilities, we don’t know the answer to “does it really understand”, since that is what the hard problem is all about, which intuitively seems impossible to make any progress with?
ReplyDelete
Replies
InstructorOctober 3, 2023 at 5:46 PM
Not quite; it feels like something (i.e., conscious) to understand the meanings of propositions (language) and to know the referents of their words. But that's only an unexplained correlation until someone solves the Hard Problem.

"Mind" is a weasel-word. Words are in the head: what is grounding?
ReplyDelete
Replies
InstructorOctober 3, 2023 at 6:37 PM
Valentina, yes, sensing is needed to sense -- but why is feeling needed? Ask ChatGPT to define the difference and let us know. I suspect a lot of weaselly mumbling (along with the usual notion that "feeling" is just emotion.
Adrienne, yes, to understand and to mean feels like something. But for categorization, reference and language, why isn't DOing enough?
ReplyDelete
Replies
Megan LauenerOctober 3, 2023 at 9:34 PM
After reading the Wikipedia text, it is my understanding that the symbol grounding problem refers to the problem of connecting symbols such as words to what they represent in the world. It addresses the issue of how symbols adopt meaning that is represented in the physical world. It was explained that words on paper would be ungrounded because there is nothing connecting them to their referent, whereas the meaning of a word in a person’s head would be grounded. This made me think of the philosophical thought experiment that questions if a tree would make a sound if no one was there to hear it. It is similar in the sense that there is no meaning to a word if there is no mind to mediate the connection between that word and what that word is supposed to represent in the world.
ReplyDelete
Replies
Marie HughesOctober 4, 2023 at 9:22 AM
I first read the Scholarpedia article to get a good understanding of the basic concepts of the symbol grounding problem. That to not have an infinite regress we would need to make connections not just between words, but between words and their real-world referents makes complete intuitive sense to me. I can explain to you in words the properties of an apple till I'm blue in the face, but unless you can make a connection between those properties and real-world objects and in fact point at a real apple we are just going in circles. What is an apple? a red round fruit. What is red? Our perception of a certain wavelength of light. What is light? ect. My only question arises when we go to more abstract concepts, things that don't have referents in the real world. How do I ground justice? I think I know what it means but I can't point to it in the real world only examples of just acts which doesn't seem to be the same thing.
ReplyDelete
Replies
Marie HughesOctober 4, 2023 at 9:59 AM
The second reading I did was Prof. Harnads' 1990 paper "The Symbol Grounding Problem." I understand the rejection of a purely 'free-floating symbolic system' as such a system divorces the symbols from their referents. My understanding is that we reject pure connectionism, as it is not systemic. The best it can do in terms of meaning is taxonomy, which is certainly not all we want to explain. So we propose we create symbols based on the iconic and categorical representations we get from connectionism, which then allows us to combine them while still being able to keep them grounded. I wonder then if this isn't a part of how chat gpt is so good unexpectedly. It seems possible that if we build the symbolic system up from grounded connections, then the symbolic system might somehow, if one could look at the entire picture, through its form, point to these grounded connections. The how of that I am completely unsure of, however.
ReplyDelete
Replies
Willem RosenbergOctober 4, 2023 at 10:03 AM
The symbol grounding problem succinctly captures a pervasive problem in cognitive science that I’ve never seen so clearly laid out until now. I find that a lot of pondering about cognitive science is an exercise of catching ourselves when falling into pits and discerning that they are often, in fact, the same pit.

I have a question reguarding clarification on why we deny that adding arms and wheels to (for example) adding wheels and a camera to ChatGPT would not instantiate a T3 passer. What I want to focus on in this example is the arbitrary-ness of what could pass T3 and when.

*If you were to build a pile of sand grain by grain, painstakingly picking up each grain and putting it in the same spot you’d know at each step exactly how many grains their were, so to you it is not just some abstract pile it is 7326 grains of sand, but to some new naive observer they could be pointed out the grains of sand and just say “yup that’s a pile of sand”. This to me is analagous to the incremental T3 discussion: it seems as though we do not want to believe that ChatGPT will ever pass T3 because we are there at each stage of adding each grain of sand. But someone coming along and not knowing how many grains of sand there were would obviously say that it is a pile (passes T3)*
ReplyDelete
Replies
Paniz KhamooshiOctober 4, 2023 at 11:59 AM
A part of the Scholarpedia reading that I found particularly interesting was the precise definition of a “symbol”, which clarified a lot for me in terms of what can, and can’t, have meaning in the first place (without trying to explain what that meaning is itself). It is crucial for a symbol to be part of a symbol system, as a single symbol, in a vacuum, is not useful. In that sense, although we don’t know the meaning of the components of a symbol system themselves, would it be safe to assume that we know their meaning in a relative sense? For instance, a “0” only makes sense when there is a “1” to which it can be compared, and differentiated from. This can be synonymous with the definition of categorizing, which from my understanding, can also rely on relative meanings. Now, this doesn’t answer the question of objective meaning, or the “grounded” meaning that we are so concerned about in the field of cognitive science, but as long as we can categorize symbols based on their relative meaning in a symbol system, is that not sufficient for cognizing? It may be true that this might be cheating (as in the case of chatGPT and the “big gulp”, associating a whole bunch of words with each other without actually having had the grounded experience of the word), and a form of “Zombie” cognition, but it could potentially still be able to pass the Turing test without having had the grounded experience of each word.
ReplyDelete
Replies
InstructorOctober 4, 2023 at 1:27 PM
Joann, you write that:

"ChatGPT seems to have acquired a sense of grounding without sensorimotor capacities by processing a massive amount of human textual data."

What do you mean by "grounding"? Direct grounding is sensorimotor, and ChatGPT (T2) has no sensorimotor capacity (T3).

(What I have changed my mind about (if ChatGPT has passed T2 and the "Big Gulp" of text data is not cheating) is that ungrounded computation (algorithmic symbol manipulation) can pass T2: "Stevan Said" only a grounded T3 could pass T2.

"How can Large Language Models (LLMs) be so good at manipulating language in a highly meaningful way without any grounding?"

Good question. But it's only meaningful to us, users, not to LLMs. They don't mean or understand a thing. They are ungrounded and cannot recognize or interact with the referents of their own words.

"Do LLMs gain an indirect sense of grounding through the syntactic formal structure?"

LLMs don't have any kind of sense, direct or indirect. Please see the replies to the other comments on direct (sensorimotor) grounding vs. indirect (verbal) grounding. The only way to gain the capacity to ground referents indirectly is bottom-up, from the sensorimotor ground (Chapter 8).

The symbol grounding problem will be solved when Cogsci successfully reverse-engineers T3 capacity. All it has produced so far is ungrounded toys.

Aya, Searle's argument was conditional: Even IF computation alone could pass T2, THEN it would not understand language.

ChatGPT (if it can pass T2 and the Big Gulp is not cheating) would show that ungrounded symbol manipulation can pass T2, hence that it can "manipulate language... in a way that is meaningful to a human reader", and can do that interactively, indistinguishably to and from a human. That's the essence of T2.

But that does not reverse-engineer and explain language capacity for any system that has not swallowed the Big Gulp. And we still don't understand how the Big Gulp enables ChatGPT to do what it can do. The computation alone (the algorithms) does not explain it.
ReplyDelete
Replies
InstructorOctober 4, 2023 at 2:06 PM
Joann, about "vectorization" see 'Vector Grounding'.
ReplyDelete
Replies
Delaney RiehleOctober 4, 2023 at 4:04 PM
(skywriting 1) Chat GPT typed out that the difference between sensing and feeling is that “sensing is the objective process of perceiving external stimuli through the senses, while feeling is the subjective experience or emotional response that arises from the interpretation of sensory information.” You were right about the usual notion that feeling is just an emotional response. From what I gathered from the 2003 reading, feelings are needed in order to distinguish that we are in a state of meaning rather than just in a functional state; whereas, sense is utilized in terms of grounding and is more technologically focused, is more of an input output function that can merely be created, but feeling cannot be made through inputs and outputs.Additionally, I gathered sense has more to do with symbols in comparison to feeling.
ReplyDelete
Replies
Delaney RiehleOctober 4, 2023 at 4:21 PM
(skywriting 2) After reading and dissecting the Harnard (2003) paper, it seems to me that the concept of formal symbols is a concept created by our minds rather ta]han by the system itself. While the formal systems have rules and shapes that perpetuate symbols, without our minds, it does not hold any value or meaning. In the next section it refers to language as another example of a formal symbol system, and this reminded me of last week’s reading that mentioned language versus communication and how they relate to one another. Language is a tool to use in order to communicate and hold little to no value without our minds and knowledge used to understand and manipulate these symbols. The section “Natural Language and the Language of Thought” prompted a question. Does it explain that in order to be grounded, the symbol system must be able to hold meaning and relevance without the use of a human mind? Or that it has to be more fitting to the things the symbols are describing?
ReplyDelete
Replies
Can KökOctober 4, 2023 at 4:57 PM
There are direct (through sensorimotor capacities, trial-error, corrective feedback) and indirect (language) ways of grounding and learning about categories. I was wondering about the particular differences between these two ways, and do they operate in a completely distinct way? We can’t observe or interact (through sensorimotor capabilities) with more abstract concepts like “good” or “evil”, but we seem to be able to categorize them. Is this fully based on language, or verbal explanations from another person, or can there be a role of trial-and-error learning or corrective feedback in the linguistic learning for categories? Is it solely dependent on the other person teaching us the categories? I was thinking about Thorndike’s Law of Effect and was wondering if something similar (?) to that can play a role in the indirect way for grounding? Or does that only apply to observable things we can interact with via sensorimotor abilities? Can we learn linguistically through making errors and correcting them/repeating and strengthening correct responses?
ReplyDelete
Replies
Zoe YurmanOctober 4, 2023 at 7:45 PM
After reading a couple of the texts on the Symbol Grounding Problem written by Professor Harnad, I took a look at Steel’s chapter on representation and meaning. One section in particular, which reviewed a child’s image of a city bus, got me thinking. In the image, certain features of the bus are exaggerated or focused on, while others are completely discarded or misrepresented (e.g. the bus is very tall with many windows, however there are over twenty wheels). Steel states that this depiction emphasizes aspects of the bus which have been key characteristics based on the child’s experiences. The bus being large, having many more windows than a regular car, and featuring a conductor who takes tickets are likely all parts of the child’s experience riding a bus which stand out. I think this same idea translates to how people use symbols to represent the world, as the characteristics or details which tend to stand out to us are more prominent in our symbol system and how it is used. In the English language, for example, there are 3,000 words describing emotions, suggesting that the importance we place on human emotion is represented by the vastness of our emotional lexicon. Furthermore, when describing an experience of mine, I may use language which conveys certain details about things that matter to me and disregard others. If I go for dinner where I eat my favourite pasta and drink a glass of water, I may use different words to emphasize each depending on if I was very thirsty or very hungry beforehand. In this way, symbol grounding is not just grounded in our external world, but our internal response to it as well.
ReplyDelete
Replies
Nicole Vilkoff (Nico)October 5, 2023 at 1:03 AM
I am trying to think of any other way to ground “apple” to its referent. Drawing apples is not enough as this is just a simulation. As mentioned, explaining “apple” would be similar to the dictionary example, potentially causing infinite regress as other symbols and meanings are referred to. However, if every component of the explanation is grounded in the learner’s head, then they can build indirect grounding.
An essential part of grounding is the ability to discriminate between different inputs and correctly categorize them (different apples can both be apples, fruits are not all apples). By categorize I mean assign the objects into groups of a higher level of abstraction (less specific) in order to do the right thing with the right kind of thing. Perhaps another way to ground would be to show/explain two objects/concepts, and have only one of them be familiar. For example, showing or explaining a pear (grounded) and apple (not grounded), and saying “one of these is a pear and the other is an apple”. In this scenario, the apple would become grounded indirectly through elimination.
These readings made me wonder, are there levels of grounding? And at which point do we understand a meaning? For example, surely someone who has tasted and seen an apple is more grounded for “apple” than someone who has only seen an apple. Does someone need the full experience to be grounded? And if not, how much understanding is enough for grounding? My guess for the minimum would be: one sense's input and the feeling of understanding.
ReplyDelete
Replies
Jiajun ZhuOctober 5, 2023 at 2:21 AM
The below text is compressed and modified a bit by chatgpt.

Grounding is spotlighting a key flaw in modern AI systems. Although neural networks (NN), especially ones demonstrating human-like capabilities like DeepMind's GATO, remain predominant, which may considered as T2 to some extend, they still tread a path fraught with scientific inaccuracies for cognitive science even Harnad raised the issue in 1987.(Harnad, S. (1987b) Category induction and representation. In S. Harnad (Ed.) Categorical perception: The groundwork of Cognition. New York: Cambridge University Press) Despite their ability to process inputs into numerical vectors, used in functions to derive decisions, these systems symbolize a core challenge in making meaningfully conscious AI. Harnad emphasized that symbol systems lack meaning without consciousness, presenting a significant obstacle in reverse engineering since they don’t physically correlate with reality, operating strictly in a statistical domain without addressing real-life complexities. The crux involves surpassing mere calculation and statistical functionality in NNs to genuinely bridge AI decision-making with physical robotics and their environment, thereby navigating beyond the existing confines of unified, yet meaninglessly vector-oriented, processing. This transition beyond pure statistical operation to fully proceed sen the Harder problem in evolving AI systems and T3 TT, and Harnad also makes his explanation and prediction in the state-of-art vector grounding paper.
ReplyDelete
Replies
Siyuan ChenOctober 5, 2023 at 2:48 AM
The statements in “The symbol Grounding problem” impresses me ：“The expectation has often been voiced that "top-down" (symbolic) approaches to modeling cognition will somehow meet "bottom-up" (sensory) approaches somewhere in between. If the grounding considerations in this paper are valid, then this expectation is hopelessly modular and there is really only one viable route from sense to symbols: from the ground up. A free-floating symbolic level like the software level of a computer will never be reached by this route (or vice versa) -- nor is it clear why we should even try to reach such a level, since it looks as if getting there would just amount to uprooting our symbols from their intrinsic meanings (thereby merely reducing ourselves to the functional equivalent of a programmable computer).” (Harnad)

In my understanding, top-down processing and bottom-up processing are two types of processing.And these two kinds of processing cannot be carried out alternately.

Top-down processing starts with expectations & context to help sense/interpret incoming data stream; while bottom-up processing starts with distal stimulus or sensory data and build up representation. It is very strange that when two processing will meet together at some point and it doesn’t make sense. If they meet at one point, that might be only the stage of “perception” and “thought process”; however, it is not meaningful because they will be back to none since the end point on their journey is the opposite. End point of top-down processing is the starting point of bottom-up processing, vice-versa.
ReplyDelete
Replies
Siyuan ChenOctober 5, 2023 at 2:49 AM
I have some thoughts on this sentence from “Symbol grounding problem” :
“The problem of discovering the causal mechanism for successfully picking out the referent of a category name can in principle be solved by cognitive science. But the problem of explaining how consciousness can play an independent role in doing so is probably insoluble, except on pain of telekinetic dualism” (Harnad)

Modern technology can actually perform batch tasks in a short time. Whether it is classifying or sorting data, machines can complete the tasks. However, the symbols in the article require the machine to connect to the next point by itself to complete the task, but there is a problem, consciousness. Machines want to connect symbols, but current technology is limited to human control. Humans act as the consciousness of the robot to help the machine complete its tasks.
I think this is not a bad thing. Although the speed of completing the task will not be reached, making the machine conscious is a difficult problem according to current science and technology. Secondly, if machines are conscious, they will pose a certain threat to human life. They will slowly develop their own independence and try to escape from human control. Regarding consciousness, and whether machines should have it, harm is more than the good.
ReplyDelete
Replies
Jiajun ZhuOctober 5, 2023 at 3:40 AM
Professor Harnad's notable contribution to artificial intelligence philosophy, notably the Symbol Grounding Problem (SGP), not only critiques the limitations of contemporary AI but also highlights the critical role sensors play within cognitive systems. The prevalent strategy in AI learning currently involves converting natural language into a machine-readable programming language, an approach inherently subject to entropy losses - with entropy referring to total informational content in this context. AI demonstrates that directly translating external systems into symbols without perceptual intermediaries results in informational loss, particularly when the input is in human language. Introducing conscious awareness to this process, the substantial informational loss during reverse engineering becomes catastrophic, a problem deeply rooted in grounding issues. Moreover, contrary to what is often assumed in numerous simulation environments, real-world agents do not receive neatly structured input vectors but are rather subjected to a ceaselessly variable stream of sensory stimuli, largely dependent on the agent’s current behavior. Additionally, research indicates that when GPT-4 is given a prompt, allowing two instances of GPT to converse, they begin to recycle their responses after a finite number of exchanges, unable to break free from the existing semantic space. Hence, Language Learning Models (LLM) intrinsically lack meaning, with humans being the ones assigning meaning to LLM’s language outputs.Therefore, an ungrounded LLM does not have any practical significance, and may even fall into a philosophical trap.
ReplyDelete
Replies
Anais RubsamenOctober 5, 2023 at 10:32 AM
The difference between grounding and meaning is essential to understand what the symbol grounding problem really is. Grounding is a bridge between the sensory input we get from the external objects we interact with, and the internal symbols (words, pictures) our brains have stored. In the context of language learning, let’s say you know that “piros” means “red”, and “zöld” means “green”. Those symbols become grounded. If I then point towards a red apple saying “piros alma”, and a green apple saying “zöld alma”, it is possible for you to infer and understand, using contextual cues, that “alma” means “apple”, which then becomes grounded. If neither words were grounded, I could tell you to point at the green apple saying the string of words “zöld alma” without you understanding what it means. Therefore, although groundedness is necessary for meaning, it is not sufficient for there to be meaning.
ReplyDelete
Replies
Stephen CresswellOctober 5, 2023 at 12:51 PM
Post #1: Harnad, 1990
I very much appreciated the clear explanation of what a symbol system is, I thought it made it much easier to understand how connectionist approaches differ from symbolic ones. We must answer the question: how does language gain meaning such that it is part of the language system itself and not external (only has meaning via some interpreter). Without connecting these symbols to the real world, they would be meaningless, Harnad points out it'd be like trying to learn the meaning of a Chinese word by looking it up in a Chinese-Chinese dictionary. Harnad argues that any understanding of a semantic interpretation of a formal symbol system comes from the reader, not from the symbol system itself and this point of view is supported by Searle's CRA. Harnad points to a common symbolist which is "just give the formal symbol system wheels and a camera" and in the footnote claims this argument is homuncular (I'd love an explanation why) and that it trivializes the symbol grounding problem. Instead he proposes a hybrid non-symbolic/symbolic system where the symbols are grounded either directly through sensorimotor experiences of by their relation to other symbols which have been grounded in sensorimotor experiences. If I understand correctly, these experiences give rise to categorical and iconic representations of words which ground their meaning in the real world. I'm really not sure I understand this part. What are these representations? Where are these representations? How do they give meaning to the symbol system that is not external? It seems to me that the forming of these categories and icons could be accomplished computationally (symbolically) so all you would need is access to the stimuli but Harnad claims that just adding wheel and a camera is a homuncular point view. I feel as though I've totally misunderstood something or missed something so any help that could be provided would be much appreciated!
ReplyDelete
Replies
Kareem AbualiOctober 5, 2023 at 2:54 PM
I found Prof. Harnad's 1990 reading very interesting. Particularly that the most promising answer to the symbol grounding problem involves a synthesis of both symbolic and connectionist systems. Connectionism helps us understand how exposure and feedback enable the association of objects with symbols through learning, relying on consistent patterns of sensory projections. On the other hand, the symbolic approach demonstrates that once we establish a foundation for elementary symbols in this manner, the remaining symbol combinations in a natural language inherit the grounding from these foundational symbols. When these apparently conflicting approaches are combined, they create a hybrid system that stands as the most viable solution in addressing the grounding problem.
ReplyDelete
Replies
Kareem AbualiOctober 5, 2023 at 3:13 PM
Prof Harnad's 2003 article about the symbol grounding problem has deepened my understanding of its connection to many of the topics we've looking at. The crucial attribute enabling symbol grounding is the brain's ability to identify their corresponding real-world entities. This process requires our sensorimotor capabilities, enabling autonomous interaction with objects and properties in a manner that aligns with how humans interpret symbols as referring to these entities. For a symbol to be grounded, it must establish a connection to the entities it represents, ensuring that our sensorimotor interactions with the world align with the symbols' intended interpretations. A connection based on trial and error induction guided by the feedback from the reward of a correct categorization.
ReplyDelete
Replies
Aimee Tran Ba HuyOctober 5, 2023 at 3:20 PM
I am wondering whether looking at the symbol grounding problem from the field of developmental psychology would be helpful. Maybe looking at child development, and access how a child can learn how to relate language to tangible objects at first, and subsequently more complicated concept, would give us some clues about the underlying process.
ReplyDelete
Replies
Jordan WebsterOctober 5, 2023 at 7:15 PM
All three articles seem to presuppose that all referring words have referents. For example, the scholarpedia article says "[s]urely all the (referring) words on this page, for example, have meanings, just as they have referents". But I don't think that we can take this for granted. I align myself more with Chomsky's view on this matter, namely that NO word has an objective referent (which I believe is the important criterion of "referring" here, since we are talking about the Symbol Grounding Problem, I will explain more later). The example of Aristotle's that I remember is the word "house": part of the word house is the physical components of the house, but the "form" of the house that we really just construct in our head is also necessary for it to be a house. The physical realization of the house does little to restrict what us humans think of as a house, the house could be anything as far as physical reality is concerned, even "a paperweight for a giant" (if giants were real) as Chomsky says. Or equally, think of the word "river": you can never cross the same river twice because its physical composition is constantly changing, but of course, we humans think of it as crossing the same river twice. The same goes for "Tony Blair", whose cells are constantly dying and being replaced, and for any living organism, and for really anything at all that a word in a natural language can pick out. Words aren't really grounded to referents at all, and just like meaning, referring also seems to be dependent on internally constructing what a thing is since anything that is out there in the world cannot be picked 1-to-1 by a word in human language.
ReplyDelete
Replies
KrOctober 5, 2023 at 9:12 PM
in section 3.3 of the 1990 Princeton university reading the example of combining horse and stripes to form zebra is laid out. this example helped me to think that what splits names and symbols and how we identify a zebra is not just the stringing together of the concepts horse and stripes. if I'm being picky about it: the stripes of a zebra really aren't just flat out stripes (like the ones on a mime t shirt). while we can infer that if we see a horse the stripe-like pattern of a zebra it is probably a zebra (because horses don't typically have stripes) we cant just look at a horse painted in parallel horizontal stipes and say this is a zebra and still be right
ReplyDelete
Replies
KrOctober 5, 2023 at 9:21 PM
reading the scholarpedia article, made me think about referents which are less grounded than others. for example: we can point to the apple and say apple and mean "fruit with core that I can bite into", we can correctly call an apple an apple and proceed to do the right things with an apple and that's hard enough to work out as it is. but what about things which are more abstract like "courage" or "belief" and how do we categorize these referents such that we can also do the right things with them. not sure.
ReplyDelete
Replies
InstructorOctober 5, 2023 at 10:09 PM
Nicole, excellent questions and answers. But the trouble with your hybrid grounding by elimination [saying or showing a grounded category + showing an ungrounded one] is that you can't show an ungrounded category! You can just show a membertell its distinguishing features, then you've made it almost all instruction (telling). Do you see that?

But your ideas are good, and dictionaries (and instructors) sometimes do hybrid definitions, not just describing features verbally, but also showing an illustration.

Your question about levels is good too. First, not only all definitions but also all learned feature-detectors are approximate (just as theories are). The approximations can be tightened, but never exhaustive (except in formal maths and logic: necessary and sufficient conditions). We'll talk about this more in Week 6, on categories.

There is also a hierarchy in dictionary definition space (Week 8) like the one you try to describe. (J is defined in terms of features H and I, which are defined in terms of features D, E, F, G etc. These are definitional distances in indirect verbal grounding. But it can't be indirect grounding all the way down; some categories and their names have to be grounded directly (by trial and error, like on the mushroom island, to find the distinguishing features; and to tell those to someone too, they too need to become learned, named categories, like the features of the edible mushrooms).

And there is another hierarchy even in direct grounding, as you ground more and more general categories: apples, fruit, food. The higher-level, more general categories share fewer and fewer of the sensorimotor features (that's how they become more "abstract"). But that does not mmean they become less sensorimotor

And to make it (seem) more complicated still, generalization and feature-abstraction is not strictly hierarchical in one upward direction: You can have apple, fruit, food; or you can have seed, apple, tree.
ReplyDelete
Replies
Andrae WangOctober 6, 2023 at 1:28 AM
Harnad, S. (2003) The Symbol Grounding Problem
The scholarpedia summarizes everything we have learned so far section by section, in a way that is simple and easy to understand. I found this particularly interesting - “But if groundedness is a necessary condition for meaning, is it a sufficient one? Not necessarily, for it is possible that even a robot that could pass the Turing Test, "living" amongst the rest of us indistinguishably for a lifetime, would fail to have in its head what Searle has in his: It could be a Zombie, with no one home, feeling feelings, meaning meanings” If everything in the world is pretty much meaningless until we humans gave meaning to it, then does it really matter if the robot understands the true meaning of the symbols and has grounded experiences if it is still able to connect/relate the information it knows?
ReplyDelete
Replies
Andrae WangOctober 6, 2023 at 1:29 AM
Harnad, S. (1990) The Symbol Grounding Problem.
Compared to the scholarpedia, this article explains the symbol grounding problem in a more detailed manner. Two theories about cognition are introduced: the symbol system - a set of symbols manipulated only based on its shape according to a set of rules, and the connectionist system - “dynamic patterns of activity in a multilayered network of nodes or units with weighted positive and negative interconnections.” To understand cognition, we can start with computation, then use connectionism to explain how the symbols reach its semantics.
ReplyDelete
Replies
Liam VespasienOctober 6, 2023 at 11:19 AM
It seems that the text posits that for computers to exhibit the same kind of understanding that humans do, the symbols used by the computers would have to be grounded. They would need to be capable of identifying referents, and to achieve that, the symbol system would require the ability to interact with the external world. I would argue that, since knowledge is built upon previous knowledge, interaction with the external world would only be necessary for a set of fundamental units of the external world. The symbol system could then perform computations based on these fundamental units.
ReplyDelete
Replies
InstructorOctober 7, 2023 at 11:55 AM
Please explain Searle's Periscope. What does it show, how?

We have not been discussing how perception varies between individuals. Human words are grounded: How?

The HP is explaining "how and why we can feel at all", not "how and why some people feel this and others feel that."

What are Cogsci's EP and HP? And what is reverse-engineering?
ReplyDelete
Replies
Ishan AbhiOctober 7, 2023 at 5:14 PM
While reading the article "Solving the Symbol Grounding Problem," I could not help but become increasingly confused by the Zero Semantical Commitment condition. The first element of the condition states that a solution to the SGP cannot possess any form of innatism. That is, no semantic resources should be presupposed in the artificial agent. To me this seems to be much too strong of a condition, and frankly, one that seems to ignore the role of Darwinian evolution. This is most evident when the paper refutes Sun (2000)'s Intentional Model. Sun states that the AA would first interact with the environment in a random, trial and error way, paying attention to the structure of the environment and the innate biases of the AA. The paper argues that these innate biases already violate the Z condition, and therefore cannot be a solution to the SGP. But this seems to ignore the innate biases that would have manifested through Darwinian evolution mechanisms. It does not seem like it would be possible to ground symbols without some degree of innate semantic resources guiding the interactions the AA is having with the world. It is most definitely the case with us that we possess a degree of innatism in regard to semantic resources.
ReplyDelete
Replies
InstructorOctober 7, 2023 at 7:05 PM
1. Cogsci is just trying to reverse-engineer and test T3 (or T3 & T4) for Turing indistinguishable DOing capacity (and hoping that it can also FEEL, but not being able to test that (because of OMP) or even able to explain, if it does feel, how and why (because of HP).

2. Sensorimotor functions are physical, not just computational.

3. The brain does not "associate" words to their referents, it makes a direct sensorimotor connection, through robotic interaction, manipulation and feature detection. You can't get that, T3-scale, by just adding a camera and wheels to a computer. See again the Sorites reply.

4. The difference between computational and dynamical is the difference between a computer simulation of an ice-cube and an ice-cube.

5. I won't be able to reply to such long commentaries again. Please keep it under 150 words.
ReplyDelete
Replies
InstructorOctober 8, 2023 at 10:25 AM
Yes, there are some categories that are unclear, or that we don't all agree on. All categories (hence their distinguishing features, whether sensorimotor or verbally described) are approximate and provisional, not exhaustive or definitive (except in maths). That's true of categorizing (on the mushroom island) as well as the categories of high-level abstract discourse.

And examples are not to be sneezed at. You can learn by trial and error what most people call "fair" or "just" from positive and negative examples plus social feedback. In that sense it's not just sensorimotor categories, like the mushrooms, whose distinguishing features can be learned either directly, by trial and error, or indirectly, by (grounded) verbal descriptions of their features: You could learn what most people consider "fair" or "just" and "unfair" and "unjust" from examples, by trial and error with social feedback (and perhaps some feedback from your empathy mirror-neurons). But in both cases (mushrooms and equity) a provisional (but grounded) verbal definition from someone who knows to someone who doesn't know is incomparably faster and more efficient (and less risky) than trial and error.

Of course disinformation can make hearsay categories really risky too...
ReplyDelete
Replies
InstructorOctober 8, 2023 at 11:39 AM
See the other replies on this. You're not the only one who is struggling with the HP.

Turing was right that Cogsci cannot hope to do better than solving the EP by reverse-engineering DOing capacity and T-Testing (T2, T3, and T4) to test whether it has succeeded.

This is not just because of the OMP, which prevents testing for the presence of "consciousness" (feeling), because feeling is unobservable (except by the feeler).

Even if an omniscient deity could guarantee that your reverse-engineered TT-passer was not a zombie, we still could not explain how or why. So the HP would not be solved. (But don't ask me if an omniscient deity could solve it!)

We'll get to why the HP is so hard in Week 10. The short answer is that it's because of the solution to the EP.
ReplyDelete
Replies
maria mcguinnessOctober 8, 2023 at 2:00 PM
I believe I understand the basic ideas in the symbol grounding problem, however, I am slightly perplexed by the idea that computers cannot ground symbols. I see how they just manipulate symbols and give output, but couldn’t there be a way to make connection networks and that are tied to a referent and store them, similarly to how we ground symbols to their referents. I may be missing something, but could someone clarify this for me.
ReplyDelete
Replies
InstructorOctober 9, 2023 at 10:22 AM
For sensorimotor symbol grounding a T3 robot needs a body and head, not just a neural net. What the neural net is good at is learning to detect the features that distinguish the members of a category (e.g., apples) from others (e.g., billiard balls) so the T3 robot can do the right thing with the right kind of thing.

If the category has a name, the features ground the symbols that are inside the robot's head in the robot's capacity to detect and act on the referent of the category-name. Algorithms manipulate symbols, but robots manipulate apples. (How much of what's going on inside a T3 robot that is just computation still remains to be discovered by Cogsci.)
ReplyDelete
Replies
Michelle ChenOctober 10, 2023 at 3:48 PM
Extra skywriting: In the 1990 and 2003 readings for “The Symbol Grounding Problem,” Professor Harnad proposes that the symbol grounding problem can be addressed through categorization. “Categorization” involves the appropriate handling of the correct type of thing (Harnad, 2003). More specifically, categorizing skills can be innate or learned, depending on what exactly is being categorized; this ability, done through sensorimotor interactions with the world, acts as a medium for which “grounding” can be achieved (Harnad, 1990). Regarding these parts of the reading, I have a few questions. In this discussion of innate vs. learned categorizing, what would be specific examples of this? Would language acquisition, as highlighted in Chomsky’s Universal Grammar Theory, be an example of an innate ability to categorize? Are there forms of categorizing that could be both innate AND learned?

Categorization can be complex and have many subjective layers. Given the subjectivity that can be involved, to what extent can grounding abilities vary across different people? How might these differences show on a neurophysiological-level? To add, as discussed, categorizing an abstract or intangible concept would involve “indirect grounding.” To my understanding, this means that “grounding” is executed not through sensorimotor experience; instead, abstract concepts would be “grounded” by having to connect to previously “grounded” information. With this in mind, are there limitations to grounding through categorization? In other words, if we know that the human brain is naturally able to create meaning, do these ideas of direct and indirect “grounding” through categorizing really capture the full complexity that may be involved in this process?
ReplyDelete
Replies
InstructorOctober 10, 2023 at 4:49 PM
An example of learnable grammar is Ordinary Grammar (OG) (Week 8 and 9). We can learn that by imitation, trial and error and correction, or verbal instruction.

With Universal Grammar (UG), we don't make any errors, and neither does anyone else. So we can't learn UG by imitation, trial and error and correction, or verbal instruction. But we don't make any UG errors, so it must be innate.

Maybe phonemes like ba/da/ga are partly innate and partly learned.

I don't know what you mean by "create meaning." Indirect grounding just means learning the referent of a word you don't know through words. This can be done from a dictionary definition -- as long as all the (content-) words in the definition have already been grounded for you, either by direct sensorimotor category learning or indirectly, by (grounded) definition.

"Concept" is a weasel-word for; usually it really just means a category, and sometimes something something as vague as an "idea."

There are limitations to grounding word referents' directly, but none on grounding them indirectly, through definition, description, explanation and instruction -- all through words.

Many categories are grounded both ways, by telling (words) and showing (as in illustrations with a dictionary definition.

But you can't learn a category by just showing: why not?
ReplyDelete
Replies
Emma VrignaudOctober 11, 2023 at 12:37 PM
(Skywriting 1) The Symbol Grounding problem, 2003. I really liked this reading. The description of "a symbol system" was particularly interesting to me.
Harnad writes "The symbols are systematically interpretable as having meanings and referents, but their shape is arbitrary in relation to their meanings and the shape of their referents."

I was instantly reminded of the computationalist approach to cognition. The idea that these symbols only have meaning if they are within a system and that their shape is independent from their meaning is an exact parallel to: different parts of the brain have different functions but it does not matter exactly which one. It made me realize the reason we segregate different areas of the brain is in order to create functions.

Because the brain is a unit of sorts, we have to create the system ourselves... I thought it was an interesting thought.
ReplyDelete
Replies
Stefan VujicicOctober 11, 2023 at 1:35 PM
In the realm of robotics, symbol grounding can lead to robots that interact more intuitively with their environment. Instead of relying solely on pre-programmed rules, robots could use their sensory input to ground symbols in real-world situations. This would enable them to adapt to dynamic environments, collaborate with humans, and even learn from their interactions. The practical applications of symbol grounding extend beyond AI and robotics, potentially revolutionizing fields like healthcare, education, and customer service. By deepening our understanding of how symbols are grounded in real-world experiences, we can enhance the capabilities of machines and systems, making them more effective and integrated components of our daily lives.

What are the ethical implications of symbol grounding (if any) would be present in the context of artificial intelligence and machine learning given it is even possible?
ReplyDelete
Replies
Jocelyn WongOctober 11, 2023 at 4:40 PM
Harnad, S. (1990). [The symbol grounding problem](http://cogprints.org/615/1/The_Symbol_Grounding_Problem.html). *Physica D: Nonlinear Phenomena*, 42(1), 335-346.

Harnad tries to reconcile the purely symbolic and purely connectionist models of the mind by combining combining the two into a hybrid model. What stuck out to me was the link back to behaviourism and how this reconciliation echoes its sentiments. This argument is at its core what of the biggest obstacles in cognitive science; we can never definitively say “whether a semantic interpretation will bear the semantic weight placed on it,” for example. The tests we use to form judgements can at most be used to generate an educated guess. Also, who is to say whether a test is passed? In the case of the behavioural test, there is an infinite amount of objects that it is impossible to comprehensively assess whether a semantic interpretation can “discriminate, identify and describe all the objects and states of affairs to which its symbols refer”.
ReplyDelete
Replies
Daniel ZhouOctober 11, 2023 at 5:29 PM
If referring to Searle's Chinese Room, it demonstrated that understanding isn't achieved merely by manipulating symbols, as a program would.

While human words are grounded, for example, the word "red" is grounded in our visual experience of red objects.
It’s because words are rooted in our sensory and motor experiences. This article helped put a term down that I couldn’t put an exact finger down for this concept we’ve been gleaming on since week 2.

The HP of cognitive science is about understanding consciousness—why and how sensations arise. EP (Easy Problem) deals with explaining cognitive functions, like discerning and responding to stimuli.

Reverse-engineering is deconstructing a system
to understand its design and functioning, which can provide insights into human cognition if we see robotics as a reflection of our neural processes.
ReplyDelete
Replies
Nicolas CurtosiOctober 12, 2023 at 12:35 AM
Hey Stefan, I agree with you in that artificial intelligence's ability to symbol ground and utilize sensorimotor interaction with referents would absolutely take it to another level. In regards to how advanced we perceive it to be in its ability to present itself as what is typically deemed 'conscious'. However, I would refer you to Anaïs. T3 passing sensorimotor bot which is capable of symbol grounding using referents of the real world. I don't believe it is possible to prove Anais' consciousness, however, I also don't believe it matters much, given if it appears conscious to me, that is the only way I can judge it, in the same way it is the only way I can truly judge your consciousness.
ReplyDelete
Replies
Nicolas CurtosiOctober 12, 2023 at 12:44 AM
I also had very similar thoughts about the consciousness and the zombie passing TT example. The example of a zombie passing the TT without consciousness, in the sense of lacking subjective experience, implies that a system could exhibit intelligent behaviour and even pass a TT without having consciousness or subjective experiences. This is a strong challenge to the assumption that consciousness is an absolute prerequisite for intelligent behaviour or meaning in my opinion. If subjective experience can be ruled out of consciousness, what is left that defines us as human? It seems as though the threshold to being truly conscious or human drops with every week of class hahaha.
ReplyDelete
Replies

Add comment

CatComConM2023

Monday, August 28, 2023

5. Harnad, S. (2003) The Symbol Grounding Problem

288 comments:

PSYC 538 Syllabus

Report Abuse