This is not just distributional information analysis in the sense that ‘tokens’ are grounded in other ‘tokens’. They’ve grounded these calls in naturalistic situational context. This is hard-won data.
If I understand, the finding here is that bonobo calls are “non-trivially” compositional, e.g., the semantic embeddings of pairs of vocalizations point in different directions surrounding the base vocalization. But it seems there is no “trivial” compositionality in the sense that constructions like [good __] might point in a similar direction. I would expect this latter result. This seems like a conspicuous absence? Is this really compositionality? Not sure what to make of it.
Some interesting context: bonobos and other (non-human) great apes are believed to have more intentional and flexible control over their gestural repertoire than over their vocal repertoire, and their gestural repertoires are believed to be larger. Human language likely evolved from gesture (or so some believe). So, if their vocalizations are in fact compositional, it may be a separate evolutionary prong.
Usually it would be called "noncompositionality", the phenomenon that distinguishes words from phrases or sentences. As you might expect from that term, it's the opposite of compositionality. It also goes under the fancier name "the arbitrariness of the sign" or the even fancier "l'arbitraire du signe".
For example, hat, sat, bat, and flat are all semantically unlike each other, and flare, phlegm, flick, and flood are also all semantically unlike each other. It isn't the case that you can predict the meaning of flat by knowing what hat, sat, bat, flare, phlegm, flick, and flood mean.
The article draws an analogy to a general syntactic rule:
> But, so far, only “trivial compositionality” has been identified in non-human animals, whereby each unit adds independently to the meaning of the whole. For example, the phrase “blonde dancer” has two independent units: a blonde person who is also a dancer. Humans were thought to be unique in also having “non-trivial compositionality”, where the words in a combination means something different to what they mean individually. For example, the phrase “bad dancer” doesn’t mean a bad person who also dances.
Bad dancer is compositional in that you can know the meaning of the phrase by knowing the meanings of bad and dancer. So far so good; the article agrees that it is compositional.
They observe that the two words in the phrase aren't independent; a one-legged dancer is a dancer who has one leg, but a bad dancer isn't a dancer who is bad in general; it's a person who is bad at dancing.
The reason this is still compositional is that you can use any appropriate adjective this way: you can be a strong dancer, an energetic dancer, a timid dancer... so this is just a rule about the formation of certain English phrases, and knowing what bad means in general is sufficient to know what bad means in the phrase bad dancer.
They don't appear to be making a similar claim for the bonobos:
> For example, “high-hoot + low-hoot” combines the calls that seem to mean “pay attention to me” and “I am excited” to say “pay attention to me because I am in distress”
This example looks more like an arbitrary sign to me. But that might be an artifact of poor data management:
> they used a technique from linguistics to create a cloud of utterance types, placing vocalisations that occurred in similar circumstances closer together. “We kind of established this dictionary,” says Berthet. “We have one vocalisation and one meaning.”
There is no reason to expect that one vocalisation should possess only one meaning.
They also have poor methodology:
> Once they had this semantic cloud, they could see whether the individual calls in a combination had distinct meanings, and found that the combinations were close to the units that they were made of, which would suggest compositionality. Using this approach, they identified four compositional calls
By comparison, throw, throw up, and throw away are all similar in meaning, which is why they have similar forms, but they are not compositional - knowing what one of them means won't help you with the others.
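To make the objection concrete, here is a toy sketch of the kind of "closeness" test the quote describes. It is only my reading of the description, not the authors' actual pipeline: the call names, the four situation categories, and every count below are invented for illustration, and cosine similarity is my assumption about what "close" means.

```python
import math

# Toy context counts, invented for illustration (NOT data from the study).
# Each vector counts how often a call type occurred in each situation:
# [feeding, travel, distress, play]
contexts = {
    "high_hoot": [1, 6, 5, 0],
    "low_hoot": [0, 2, 7, 3],
    "high_hoot+low_hoot": [0, 3, 9, 1],
    "peep": [8, 1, 0, 2],
}


def cosine(u, v):
    """Cosine similarity between two equal-length count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def closeness_to_parts(combo, parts, table):
    """Similarity of a combination's context vector to each of its parts."""
    return {part: cosine(table[combo], table[part]) for part in parts}


sims = closeness_to_parts(
    "high_hoot+low_hoot", ["high_hoot", "low_hoot"], contexts
)
# Baseline: similarity to a call that is not part of the combination.
unrelated = cosine(contexts["high_hoot+low_hoot"], contexts["peep"])

for part, sim in sims.items():
    print(f"{part}: {sim:.2f}")
print(f"peep (not a part): {unrelated:.2f}")
```

With these made-up numbers the combination sits near both of its parts and far from the unrelated call, so it would pass a closeness test. But throw, throw up, and throw away would pass the same test, since they also occur in overlapping contexts. Closeness shows relatedness, not that the meaning of the whole is predictable from the parts.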
Given his penchant for exploring jumping techniques, it is probably unsurprising that Fosbury would retire to become an engineer!
Cultural evolution (both in terms of materials and practices) is a nice lens to view the history of Track and Field. There are many other innovative techniques in the sport, not all of which became IAAF legal: https://www.youtube.com/watch?v=dXGB51C_dRE
It’s hard not to see this angle as making the capacity for cultural evolution very fortunate for us (or unfortunate for other species). What other animals can exhaust primary food sources and ‘quickly’ pivot to another?
I’ve often wondered how much rituals and social routines for affiliation and bonding drove this capacity (which we often associate with lithic tools), as opposed to the need for food. Perhaps some of this is due to another scarcity: hair. How did hairless apes manage relationships without grooming?
“Huh?” is a strong candidate, if you accept that it is a word. Here is a fun talk on the subject: https://youtu.be/rHHJ3hSppEA?feature=shared
It seems the demands of asking for clarification in conversation shape the word to be easy and fast to pronounce.
They have tried this with juvenile non-human primates! For example, Horner & Whiten (2005) tried this with 2- to 6-year-old chimpanzees, and Clay & Tennie (2017) tried it with juvenile bonobos. Neither group overimitated. They do play, but overimitation is probably underpinned by the ability/proclivity to infer Gricean intentions, which non-human primates lack.
There is a strong normative element to this, as well as the play element you mentioned — I expect, as adults, we’ve all engaged in some form of overimitation as an act of conformity.