Meeting 1er mai 2012 Front National Crédits : Blandine Le Cain
Sciences Po research delves into the elections
22 March 2017
Renewable energy and network structuring in Europe.
11 September 2017

When the Internet sheds light on the transformation of messages

We know that the transmission of messages between individuals is often imperfect and/or incomplete. The transformation of a single word or group of words can change the initially intended meaning. A number of researchers are working to deconstruct the dynamics and effects of this phenomenon.
This is the subject that Sciences Po professor and medialab researcher Camille Roth and Sébastien Lerique, a junior researcher at the École des hautes études en sciences sociales, explored in research they share in “The Semantic Drift of Quotations in Blogspace: A Case Study in Short-Term Cultural Evolution”, which will be published in Cognitive Science.

A new research area: the blogosphere

The study’s originality lies in its use of real situations – an in vivo approach – in its examination of the transmission of messages in the blogosphere. This type of research usually takes place in a laboratory. Thus, instead of studying “fictional” situations where the information is simply passed from one participant to the next during an experiment organized in vitro, the researchers seized an unprecedented opportunity to tackle this issue by processing an immense volume of digital traces.

Little Red Riding Hood. From Childhood's Favorites and Fairy Stories, by Various. Domaine public

When diversity converges: the importance of cultural attraction

The idea guiding Camille Roth and Sébastien Lerique was to test the validity of Cultural Attraction Theory (CAT)*, which posits that cultural attractors and preferential reformulation phenomena explain the convergence of content at the collective level despite the more or less faithful reformulation of statements from one individual to the next.

The adventures of Little Red Riding Hood 

A simple example illustrates this theory: there is no one version of Little Red Riding Hood. This (quasi)-universal fairytale underwent reformulations during its “travels”. The reformulations varied according to the environment: in a given place, one element of the story might have better survived, while another element became secondary or disappeared; in other places these elements were arranged differently. Notwithstanding the different versions of Little Red Riding Hood, a key element remains: to elicit fear. This element is the cultural attractor, because fascination with fear is shared by all of humanity. 

Identifying the transformation of messages on the blogosphere:  methods and results

To study this phenomenon, Camille Roth and Sébastien Lerique chose to explore the transformation of words that are part of quotes reposted by blogs. This choice was primarily made because quotation marks make it easy to identify this type of context via data processing “scripts”. Furthermore, these alterations are necessarily due to human reformulation rather than cut-and-paste.
In order to describe the change in phrases when they are reused, the authors first developed a series of models enabling a reconstitution of the most likely relationships between the different quotes.
Thus, they were able to describe the substitutions and identify the factors influencing them by drawing on standard linguistic variables, such as the age of acquisition of a word, its frequency, its length and the gap between the significance of the original term and that of the substitute term.

Capture écran de résultats sur un moteur de recherche. Public Domaine

Capture results screen on a search engine. Public Domain

From “foolish” to “crazy”
An illustrative example of this process is the transformation of the quotation of a verse by a Burmese poet (Saw Wai). The original poem’s verse is: “Senior general Than Shwe is foolish with power”, but several blogs quote “Senior general Than Shwe is crazy with power”. This case allowed them to make several observations:

  • The authors of this transformation were inspired by the original formulation and did not reproduce reformulations made by others: thus, there was an individual dimension.  
  • The word “crazy” is used six times more often than “foolish” when the verse is quoted: this is convergence
  • “crazy” and “foolish” have more or less the same meaning; “crazy” is shorter; it is generally learned earlier that foolish (at age 5-6 rather than age 8-9). Cultural attractors can therefore be identified: the replacement of the original word by words with very similar meanings, the preference for shorter words and the dominance of words that are acquired earlier.

This experiment confirmed all aspects of the theory of cultural attractors.

The contribution of data science to the study of transformations


data sciences. Crédits : http://www.kiwidatascience.com/

Data sciences. Copyrights : kiwidatascience

On the basis of the encouraging results from this first experiment, Camille Roth and Sébastien Lerique are currently developing a series of experiments to explore more complicated transformations like that of entire segments of a phrase or reformulations of syntax, their mechanisms and their incidence.

More broadly, through this study the researchers demonstrated the feasibility and relevance of data science “experiments” that use web data. They also showed that it was possible to study phenomena as complex as the relationship between microscopic and macroscopic content development processes; and finally, that is possible to establish a link between psycholinguistics, the study of new media and the field of cultural evolution.

* The theory of cultural attractors was developed by Dan Sperber, an anthropologist, linguist and cognitive science researcher, in his book “Explaining Culture: A Naturalistic Approach”, Wiley-Blackwell, Sept. 1996