lørdag den 17. januar 2015

Studying the Nahuan family with Methods of Molecular Biology

In this blog post I will give a sneak-peek on some ongoing work that I am doing which involves learning the techniques biologists use to build family trees (called phylogenies) for species or other groups of organisms and applying those techniques to understanding the relations between Nahuan dialects.

As we know, languages form "families", which basically means that a group of languages can trace their lineages back to a shared ancestor. Language families appear because most languages split into daughter languages each of which diverge from the parent in a different way. This process is what we can observe when we look at dialects of a single language. Each dialect is a variety of the language that is in the process of breaking away from the parent language and form its own language. Divergence like this happens because all languages innovate, by introducing new ways of speaking, that didn't exist in the parent language.

The same is of course what happens in biological organisms where new mutations accumulate and spread in the DNA of a population causing daughter populations to diverge from the parent population. Biologists working with molecular data analyze how these mutations can be compared and traced so that we can reconstruct the genealogy of an entire family of related organisms by grouping those organisms that share specific mutations together.

Because of each new way of speaking that arises and spreads in a community can be compared to a mutation in the DNA that spreads in a population, we can use the same methods that biologists use to trace biological phylogenies to understanding the divergence of dialects.

Nahuan Dialects:

I am in the process of understanding how all the different regional varieties (dialects) of Nahuatl are related to each other. This would help us understand where the Nahuan parent language (called proto-Nahua by linguists) originated and how it spread. In order to do this I have to find out what the shared innovations that characterize each dialect area are, and then trace them back to build a family tree. I am well underways with this project, and I presented a preliminary version of this at the Meeting of the Friends of Uto-Aztecan in Nayarit this summer (Here is a link to the paper).

But what I am doing now is mapping this information into the same kind of model that biologists use, in order to apply statistical methods to better understand the relations. Doing this allows me to generate nice graphic trees using a program called Mesquite, and this is what I want to let you peek at. 

Here is an example of how each dialect (identified with a three letter code)
is plotted into a matrix of shared innovations.
These are only some of the characters I am using.

The data is entered in binary form: each innovation present in a dialect is entered as a 1, and the lack of a specific innovation is entered as a zero (i.e. when the original form is retained). Each position of 1/0 is called "a character". By listing all the characters that I am tracking, each dialect is identified with a string of ones and zeros (called a Markov chain). And by analyzing all the characters together a phylogeny is formed. For example the dialect area of Morelos is identified with the string: 011110101 - 100000100001 – 1000000000 Where each 1 is an innovation the dialects in Morelos have, and each 0 an innovation they don't have. 

I am working with 22 distinct dialects, most of them are regional, though a few are limited to only a single community. The communities and the codes I use to identify them are: Durango/Mexicanero (DUR). Michoacan (around Pomaro) (MCH), Mexico State (around Toluca) (MEX), North Guerrero (around Coatepec Costales), Federal District (mainly Texcoco)  (DFE), North Puebla (around Huauhchinango) (NPU), Morelos (MOR), Tetelcingo, Morelos (TET), SOuth Puebla (around Tehuacan) (SPU), Zongolica, Veracruz (ZON), Tlaxcala (TLX), Western Huasteca (Hidalgo/San Luis), Eastern Huasteca (Hidalgo/Veracruz), Sierra de Puebla (SDP), Isthmus (Area north of Coatzacoalcos, Veracruz) (IST),  Pajapan, Veracruz (Town in the Isthmus area), Tabasco Nawat (TAB), Chiapas Nahuat (extinct) (CHS), Pipil Nawat of El Salvador (PIP), Central Guerrero Nahuatl (GRO), Oapan (a town in Central Guerrero) and Southern Guerrero Nahuatl (CGR). Each of these have been assigned a specific binary chain, although I still need to double check some of them in the literature to be sure I am assigning the right values, and perhaps adding more characters to the chains.

The preliminary result of my analysis is a tree that looks like this, which is interesting because it has a much more treelike branching structure than most of the previous classifications. Otherwise it does not diverge in major ways from the recent classifications by Canger (1988) or Kaufman (2001). That is partly because I have used their analyses to identify the innovations that have relevance for classification.

The bushiness comes from the fact that I have been able to identify chains of innovations in the eastern branch, where particularly the development of the negation provides a signal that I believ can can be traced along several steps. Previous classifications have been shallow bushes because they have not to the same extent tried to trace independent innovations in different areas. Canger and Dakin introduced the basic split between eastern and western branches, based on a single innovation. I have been able to add a few further innovations to the basic split. But this is about the only deep isogloss in the standard classification - the rest of it is basically a synchronic classification of the distinct areas assigning them to one or the other of the two branches. By looking at the relations between the areas within the two branches I have been able to get a bushier tree.

The tree has two main branches an Eastern and a Western branch. The Western branch is divided into a Periphery (including the dialects of Durango, Michoacan, Mexico state and North Guerrero), and a Center (including the dialects of the D.F. Morelos, North Puebla, South Puebla, Zongolica and Tlaxcala). The Eastern branch is divided into the Eastern Periphery (including the dialects of the Huasteca, the Isthmus, and Central America), and Guerrero (Central and South). The Eastern periphery is subdivided into a Huastecan and an Isthmian branch, with the Sierra de Puebla dialects in a kind of intermediate position between the two.

The next step after double and triple checking the data for the tree is to map each dialect to its geographical correlates and use a phylogeographic mapping program to calculate the probable paths that each community took to arrive to their current locations.

Meanwhile, here is the pretty tree for you to take a look at:

lørdag den 3. januar 2015

Chicolatl not *xocolatl!

Following my last blogpost about the etymology of cacao I got some requests to also write about the etymology of the word Chocolate which like cacao also entered the English language through Spanish and Nahuatl. In this case everyone agrees that the word originates in Nahuatl, but nonetheless there is also a bit of controversy here. Specifically the question is how to analyze its original components and meaning. And as it turns out there is also controversy about the phonological form of the original word. So it gives me another chance to dispel a common misconception about a Nahuatl etymology, which is a chance I can not let go past.

The "Xocolatl" Etymology.

All over the internet and in a lot of published works we will find the claim that the original Nahuatl word was "xocolatl" /xokola:tl/ and that it had the meaning "bitter water". For example here in the Online Etymological Dictionary:

"Chocolate: c.1600, from Nahuatl (Aztecan) xocolatl, possibly from xocolia "to make bitter" + atl "water." Brought to Spain by 1520, from thence to the rest of Europe. Originally a drink; as a paste or cake made of ground, roasted, sweetened cacao seeds, 1640s."

This etymology is repeated so widely that it is even the name of a relatively well known artisanal chocolate company, with a store on Manhattan's Fifth avenue and a large sign that says XOCOLATL.

Well, maybe after they read this blogpost they will want to change that sign, because that is probably not actually what the Aztecs called the chocolate beverage that they liked to drink.
Ki? WTF? 

There are a number of problems with the "xocolatl" etymology:

  1. The word xocolatl does not seem to be actually attested anywhere. Alonso de Molina has "xocoatl - cierta beuida de mayz" (a certain drink made of corn), but there is little reason to think that this drink included cacao. Probably it was water with corn dough turned sour through fermentation. This drink is still prepared by Nahuas in Mexico who sometimes call it xocoatolli, "sour Atole" (found in the RAE as xocoatole). It is and was used for medicinal purposes.
  2. What is attested is a word chocolatl with ch, which is found to day in most Nahuan varieties. For example in Salvadoran Pipil as /chukulat/, in Hueyapan Nahuatl and in Zongolica as /chokolatl/ and in Xoxocotla Morelos as /chikolatl/ (note that here is an /i/ in the first syllable). 
  3. The word xococ does not mean "bitter" (that would be chichik) but rather "sour". And unsweetened chocolate is not sour.
  4. The verb xocoya "to become sour" and the adjective xococ "sour" does not fit the form xocol with a final -l- that is in xocoLatl, and it is difficult to find a way that it could be converted into xocol-. Karttunen suggests it could be through the verb xocolia the applicative of xocoya with the meaning "to make something sour". But it is hard to see how this could be compounded with atl and keep the l. That cannot be done through normal derivational processes. 
So basically xocolatl is a very weakly supported etymology and it is difficult to know exactly where it comes from. It seems that its popularity may have originated with Frances Karttunen's Analytical Dictionary, although she only suggests it tentatively as a possibility when under the entry chocolātl she writes:
 "Possibly the initial element is related to M[olina]'s xocolia "to make something bitter or sour"and to xococ "something sour". The substitution of CH for X is not uncommon." 

So basically it seems that Karttunen suggested it as a tentative possibility and it was taken up by others as if it were a fact. As noted above I am skeptical of the xocolia connection. I also don't think substitution of /ch/ for /x/ is that common, I certainly can't think of any and neither can Dakin and Wichmann when they write of an earlier publication of the same argument by Robel 1904 which they don't include in their bibliography and I haven't been able to track down. They write: "Underlying this suggestion is the argument that the original drink was not sweet. Even so, this etymology is unlikely since /č/ does not change to /š/ in any other term in Nahuatl." (Dakin & Wichmann 2000:61).

So probably it makes sense to look around for a different etymology.

The "Chikolatl" Etymology

It just so happens that in their article on the word cacao Dakin and Wichmann (2000) also gave a treatment of the etymology of chocolate. They argue against the xocolatl etymology and state that more likely the original form of the word is chikolatl with an /i/. This they base on the fact that

  1. chikolatl  is attested in a number of Nahuatl varieties (Morelos (Ocotepec, Xoxocotla), Ameyaltepec, Gro, Cuetzalan Pue, Rafael Delgado, Ver, North Puebla) 
  2. Some dialects have a tendency to change /i/ to /o/ in initial syllables if there is an /o/ in the second syllable. Wichmann and Dakin say this most common in Western dialects but I don't really see that - and neither Ameyaltepec, Xoxocotla/Ocotepec or Rafael Delgado are particularly Eastern varieties in my opinion. 
  3. chikolatl seems to have been borrowed with i into a number of other Mesoamerican languages (including varieties of Mixtec and Zapotec and several Mixe-Zoquean languages)
  4. In Nahuatl there is an instrument called a chi(h)kolli  which is a stick with a hook end. This they argue can be considered similar to the cacao beaters used to make chocolate beverages frothy and foamy in many Mesoamerican cultures. 
  5. The origin of the word would then be chikolatl "water made with a chikol-stick".

But what does Kaufman and Justeson (2007) have to say then?

Given how critical they were of Dakin and Wichmann's article we might expect them to have rejected this proposal as well. But, no. They actually agree that the original form was chikolatl, all though they are unconvinced by the chihkol stick etymology, and contest some of the supposed supporting evidence for diffusion into other languages, and particularly the idea that it is an ancient word diffused into other Mesoamerican languages at an early time.

Kaufman and Justeson actually makes a good argument in favor of seeing /chikolatl/ as the most likely original form. This is because in Nahuatl the sound /ch/ tends to come from a palatalization of /ts/ or /t/ before /i/. So in all cases where we have a /ch/ in Nahuatl, we expect it to have been originally preceding the vowel /i/, even when that vowel is no longer there because it has turned into some other vowel.

The conclusion: We don't know what it meant, but it wasn't "bitter water".

Nonetheless Kaufman, Justeson, Wichmann and Dakin - some of the brightest minds in Mesoamerican historical linguistics actually agree that the word does not come from xocolatl. And these people are not in the habit of agreeing on anything, so let's not cause further problems by second-guessing them here.

The word chocolate comes from the Nahuatl word /chikola:tl/ which meant "chocolate beverage". 

Paraphrasing Martin Luther, that is what we know, and everything beyond that is speculative etymology and belongs to the devil in hell.

No! You're doing it wrong!


  • Dakin, K., & Wichmann, S. (2000). Cacao and chocolate. Ancient Mesoamerica11(01), 55-75.
  • Kaufman, T., & Justeson, J. (2007). The history of the word for cacao in ancient Mesoamerica. Ancient Mesoamerica18(02), 193-237.

torsdag den 1. januar 2015

Cacao: How a Single Word Holds the Key to Understanding the Mesoamerican Past

In this blog post I am going to summarize what is probably the most interesting example of how historical linguistics, in extreme this case the etymology of a single word, participates as evidence in the process of understanding ancient cultures.

A cup of hot Cacao with freshly made chocolate patties
on banana leaves in the back. Tlaquilpa, Veracruz 2014.
The word is question is cacao (or in English cocoa, a mangled version of the former with its own funny history), which describes the fruit of the tree Theobroma cacao, which in the form of chocolate  has provided a lot of deliciousness to people throughout the world.

The significance of this word is such that by determining its etymology, we have an important clue as to what language was spoken by the people who formed the first major civilization in Mesoamerica, the Olmecs.

Who created Mesoamerica?

Because the word has two different proposed etymologies, one that goes back to the deep past of the Mixe-Zoquean language family, and one that traces the word to the earliest Nahuan language, the word was become the lightning rod for the debates pitting the Nahuas and the Olmecs against each other as being the "founders of Mesoamerican civilization". 

Little chocolate figures in the shape of
 Olmec "colossal heads"
Specifically, one group of scholars argue that the Olmecs were a Mixe-Zoquean speaking people who cultivated maize, beans, squash and cacao and exported their knowledge and vocabulary to most of the other language groups in Mesoamerica. In this history the Nahuas originated in the U.S. Southwest and only entered Mesoamerica after the civilization had been established by speakers of Mixe-Zoquean, Otomanguean and Mayan languages. Another group of scholars argue that Uto-Aztecan speakers were among the original founders of the culture area, and that they were among the most politically and culturally important groups already in the formative era (starting about 2000 BCE, and becoming the Classic period around 200 CE), and that they might have been the main group behind the rise of the mega-empire of Teotihuacan (Teotihuacan collapsed around 600 CE). 

Cacao beans still in the pod.
The way civilizations have started in the world is generally through the development of new ways of making food that are easier and more efficient than older ones and which allow for population increase.  In Mesoamerica these innovations were the cultivation of maize from from wild species of grass, and the cultivation of squash and beans. This happened sometime between 3000 and 2000 BCE and led to people becoming sedentary, building villages and later towns with central governments. From around 2000 BCE the first major monuments and monumental architecture begins to be found in the gulf coast region of Mexico, and this culture with its distinct style of stone monuments is what archeologists call the Olmec culture. Its innovations and styles spread across Mesoamerica over the next period, being absorbed into local styles. This has led some archeologist to consider the Olmecs to be the "mother culture" of the Mesoamerican civilization, although others like to consider it only a "big sister" whose good ideas were accepted and used by the other siblings who had developed the main ingredients of civilization independently. 

So what does all this have to do with Cacao?

Cacao in Mesoamerica:

A maya ruler receiving his hot kakaw
 from a servant on a classic period vase.
Across Mesoamerica Cacao has
been associated with wealth and power,
even to the point of cacao beans being
used as a form of currency.
Just like maize, beans and squash, cacao is indigenous to Mesoamerica, and has been cultivated from wild species. But unlike these other species cacao is not a staple crop that provides nutrients necessary for basic survival, rather it is a luxury crop, that provides delicious theobromine for the nervous system at the expense of a laborious process of cultivation and processing. This means that the presence of cacao demonstrates that a culture has moved beyond the stage of bare necessities in generated a surplus. The generation of surplus in turn marks the beginning of class division and hierarchy, and a complex division of labor - the social signs archaeologists associate with the phenomenon we call "civilization". Therefore, signs of the consumption of cacao is a kind of litmus test for when and where Mesoamerican civilization can be said to have begun. 

Vessels tested positive for theobromine from San Lorenzo,
Powis, Cyphers et al. 2011.
As it is, we find the first signs of cacao cultivation and consumption among the Olmecs in the sites of El Manatí, and at San Lorenzo between  1800 and 1000 BCE. The signs are sure that it was cultivated for consumption because archeologists have found ceramic vessels with residue traces of theobromine, the active ingredient of cacao.
So the Olmecs were definitely into the chocolate rush. But what did they call this delicious addiction?

The word Cacao in Mesoamerica:

The Maya word kakaw spelled in
hieroglyphs as KA -KA-WA
The reason not only the fruit cacao is interesting but also the word is that almost all Mesoamerican languages have a very similar sounding word for it. This suggests that all these come from a single source. And if all Mesoamerican languages have their word for Cacao from a single source then it is logical to assume that they received the both word and the fruit it described from the same source as well. So basically it appears that the Olmecs exported both their ideas and their words to the rest of Mesoamerica. The trick then is to find out what language the Olmecs spoke - which is easier said than done since it is hard to determine which of the many languages that have similar words is the original one.

Note how similar the word for cacao is in different Mesoamerican languages (Source Kaufman & Justeson 2007):
  • proto-Zoquean - *kakawa;  
  • proto-Mixean - *kakaw;
  • Nahua - /kakawa-tl/; 
  • Mazahua - /kakawa/; 
  • proto-Mayan, Totonac, Salvador Lenka - /kakaw/;
  • Paya/pech - [kaku]; 
  • Purhépecha: - /khe´kua/. 
  • Boruka, Tol,-  and Honduras Lenka -  [kaw]

So where did the word originate?

The Mixe-Zoque hypothesis:
Cigar-smoking Maya
holding a cacao pod.
In a hugely important 1976 paper Lyle Campbell and Terrence Kaufman proposed that the Olmecs had spoken proto-Mixe-Zoque, the language that became the ancestor of the Zoquean and Mixean languages of southern Mexico. The argument was based on the observation that a number of important culture words could be demonstrated to have diffused  throughout Mesoamerica, and the evidence suggested that the diffusion was from Mixe-Zoquean languages into all the other ones. This was the paper that made the Mixe-Zoque languages, which had earlier been considered inconsequential and hardly even considered, a serious contender for the "Olmec question". Importantly, this argument doesn't hinge on the word for Cacao alone, because also words for other important cultivars such as gourds, squash, tomato and bean were traced to Mixe-Zoque along with several other important cultural concepts such as words related to maize preparation, to religion and to material culture. Previously many had considered it most likely that the Olmec spoke a Mayan language and that there was a direct continuity between Olmec and Maya cultures.

The Nahua hypothesis:
Tlapalcacauatl "red cacao"
from a colonial Nahuatl herbary
In 2000 Karen Dakin and Søren Wichmann proposed specifically that the word Cacao did not originate with Mixe-Zoquean as proposed by Campbell and Kaufmann 24 years earlier, but that it had originated with a Uto-Aztecan language, probably Nahuan. They argued that if proto-Mixe-Zoquean had indeed had the word *kakawa, then most of the dauhgter languages would have irregularly derived forms of the original word. Rather they suggested the word had probably been borrowed by Mixe-Zoque speakers from some other language, leading to the irregularities they observed. They then looked at Uto-Aztecan language and found that many of them have a word for 'egg' that can be reconstructed as *kawa or *kapa. Nahuatl in fact is one of the only languages that do not have a word for 'egg' based on this stem (usually having either totoltetl "bird-stone", or tekwsistli originally meaning conch). They then argue that Nahuan kakawatl could be derived from an earlier Nahuatl word kawatl 'egg' (a word which is not actually attested) and that the reduplication of the -ka- could have originated from the process that makes nouns that are described as similar to what ever is being reduplicated. I.e. kakawatl could have originally meant "that which is similar to an egg". Cacao beans, although physically very different from eggs in color, size and shape, do have a thin shell that has to be removed after toasting, which can perhaps be considered similar to an eggshell. They took this argument to show that Nahuas had been present in Mesoamerica and played a major role already during the classic period, probably being associated with the empire of Teotihuacan, which they considered the source from which the Nahua word had spread to other languages including Mixe-Zoque and Maya. 

In 2001 Jane Hill proposed that contrary to received wisdom which considered the Uto-Aztecan languages to have originated in the U.S. Southwest with proto-Nahuatl speakers migrating southwards into Mesoamerica eventually taking up agriculture and many cultural traditions of the Mesoamericans, instead the opposite had happened. Hill argued that proto-Uto-Aztecan had been spoken in Mesoamerica and that its speakers had participated in develoiping Mesoamerican civilization whereafter a large group of Uto-Aztecans migrated northwards eventually giving up agriculture for a hunter-gatherer lifestyle. Her argument was that in her reconstruction of the language family's history, some maize related words could be recostructed all the way back to the Uto-Aztecan proto-language. 

In 2005 Martha Macri argued that three words in a very early Mayan glyphic text (from 480 AD) could be considered Nahuatl loans into Mayan, and that these words were all related to Cacao. The words in question were WITIK(I) which she related to Nahuatl /witeki/ "to beat something", MULUL which she related to Nahuatl /mo:lli/ "ground spice sauce" (sometimes containing Cacao), and KOXOM(A) which she related to Nahuatl /koxo:ni/ "for a hollow vessel to make a sloshing sound" or /koxo:nia/ "to stir a pot".  I must admit that personally, I find Macri's proposal completely unconvincing because none of the words have more than a speculative relation to cacao, they are not documented in any contemporary Mayan languages as loans, and because there is  not a good fit between the form of the proposed Nahuatl source and the given maya syllables, it seems unlikely that the words would be loaned in this form since they require loaning the verbs as stems and not as fully inflected verbs,  and finally because at least two of them actually have completely plausible Maya meanings and etymologies. I think Macri's proposal can be safely ignored, which most subsequent scholarship has also done. 

Nonetheless throughout the early 2000s there was a general sense that the tide was turning against the Mixe-Zoquean hypothesis and that probably Nahuas had been present in Mesoamerica earlier than was generally thought.  

Mixe-Zoquean strikes back:
But Terrence Kaufman would have none of it. He maintained vigorously that the Mixe-Zoquean hypothesis was solid. First in 1993 he and John Justeson had published an elaborate deciphering of the undeciphered Epi-Olmec (meaning late Olmec) or Isthmian hieroglyphic script arguing that it was written in the proto-Zoquean language. This would seem to be a serious piece of support for a Mixe-Zoquean-Olmec connection.  The decipherment was nonetheless contested by some Maya archeologists who were unable to read a third text using the proposed decipherment (Houston & Coe 2004). 

Doña Cristina, of Tlaquilpa, Veracruz,
 grinding Cacao with the toasted beans
in front and a calabash full of cinnamon.
The chocolate paste comes out of the grinde
and is mixed with sugar. 
In 2007 Justeson and Kaufman published a forceful rebuttal of Dakin and Wichmann's argument. They argued that the proposal of the Mixe-Zoquean as the donor language was solid both because in their reconstruction it fit well with the forms in the Mixe-Zoquean daughter languages and because it was supported by the fact words for many other cultigens could be demonstrated to have been most likely diffused from Mixe-Zoquean. (Wichmann actually agrees with Kaufman that the Olmecs spoke Zoquean and has provided evidence for that himself, so the disagreement for Wichmann seems to be mostly about the presence of Nahuas at Teotihuacan and the specific etymology of the word cacao). They also argued that the proposal deriving kakawatl from egg was untenable. They show that proto-Nahuatl had many borrowings from Mixe-Zoquean, and that even setting the word kakawatl aside all other loans from Nahuatl into Mesoamerican languages postdate the classic period. This seems to establish that there is a general pattern of Mixe-Zoquean influence on Nahuan dating to the classical period, but no general pattern of Nahuan influence on other languages in the classical period. They argue that if the Uto-Aztecan word 'egg' which they reconstruct as *kava had been derived regularly into Nahuan it would not have given /kakawatl/ but /*kaka:tl/ - the derivation proposed by Dakin and Wichmann, they consider to be impossible. Furthermore they show that when other Mesoamerican languages do borrow words from Nahuatl they do so with the absolutive suffix attached, i.e. the Nahua and Mixe-Zoquean forms should have been *kakawat and not /kakaw/. 

In 2009 Justeson and Kaufman published another intense counter argument, refuting in toto, Jane Hill's proposal of Uto-Aztecan originating in Mesoamerica (which had already attracted general dissaproval from Uto-Aztecanists), by showing that her Maize related reconstructions could only be dated to proto-Southern Utp-Aztecan, but not to the common ancestor of all Uto-Aztecan languages. Although Kaufman and Justeson did accept a slightly earlier date for the entry of Nahuas into Mesoamerica than they usually had (originally they considered Nahuas to have arrived around the time of the fall of Teotihuacan, but in 2009 ).  This, coupled with a wide set of other rebuttalls to Hill's theory, again shifted the balance to see Nahuas as latecomers in Mesoamerica, and Mixe-Zoquean speakers as the drivers of the Olmec "mother culture".

Further arguments were presented by Dakin in 2010 arguing for Nahua influence in the Mayan lowlands during the classic period. And in 2010 and 2012 Hill published again an argument to the effect that proto-Uto-Aztecan was a Mesoamerican language. Her maize related etymologies have meanwhile been rejected by scholars such as William Merrill (2012, Merrill et al. 2009), and Lyle Campbell (Campbell & Poser 2008). 

So what is the status? Confirmation Bias and its effects:

Too most ordinary people, including here most archeologists, historians and linguists, these debates simply look like a big pile of "my kung fu is stronger than your kung fu" - hard to make head or tails of.  This is because it requires intimate familiarity with both a vast number of languages, as well as the methods of historical linguistics, to judge which claims are better founded than others. 

I think that in the end, what will make most people side with one of the proposals is probably less about the soundness of the historical linguistics behind the proposals, and more about which story of the Mesoamerican past they prefer for their own personal reasons, whether aesthetics, preferences or experiences with one or more of the ethnic groups or scholars involved etc. 

Humans are funny that way. We are very quickly convinced by any evidence that points us in the direction we already want to go. 

Personally, I count myself on the Mixe-Zoque team for the time being. This may actually be because I know less about Mixe-Zoque than about Nahuatl and Uto-Aztecan, which means that I am more critical of the proposals involving Nahuatl, than of the ones involving languages I know less about. I have found the proposals of Dakin and Wichmann, Hill and Macri to be unconvincing in the ways they stretch meanings and forms to create what to me comes across as speculative etymologies. For example to get from *kava "egg" to *kakawatl "cacao" there are at least two leaps of faith, first the idea that they could be conceived as similar, and secondly the fact that the *kawa "egg" etymon is not attested in Nahuan at all - one such leap I might be willing to accept, but not two. Maybe Kaufman does the same kind of etymological twisting with Mixe-Zoquean, but I am unable to see it, and I am more convinced by his Nahuatl work.

My main point with this blog post is to demonstrate how historical linguistics is far from an exact science, but an interpretative science, where different scholars interpret the same facts differently based on their background knowledge.

And also to show how a single word for a fruit and a delicious beverage can become the key to such a highly political question as identifying the speakers of one language or another as the founders of the Mesoamerican civilization.


  • Campbell, L., & Kaufman, T. (1976). A linguistic look at the Olmecs. American Antiquity, 80-89.
  • Campbell, L., & Poser, W. J. (2008). Language classification. History and method. Cambridge.
  • Coe, S. D., Coe, M. D., & Huxtable, R. J. (1996). The true history of chocolate. London: Thames and Hudson.
  • Dakin, K., & Wichmann, S. (2000). Cacao and chocolate. Ancient Mesoamerica11(01), 55-75.
  • Dakin, K. (2010). Linguistic Evidence for Historical Contacts between Nahuas and Northern Lowland Mayan Speakers. Astronomers, Scribes, and Priests: Intellectual Interchange Between the Northern Maya Lowlands and Highland Mexico in the Late Postclassic Period, 217.
  • Hill, J. H. (2012). Proto-Uto-Aztecan as a Mesoamerican language. Ancient Mesoamerica23(01), 57-68.
  • Hill, J. H. (2001). Proto‐Uto‐Aztecan: A Community of Cultivators in Central Mexico?. American Anthropologist103(4), 913-934.
  • Houston, S. D., & Coe, M. D. (2003). Has Isthmian writing been deciphered.Mexicon25(6), 151-161.
  • Joyce, R. A., & Henderson, J. S. (2010). Forming Mesoamerican Taste: Cacao Consumption in Formative Period Contexts. In Pre-Columbian Foodways (pp. 157-173). Springer New York.
  • Justeson, J. S., & Kaufman, T. (1993). A decipherment of Epi-Olmec hieroglyphic writing. Science259(5102), 1703-1711.
  • Kaufman, T., & Justeson, J. (2007). The history of the word for cacao in ancient Mesoamerica. Ancient Mesoamerica18(02), 193-237.
  • Kaufman, T., & Justeson, J. (2001, March). Epi-Olmec hieroglyphic writing and texts. In The Proceedings of the Maya Hieroglyphic Workshop: The Coming of Kings; Epi–Olmec Writing, March 10–11, 2001, University of Texas at Austin(pp. 93-224).
  • Kaufman, T., & Justeson, J. (2009). Historical linguistics and pre-Columbian Mesoamerica. Ancient Mesoamerica20(02), 221-231.
  • Macri, M. J. (2005). Nahua Loan Words from the Early Classic Period: Words for cacao preparation on a Río Azul ceramic vessel. Ancient Mesoamerica,16(02), 321-326.
  • Merrill, W. L. (2012). The historical linguistics of Uto-Aztecan agriculture.Anthropological Linguistics54(3), 203-260.
  • Merrill, W. L., Hard, R. J., Mabry, J. B., Fritz, G. J., Adams, K. R., Roney, J. R., & MacWilliams, A. C. (2009). The diffusion of maize to the southwestern United States and its impact. Proceedings of the National Academy of Sciences106(50), 21019-21026.
  • Powis, T. G., Cyphers, A., Gaikwad, N. W., Grivetti, L., & Cheong, K. (2011). Cacao use and the San Lorenzo Olmec. Proceedings of the National Academy of Sciences108(21), 8595-8600.
  • Wichmann, S. (1995). The relationship among the Mixe-Zoquean languages of Mexico. University of Utah Press.