The Indus Script – Introduction

(Lothal, according to the ASI)

4000 year back, in Lothal, Gujarat, a merchant walked from his home in the lower town towards the wharf. As he walked past the bead factory, he saw the artisans already at work there; some of these beads were in demand in lands as far away as Mesopotamia. Crossing the public drain and the acropolis he reached the wharf. There were quite a few ships waiting to carry goods to Dilmun, Ur, and Lagash.
He glanced to his left, in the direction of the worker’s barracks, located at the far end of the wharf and quickly walked to the opposite end – towards the warehouse. All the goods were bundled and tied as expected. As he walked to the first bundle, one of his workers put some wet clay on the rope. He took a rectangular seal from the folds of his dress and pressed it hard on the clay. Satisfied with the impression, he moved on to the next bundle. When the recipient got the shipment, he would know exactly what is contained.
There are good reasons to believe that such a scenario could have happened. One probable use of Indus seals was  in economic activity; the seals found in Lothal had impressions of a coarse cloth on their reverse and sometimes several seals were used to mark the goods. Besides this, there is enough evidence of trade relations between the Harappans and Mesopotamia going far back to the time of Sargon of Akkad (2270 – 2215 B.C.E) and even before that to 4000 B.C.E with the find of Baluchistan cotton in Jordan.
Seals
In 1875, Major-General Clark, who was the Commissioner of Awadh, discovered the first seal which had the engraving of a hump-less bull and six signs above it in Harappa[3]. 134 years later we don’t know what is written on those seals; there are many decipherments, but no consensus. While papers are coming out, applying various statistical methods to find out if the Indus seals encode a linguistic system, there is another debate over if these small palm size steatite seals with random looking inscriptions represent Indo-European, proto-Dravidian, Munda or some other language.
Why is decoding the Indus script and language so hard when Egyptian hieroglyphics, Linear B and cuneiform have been deciphered? In the case of Egyptian hieroglyphics and cuneiform there was bi-lingual encoding, like the Rosetta stone, which helped. When it comes to the Indus, which is an unknown script representing an unknown language, we don’t have that luxury. Linear B was deciphered without bi-lingual text which gives hope, but then Indus seals are short. The average length of a seal is 5; the longest single sided inscription has seventeen signs. With such data, deciphering the seal is a hard task.
When it comes to the Indus seals we want answers to these questions:

  • What is the script?
  • What is the language?
  • What is the subject matter?
  • Were the Harappans Vedic people or Dravidians?

Continue reading “The Indus Script – Introduction”

The Markov Model of Indus Script

(From a Tantra t-shirt)

There is a school of thought which believes that the Harappan seals convey something linguistic; after all they had extensive trade contacts with literate Mesopotamia. Thus it is possible that these symbols — found on seals, pottery, terracota tablets — convey data regarding the origin of the consignment or owner. Then there is another school which believes that, yes, the seals had some meaning, but definitely not linguistic; maybe they were political or religious symbols.
As this battle continues — much like the one over the Aryan homeland — a new paper has been published, which analyzes the sequential dependencies between the symbols. In English we know that the the letter “s” is most likely to be followed by “e” or “o” or “u” than “x” or “z”. Similarly in the Harappan seals it was found that given a symbol, only a subset of the symbols could follow it. This order can happen only if there are some rules regarding the placement.

To find out if this model could predict the missing or illegible symbols in a damaged seal, a known data set was intentionally damaged. The model could predict the missing symbol with 74% accuracy. Also analysis of Harappan seals found in Mesopotamia and West Asia found that they were of a different encoding; maybe they represent different subject matter.

Our results appear to favor the hypothesis that the Indus script represents a linguistic writing system. Our Markov analysis of sign sequences, although restricted to pairwise statistics, makes it clear that the signs do not occur in a random manner within inscriptions but appear to follow certain rules: (i) some signs have a high probability of occurring at the beginning of inscriptions whereas others almost never occur at the beginning; and (ii) for any particular sign, there are signs that have a high probability of occurring after that sign and other signs that have negligible probability of occurring after the same sign. Furthermore, signs appear to fall into functional classes in terms of their position within an Indus text, where a particular sign can be replaced by another sign in its equivalence class. Such rich syntactic structure is hard to reconcile with a nonlinguistic system. Additionally, our finding that the script may have been versatile enough to represent different subject matter in West Asia argues against the claim that the script merely represents religious or political symbols [A Markov model of the Indus script]

Now it turns out that Soviets and Finns had done such studies in the 60s and reached the same conclusion: there is a positional order in Indus symbols. What’s unique about this new study is that it uses the Markov model for the first time.
This paper does not decipher the script, but is work which hopefully will lead to an acceptable decipherment. The word “acceptable” is used because there are many decipherments right now, but without scholarly consensus. But what the paper suggests is that the symbols, most likely, encode a linguistic system and not religious or political symbols.

This work, like the previous one , has got extensive media coverage, with even the Time, writing about it. But anything connected to the Indus is controversial and this paper is no different: first the authors were accused of being Tamil/Dravidian nationalists and once that was found to be incorrect, it was about the deteriorating editorial standards in various journals. In response Prof. Dilip K Chakrabarti wrote, “There is a conscious attempt in certain quarters to disassociate this civilisation from the later mainstream tradition of Indian/ Vedic culture.”

Historically, the beginning of this attempt can be traced to the period around India’s Independence when Mortimer Wheeler proposed that the impetus for this civilisation came from Mesopotamia. Earlier, when India was a jewel in the British crown, there was no compulsion to depict it as an offshoot of Mesopotamian or other contemporary civilisations. The early excavators had no problem hypothesising that this civilisation was deeply rooted in the Indian soil and that many of its features could be explained with reference to the later Indian civilisation. [From Indus to India]

Besides this political side show, there is a serious question: is this method sufficient to show that the Indus seals represent a linguistic system? Can such statistical studies prove or disprove that the symbols represent a language.?  The answer depends on whom you ask.
On the other hand, if you believe that the symbols are non-linguistic, there is another question: why would the Harappans send non-linguistic symbols on seals, created on hard to work materials, with a certain syntax to their trading partners to the West and North. What was the relevance and what non-linguistic information did it convey to someone in Mesopotamia?

Smithsonian on Indus Script

Ever since Rao et al. published that the Indus script showed the structure of a formal language, a new debate on the topic was initiated. There were some hostile reactions to this paper. Now, Smithsonian has published an article on the topic which presents the findings in a positive way.

After publishing the paper, Rao got a surprise. The question of which language family the script belongs to, it turns out, is a sensitive one: because of the Indus civilization’s age and significance, many contemporary groups in India would like to claim it as a direct ancestor. For instance, the Tamil-speaking Indians of the south would prefer to learn that the Indus script was a kind of proto-Dravidian, since Tamil is descended from proto-Dravidian. Hindi speakers in the north would rather it be an old form of Sanskrit, an ancestor of Hindi. Rao’s paper doesn’t conclude which language family the script belongs to, though it does note that the conditional entropy is similar to Old Tamil—causing some critics to summarily “accuse us of being Dravidian nationalists,” says Rao. “The ferocity of the accusations and attacks was completely unexpected.” [Can Computers Decipher a 5,000-Year-Old Language?]

A Talk on Indus People and their Script

In April 2009, “a team of Indian scientists reports in Friday’s issue of Science journal that the Indus script has a structured sign system showing features of a formal language.” One of the authors of that paper is giving a talk at IISc on June 9th at 10 am (e-mail from Ranjith).

NIAS LITERARY, ARTS AND HERITAGE FORUM
Cordially invites you to a lecture entitled
Indus People and their script
By
Prof. Mayank Vahia
Tata Institute of Fundamental Research, Mumbai
On Tuesday, 9th June, 2009, at 10.00 am
in
J R D Tata Auditorium,
National Institute of Advanced Studies,
Indian Institute of Science Campus,
Bangalore 560 012

Abstract

Indus Valley Civilisation was the first truly urban civilisation with  several cities with population of 20,000 people or more at its peak. It  flourished in the Western part of the Indian Subcontinent from around 7000  BC to 1900 BC with a peak period of 2500 BC to 1900 BC when it went into a  decline. The hallmark of this civilisation is the miniature seals on which  they produced truly magnificent art work and wrote in small cryptic notes.  Their writing has been enigmatic and since their first discovery some 130  years ago, it is still not clear if it is linguistic writing or not. Our  recent work has shown that not only is the writing similar to linguistic  writing but detailed structure of writing can be clearly seen. We will  discuss the issue of Indus writing in the context of the Civilisation and  our recent work.

About the speaker

Prof. Mayank Vahia is an astronomer at the Tata Institute of Fundamental  Research, Mumbai. After having spent 3 decades in space astronomy  instrumentation, his recent interests in growth of astronomy in India has  taken him to study various aspects of India’s history and prehistory with  special emphasis on astronomy and intellectual growth of the Indian  civilisation.

Hope some of you will be able to attend this talk and blog about it.

Undeciphered Scripts

As the debate over the Indus script – on if it is a language or not and if so, is it Dravidian or Indo-European – continues, New Scientist has an article listing seven other scripts that cannot be read.

Proto-Elamite is the world’s oldest undeciphered script – assuming that it really is a fully developed writing system, which is by no means certain. It was used for perhaps 150 years from around 3050 BC in Elam, the biblical name for an area that corresponds roughly to today’s oilfields of western Iran. It is almost as old as the oldest writing of all, the earliest cuneiform from Mesopotamia. Little is known about the people who wrote the script.[Decoding antiquity: Eight scripts that still can’t be read – life – 27 May 2009 – New Scientist]

Hostile Reactions

In 2004, the Dover, Pennysylvania, school board decided to teach students an alternative to evolution called Intelligent Design.

Because Darwin’s Theory is a theory, it continues to be tested as new evidence is discovered. The Theory is not a fact. Gaps in the Theory exist for which there is no evidence. A theory is defined as a well-tested explanation that unifies a broad range of observations. Intelligent Design is an explanation of the origin of life that differs from Darwin’s view. The reference book, Of Pandas and People, is available for students who might be interested in gaining an understanding of what Intelligent Design actually involves.[Kitzmiller v. Dover Area School District]

Promptly a law suit was filed and an opening witness at the trial was Kenneth Miller, a Brown University biologist and leading proponent of evolution. During the trial he had to face not just the lawyers, but the public as well. Lot of people expressed hostile reactions — via letters, via e-mails, via phone. He was told he would spend eternity in hell. He was told he was not respecting God. He was asked how he could be a Christian and believe Darwin — all from folks who read the book of Genesis literally[1].

Such hostility exists not just between scientists and people who want to enforce their religious beliefs on others, but also between proponents of the Aryan migration/trickle down theory and non-believers. Anyone who opposes the external origins of Aryans can pick one of these labels: “Hindu fundamentalist”, “revisionist” or “fascist”. Any supporter of the external origins of Aryans is either a “colonialist-missionary” or one who harbors “racist-hegemonial” prejudices.[2] Edwin Bryant’s The Quest for the Origins of Vedic Culture: The Indo-Aryan Migration Debate has a great collection of polemical reactions from both sides.

This is one of those debates where even tenured professors do what Jamal did to watch his favorite actor. Also this kind of language is common in Indian History mailing lists where proponents of various theories display juvenile behavior to much amusement. If you think, quite naively, that to demolish a theory you just to counter the interpretation of data, you are wrong. Not in this field. So when a recent paper on Indus script was published, it was countered with the statement (among other things) that the authors of the paper are Dravidian nationalists.

Before 2004, the Rao et al. paper would not have gathered any attention. (Of course the Indus system is a language script! Why are you discussing it?) But that year, Steve Farmer managed to persuade two others — one of whom, Michael Witzel, is a well-respected authority in the field — to add their names to his thesis that it is not a language. The resulting manuscript was absurdly and unprofessionally bombastic in its language, while containing essentially nothing convincing. Regardless of the work of Rao et al, their hypothesis would have died a natural death — but Rao et al do have Farmer et al to thank for enabling them to publish their work, with its obvious conclusions, in a prestigious journal like Science. Farmer et al are so rattled that they promptly post an incoherent, shrill, content-free, ad hominem rant on Farmer’s website. Sproat even shows up on my previous post, leaving a chain of comments that reveal that he has neither understood, nor cares to understand, the argument. [More Indus thoughts and links]

As Kenneth Miller writes in his book,  finally bad science will fail. Intelligent Design was thrown out by the courts since the advocates could not present any peer-reviewed articles or evidence for intelligent design or proof of scientific research or testing. The Aryan Invasion Theory was discredited and discarded and now the Illiterate Harappan hypothesis is being questioned. No amount of polemics can stop that.
Now compare that to a response by Iravatham Mahadevan

References:

  1. Only a Theory: Evolution and the Battle for America’s Soul by Kenneth Miller
  2. A Survey of Hinduism by Klaus K. Klostermaier
  3. The Quest for the Origins of Vedic Culture: The Indo-Aryan Migration Debate by Edwin Bryant

Indus Script: A Formal Language

This picture shows a Harappan seal with five inscriptions or characters, which have been undeciphered. In fact there are many decipherments, but no scholarly consensus. One of the disputes is at a fundamental level: do these markings belong to a language or were the Harappans illiterate?
Finally, in a breaking news moment, we have an answer.

Now, a team of Indian scientists reports in Friday’s issue of Science journal that the Indus script has a structured sign system showing features of a formal language. Using mathematical and computational tools, researchers show that the script has well-defined signs, which begin and end texts, with strong correlations in the order in which the signs appear.[Scientists inch closer to cracking Indus Valley script – Home – livemint.com]

According to Asko Parpola, an expert on Indus seals

“It’s a useful paper,” said University of Helsinki archaeologist Asko Parpola, an authority on Indus scripts, “but it doesn’t really further our understanding of the script.”
Parpola said the primary obstacle confronting decipherers of fragmentary Indus scripts — the difficulty of testing their hypotheses — remains unchanged. [Artificial Intelligence Cracks 4,000-Year-Old Mystery | Wired Science from Wired.com]

Also

J. Mark Kenoyer, a linguist at the University of Wisconsin-Madison, says Rao’s paper is worth publishing, but time will tell if the technique sheds light on the nature of Indus script.
“At present they are lumping more than 700 years of writing into one data set,” he says. “I am actually going to be working with them on the revised analysis, and we will see how similar or different it is from the current results.”[Scholars at odds over mysterious Indus script – life – 23 April 2009 – New Scientist]

Additional Reading:

  1. The original paper: Statistical analysis of the Indus script using n-grams
  2. Indus script encodes language, reveals new study of ancient symbols
  3. Artificial Intelligence Cracks 4,000-Year-Old Mystery