New AI algorithm is cracking undeciphered languages

Top Image: Rongorongo script is an undeciphered language.

Scientists have created a new algorithm to hunt down similarities in ancient languages and it’s set to untangle the mystery of all undeciphered languages.

According to a new MIT report “most languages that have ever existed are no longer spoken .” The study of lost and “ undeciphered languages ” is made exceptionally challenging as so few ancient records exist to assist common machine-translation tools and algorithms like Google Translate. Since nowhere nearly enough is understood about the grammar, vocabulary, or syntax of ancient languages, many texts remain undeciphered. Without these, an entire body of knowledge about the people who spoke them has been inaccessible - until now says the team from MIT.

Tracking the Evolution of Undeciphered Languages

The team of researchers from MIT ’s Computer Science and Artificial Intelligence Laboratory ( CSAIL) recently created a new computer system that has the ability to “automatically decipher lost languages” without needing advanced knowledge of their relation to other languages - including pauses, punctuation, and inflection. Furthermore, this new system was tested for its capability of automatically determining any relationships between language groups, and in these tests it was established that the Iberian language of Spain is not related to Basque.

One of the five plaques of the Monumento a los Fueros (Paseo de Sarasate, Pamplona). This one was written in 1905 in Basque language and in an adaption of north-eastern Iberian Script.One of the five plaques of the Monumento a los Fueros (Paseo de Sarasate, Pamplona). This one was written in 1905 in Basque language and in an adaption of north-eastern Iberian Script.

In this new project, that was partly funded by the Intelligence Advanced Research Projects Activity, ( IARPA), MIT professor Regina Barzilay explains in a new paper that the system “relies on several principles grounded in insights from historical linguistics” because languages evolve in predictable ways. Dr. Barzilay explains that languages rarely add or omit entire sounds and that certain sound substitutions are likely to occur, for example, words with the “p” sound in the parent language might evolve a “b” sound in the offspring languages, but because of the significant pronunciation gap it is less likely a “p” would become a “k”.

Translating Sounds in the Vast Silence of Cyber Space

By assembling all of the known linguistic patterns, the team of scientists developed a new “decipherment algorithm” that is designed to process and interpret what the researchers describe as the “vast space of possible transformations and the scarcity of a guiding signal in the input.” The new algorithm self-learns by embedding language sounds “into a multidimensional space where differences in pronunciation are reflected in the distance between corresponding vectors.”

What this means is that the new system, or algorithm, enables researchers to isolate language patterns expressing change, and it uses these to form new computational constraints and restrictions, and once these are segmented into words in a lost language, similarities with related languages can be mapped. Basically, it hunts down commonalities in sounds and suggests possible links.

Apart from identifying some signs for numbers, Linear A is still an undeciphered language.Apart from identifying some signs for numbers, Linear A is still an undeciphered language.

Programing the Vampire Phonetic Mirror

Floating in a conceptual cyber-space, the new algorithm acts like a ‘vampire phonetic mirror,’ (my words) in that it reflects any sound structures that it recognizes as similar to others, yet it offers no reflection from unrelated, or unconnected, sounds, (hence the vampire). The system can also identify the proximity between any two given languages and it can accurately determine “ language families .” This is why the team applied the new test (algorithm) on the Iberian and Basque languages, “as well as less likely candidates from Romance, Germanic, Turkic and Uralic families.”

While Basque and Latin were found to be closer to Iberian than other languages, they were still far too different to be considered “related,” and the team of scholars are currently in disagreement on the actual related language, with some scholars claiming Iberian doesn’t relate “to any known language,” according to the new paper.

The MIT researchers hope their connecting ancient texts to related words in known languages, a process known as “cognate-based decipherment,” is only the first step in the creation of a super-advanced system that will ultimately be able to identify the semantic meaning of words, even if it is unknown how exactly these ancient words were originally spoken.

REGISTER NOW

By Ashley Cowie / Historian and Documentarian

Ashley is a Scottish historian, author and documentary filmmaker presenting original perspectives on historical problems, in accessible and exciting ways. His books, articles and television shows explore lost cultures and kingdoms, ancient crafts and artefacts, symbols and architecture, myths and legends telling thought-provoking stories which together offer insights into our shared social history.In his 20's Ashley was based in Caithness on the north east coast of Scotland and walked thousands of miles across ancient Neolithic landscapes collecting flint artefacts, which led to the discovery of significant Neolithic settlements. Having delivered a series of highly acclaimed lectures on the international Science Festival Circuit about his discoveries, he has since written four bestselling non-fiction books. Elected as a member of the Society of Antiquaries of Scotland, incorporated by Royal Charter in 1783, Ashley has been involved in a wide range of historical and scientific research projects which are detailed on this website – www.ashleycowie.com.In 2009 Ashley became resident Historian on STV’s The Hour Show and has since featured as an expert Historian on several documentaries. Ashley’s own documentaries have been watched by an estimated 200 million people and currently air in over 40 countries. NBC’s Universal’s hit-adventure show ‘Legend Quest’ follows Ashley’s global hunt for lost artefacts and is watched by over 5 million viewers in Australia, Asia and Europe every week. In North America, PBS’s ‘Great Estates’ was in Amazon’s top-ten “most downloaded documentaries 2016” and has been watched by an estimated 150 million people.

(Source: ancient-origins.net; October 21, 2020; https://tinyurl.com/y3rx9t5v)
Back to INF

Loading please wait...