Santa Fe Institute


Computational Methods and Long-Range Linguistic Comparison: Friends or Foes?

Noyce Conference Room
Seminar
12:15 pm – 1:15 pm  US Mountain Time
August 7, 2017
Speaker: George Starostin (Russian State University for the Humanities; Santa Fe Institute External Professor)

This event is closed to the public.

Abstract. Could computers ever replace humans in the daunting task of reconstructing unattested ancestors of modern languages and building perfect phylogenies for present-day linguistic families? A recently published paper (J.-M. List et al., "The Potential of Automatic Word Comparison for Historical Linguistics", PLOS ONE, January 27, 2017) stated that modern automated methods of language comparison have been perfected to such a degree that they can identify up to 80-90% of the etymological matches previously established "by hand" by trained experts. This seemingly implies that algorithms of automated comparison are robust enough to replicate and, if necessary, even replace work that previously depended exclusively on human expertise. The truly important question in this context, however, is not whether computational methods can perform the tasks that human experts have already performed successfully, but whether they can go beyond that level and help resolve issues that have hitherto proven far too challenging for those experts. Have computational methods, so far, succeeded in decisively clarifying any of these issues? In particular, have they proven useful for one of the most pressing and complicated tasks in historical linguistics: the establishment of "long distance genetic relationships" between large groupings of languages, such as the Nostratic hypothesis of common linguistic ancestry between Indo-European, Uralic, Altaic, and several other families of Eurasia? In my talk, I will give a general overview of these issues, explain why the "big data approach" is difficult to apply to them, and point out why manual construction of various types of large linguistic databases is currently a far more important task than improving the algorithmic base for language data comparison.

Purpose: Research Collaboration
SFI Host: David Krakauer