Learning Tone Languages: Patterns, Methods, and Insights

Location

Institute for Advanced Computational Science, IACS Building, Stony Brook University, NY 11794, USA

Event Description

Abstract: How do humans learn the sound patterns of their language? Despite a variety of methods and advances in phonotactic learning, there is still a paucity of computational research, methods and data for languages with tones. In this talk, I will explore this question specifically in light of tone languages, where pitch plays a crucial role in distinguishing words' meaning. I provide an implementation of the Bottom-Up Factor Inference Algorithm over Autosegmental Representations (BUFIA-AR), which learns the rules governing possible tone patterns. Using a dataset of Hausa, a West African tone language, the algorithm successfully identifies patterns that are not permitted in the language. These results (i) confirm long-standing linguistic generalizations, (ii) make more specific predictions about exceptional cases, and (iii) reveal previously unnoticed patterns. The results show how mathematical models of sound structure can be brought into dialogue with both linguistic theory and computational learning, highlighting the broader potential of formal approaches to capture human linguistic knowledge.

Bio: Han Li is a fifth-year Ph.D. student in Linguistics department, specializing in computational linguistics under the supervision of Professor Jeff Heinz. Her research focuses on how sound patterns in language can be formally represented and computationally learned, bridging theoretical linguistics and computer science.

Location: Institute for Advanced Computational Science, Seminar Room

Zoom Meeting: https://stonybrook.zoom.us/j/94043459206?pwd=3ra47h8HghOFRfobRBjZaDMyTwialr.1
Meeting ID: 940 4345 9206
Passcode: 332717

Date Start

Wed, 10/15/2025 - 12:00

Date End

Wed, 10/15/2025 - 13:00

AI Innovation Institute

Location

Event Description

Date Start

Date End