layout: true class: left, top <div class="footer"> <a href="https://twitter.com/khia_johnson">
<tt>@khia_johnson</tt></a> | #Interspeech2021 | Leveraging the uniformity framework | Paper ID 1780 </div> --- class:middle,center ## Leveraging the uniformity framework to examine crosslinguistic similarity for long-lag stops in spontaneous Cantonese-English bilingual speech #### Khia A. Johnson Department of Linguistics<br>University of British Columbia Interspeech 2021 | Paper ID 1780 --- class:middle ## Do highly similar speech sounds from different langauges arise from the same underlying category? <br><br> ??? Crosslinguistic influence is widespread at phonetic level, but most of the work is based on research with phonologically similar sounds that are phonetically distinct. A classic example of this is English long lag k, compared to a Spanish short lag k. Same place in the inventory, different laryngeal gesture. This allows you to physically tell the two sounds apart. How do you asses that things are indeed similar to each other. How do you tell them together? Because things that are the most alike already have the most potential for sharing a mental representation, or quote "being the same." --- <br> ## The SpiCE Corpus ### Speech in Cantonese & English .pull-left[ - Casual interview data - 34 early bilingual talkers - Sample of **9,458** long-lag stops ] .pull-right[ Documentation: <br><https://spice-corpus.readthedocs.io> <img src='images/openaccess.png' height=70> ] ??? The data set here comes from the SpiCE of speech in cantonese and english. It contains casual interviews between bilingual peers in both languages-- so its within-talker data. We collected a sample from the corpus of all the word-initial pre-vocalic long lag stops After filtering for measurement error and that sort of thing, we were left with just under 10k stops --- <br> ## Identifying *uniform* structure in phonetic variation: - Ordinal relationships - Correlations - Mixed effects model --- <br> ## Relatively few talkers show expected ordinal relationship for mean VOT <br> .left[ .big[ | Language | P < T | T < K | P < K | | --------- | -------- | ----- | ------ | | Cantonese | 0.24 ❌ | 0.61 🤷 | 0.49 🤷 | | English | 0.82 ✅ | 0.06 ❌ | 0.47 🤷 | ] ] ??? --- <br> ## Within language correlations show *some* uniformity .center[ <img src='images/corr-bt.png' height=400> ] ??? --- <br> ## Across language correlations show *very little* uniformity .center[ <img src='images/corr-in.png' height=400> ] ??? --- <br> ## The takeaway? It's murky. - Less structure to VOT variation than anticipated - Mixed effect model indicates English T/K have consistently longer VOT - Implications for multilingual representation ???