layout: true class: left, top <div class="footer"><svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <g label="icon" id="layer6" groupmode="layer"> <path id="path2" d="m 255.9997,7.9999987 c -34.36057,0 -62.21509,27.8545563 -62.21509,62.2151643 0,20.303056 9.87066,38.160947 24.91769,49.517247 0.18814,-20.457899 16.79601,-36.993393 37.29685,-36.993393 20.50082,0 37.11091,16.535494 37.29909,36.993393 15.04533,-11.3563 24.9177,-29.212506 24.9177,-49.517247 C 318.21272,35.854555 290.35915,7.9999987 255.99915,7.9999987 Z M 293.29654,392.2676 c -0.18814,20.4601 -16.79601,36.99338 -37.29684,36.99338 -20.50082,0 -37.10922,-16.53551 -37.29684,-36.99338 -15.04759,11.35627 -24.91769,29.21246 -24.91769,49.51722 0,34.36059 27.85453,62.21518 62.2151,62.21518 34.36056,0 62.21508,-27.85459 62.21508,-62.21518 0,-20.30306 -9.87066,-38.16095 -24.91767,-49.51722 z M 441.78489,193.78484 c -20.30301,0 -38.16309,9.87068 -49.51717,24.91769 20.45786,0.18819 36.99333,16.79605 36.99333,37.29689 0,20.50085 -16.53547,37.11096 -36.9911,37.29916 11.35634,15.04533 29.21249,24.91769 49.51721,24.91769 C 476.14549,318.21327 504,290.35948 504,255.99942 504,221.6394 476.14549,193.78425 441.78489,193.78425 Z M 82.738898,255.99997 c 0,-20.50139 16.535509,-37.11096 36.993392,-37.29689 -11.35632,-15.04756 -29.214164,-24.91769 -49.517197,-24.91769 -34.36057,0 -62.2150945,27.85455 -62.2150945,62.21517 0,34.3606 27.8545245,62.21516 62.2150945,62.21516 20.303033,0 38.160877,-9.87068 49.517197,-24.91773 -20.457883,-0.18818 -36.993391,-16.796 -36.993391,-37.29688 z M 431.3627,80.636814 c -24.29549,-24.295544 -63.68834,-24.295544 -87.9844,0 -14.35704,14.357057 -20.00298,33.963346 -17.39331,52.633806 -0.0824,0.0809 -0.18198,0.13437 -0.26434,0.21491 -14.578,14.57799 -14.578,38.21689 0,52.79488 14.57797,14.57799 38.21681,14.57799 52.79484,0 0.0824,-0.0824 0.13455,-0.18198 0.21732,-0.26434 18.66819,2.60796 38.27445,-3.03799 52.63151,-17.39336 24.29378,-24.29778 24.29378,-63.68837 -0.003,-87.986153 z M 186.2806,378.51178 c 14.57798,-14.57799 14.57798,-38.21461 0,-52.79319 -14.57798,-14.57853 -38.21683,-14.57798 -52.79481,0 -0.0825,0.0824 -0.13448,0.18199 -0.21476,0.26215 -18.67046,-2.60795 -38.276723,3.03572 -52.63376,17.39505 -24.297753,24.29552 -24.297753,63.6884 0,87.98449 24.29551,24.29552 63.68833,24.29552 87.98439,0 14.35702,-14.35703 20.00297,-33.96333 17.39333,-52.63386 0.0848,-0.0786 0.18364,-0.13228 0.26672,-0.21505 z m 0,-245.02583 c -0.0826,-0.0824 -0.18198,-0.13436 -0.26445,-0.21494 2.60795,-18.66823 -3.038,-38.27452 -17.39332,-52.633811 -24.29777,-24.295544 -63.68832,-24.295544 -87.984405,0 -24.297747,24.297781 -24.297747,63.688381 0,87.986151 14.357042,14.35706 33.963315,20.00301 52.631515,17.39336 0.0808,0.0824 0.13447,0.18199 0.21475,0.26434 14.57799,14.57799 38.21684,14.57799 52.79482,0 14.57797,-14.57802 14.57797,-38.21689 0,-52.79488 z m 245.0821,209.89048 c -14.35703,-14.35703 -33.96329,-20.00301 -52.63378,-17.39505 -0.0809,-0.0824 -0.13228,-0.18199 -0.21506,-0.26215 -14.57797,-14.57799 -38.21685,-14.57799 -52.79482,0 -14.57797,14.57799 -14.57797,38.21461 0,52.79316 0.0827,0.0828 0.18198,0.13455 0.26434,0.21505 -2.60796,18.67053 3.03802,38.27683 17.39334,52.63386 24.29552,24.29552 63.68834,24.29552 87.98439,0 24.29775,-24.29552 24.29775,-63.68841 0.003,-87.98451 z" style="stroke-width:0.07717"></path> </g></svg> <tt>osf.io/jhsfc</tt></div> --- class: center, middle # Language contact within the bilingual: ### Adapting a model from sound change to crosslinguistic influence <div style="width: 100%;"> <div style="float: left; width: 18%;" align=right><img src='img/speeconlogo.png' height=140></div> <div style="float: left; width: 64%;" align=center><br>Khia A. Johnson & Molly Babel <br>.small[Department of Linguistics<br>University of British Columbia]<br></div> <div style="float: left; width: 18%;" align=left><img src='img/isblogo.png' height=155></div> </div> International Symposium on Bilingualism 13<br> .small[July 13, 2021 · 12:00–13:30 CET · Session OS19: Cross-linguistic influence] ??? 1: Hi, my name is Khia Johnson, and I'm presenting joint work with Molly Babel, which is based on a paper. You can find it at the OSF link in the slide footer. --- ## Time zone & bonus Q+A - Session OS19 is 3am in our time zone — I'll be there, if a bit sleepy! - Molly and I will hold an bonus Q+A session via Zoom on July 14 at 18:00 CET - Assuming you can comment on this video in the conference platform, I'll post the Zoom link there ??? A quick note about the time. I'll be at the Q+A session for this talk, but it's going to be 3am in my time zone, so I might be a bit sleepy. Molly and I will hold a bonus Q+A session on July 14 at 6pm central European time. I'll post the zoom link for that in the comments on this video. --- class: middle ## "...the locus of language contact is the bilingual speaker" [(Sankoff 2002: 643)](https://doi.org/10.1002/9780470756591.ch25) ??? I wanted to lead with this quote—the locus of language contact is the bilingual speaker. It ties ideas from crosslinguistic influence to ideas from research on language contact at the community level and suggests that the two are not different. This perspective emphasizes the role that bilinguals have to play in contact-induced language change. --- ## Crosslinguistic influence is... - language contact at an individual level [(see Yao & Chang, 2016)](https://doi.org/10.1353/lan.2016.0031) - bidirectional - **asymmetrical** ??? Crosslinguistic influence is one clear example of contact, and it's arguable where contact-induced change starts. And while we might see it primarily as an individual-level phenomenon, examining how crosslinguistic influence patterns across a group of bilinguals can give us a snapshot of language contact at the community level. This contact manifests bidirectionally, and as we focus on in this research, it's also asymmetrical. So while any of an individual's languages is subject to influence from other languages, they don't necessarily exert the same influence. --- ## Accounting for asymmetries - Language dominance [(Flege & Bohn, 2021)](https://doi.org/10.1017/9781108886901.002) - Age of acquisition [(Flege, Bohn, & Mackay, 1995)](https://doi.org/10.1121/1.413041) - Language mode [(Simonet & Amengual, 2019)](https://doi.org/10.1177/1367006919826388) - Recency [(Chang, 2013)](https://doi.org/10.1016/j.wocn.2013.09.006) <br><br> - **More room to vary** [(Bullock & Toribio, 2009;](https://benjamins.com/catalog/sibil.41.12bul) [Olson, 2016)](https://doi.org/10.1177/1367006914566204) ??? And there are accounts for much of this asymmetry. Some common ones include dominance, age of acquisition, language mode, and recency. These accounts cover a lot of ground and account for a wide variety of asymmetries. They largely reflect linguistic experience and are, by definition, highly individual in nature. Another proposal—one I've seen only anecdotally in the literature—is that there's something about the categories involved. And that's the idea that some categories are more susceptible to influence because they have more room to vary. This explains why you see languages with long-lag voice onset time typically showing more influence from languages with short-lag voice onset time than the reverse. Your typical 80 milliseconds of aspiration could drop by half and still be perceived as long-lag. While short-lag stops are more tightly bounded on either side. So this idea of room to vary hasn't gained much traction in the crosslinguistic influence literature, but it is something you see in work on other kinds of language contact. --- class: middle ## "The division of labor between SLA and sociohistorical linguistics has led to the investigation of language contact at both ‘micro’ (individual) and ‘macro’ (community) levels" [(Yao & Chang, 2016)](https://doi.org/10.1353/lan.2016.0031) ??? There's not a great reason for why you don't see a ton of crosstalk between these different lines of research. And I think this quote from a recent Yao & Chang paper really highlights that. These micro and macro levels of language contact research have mostly stayed separate, despite how they might inform one another in interesting ways. And that's what we're doing here. Because the idea that categories in contact might differ in acceptable variability is something that's been grappled with in the sound change literature. --- ## Adapting ideas from sound change .pull-left[ - [Harrington et al. (2018)](https://doi.org/10.1111/tops.12329) account for direction of change based on distributional relationships alone - For a given dimension (solid line), the more variable distribution (y) shifts towards the less variable distribution (x) over time - Not the same as compactness [(Kartushina et al., 2015)](https://doi.org/10.1121/1.4926561) ] .pull-right[ <img src='img/harrington2018fig2.png' height=380><br> Figure 2 from Harrington et al. (2018) ] ??? A model of sound change proposed by Harrington and colleagues accounts for the direction of a sound change based purely on the relationship between two distributions. The figure here is pulled from their 2018 paper, and while it explains the model using two dimensions, it would work just as well for 1 or more than 2. So in this example, you have two versions of a category, X and Y, and you're interested in how these categories influence one another along the solid line. In this dimension, Y is much more variable than X. So, if you had something in the middle, you'd be more likely to say it's a Y than an X. This means that Y shifts towards X over time; and, Harrington and colleagues have demonstrated this with both modeling and empirical data. It's a bit weird to think about if you haven't before, but the basic idea is that the less variable group stands its ground while the more variable group closes in. --- # Research question When linked cross-language categories differ in the degree of phonetic variation for a given phonetic dimension, does the **less variable** category constrain or guide production of the **more variable** category in the other language? **Hypothesis:** Yes, and this will be refined for each study ??? So what are we doing with this framework? Well. We're proposing that the internal structure of matched categories predicts asymmetry in crosslinguistic influence and that this has cascading effects for things like contact language varieties and sound change. While the original model offers an explanation for why specific sound changes occur, our adaptation of the model provides an account of how new changes might be introduced in the first place. --- ## General method - 2 studies examining English-Cantonese asymmetries **in English speech only** - Materials - Cantonese-English bilingual corpus - Local English comparison corpus - Non-local English comparison corpus - Identify target segments - Measure and/or manually code - Compare bilingual speech behavior to comparison groups with GLMMs ??? So, in broad terms, we follow a fairly standard corpus phonetics workflow. There are two studies, and each looks at a different asymmetry across English and Cantonese where we expect Cantonese influence on English. We use three corpora, each representing different speech communities: the bilinguals were focused on, and two English comparison corpora -- more about this on the next slide. We identified the segments of interest from the corpora and then measured or manually coded them, along with a host of control variables. With this data set, we compared bilinguals' speech production patterns to the comparison groups using generalized linear mixed-effects models. --- ## More about the materials - **SpiCE: Speech in Cantonese and English [(Johnson, 2021)](https://doi.org/10.5683/SP2/MJOXP3)** - Conversational interviews with Vancouver-based bilinguals - English interviews only (n = 34) - Force-aligned - **CTWCE: Corpus of Transnational West Coast English [(Swan, 2016)](http://doi.org/10.6082/M15T3HFT)** - Sociolinguistic interviews with Pacific Northwest English speakers - Vancouver-based participants with no Chinese language knowledge (n = 15) - Force-aligned - **Buckeye Corpus of Conversational [English] Speech [(Pitt et al., 2005)](https://doi.org/10.1016/j.specom.2004.09.001)** - Conversational interviews with monolingual English speakers in Ohio, USA - Young participants only (n = 20) - Manually-aligned ??? Before I jump into the first study, I'll say a little bit more about the materials. The bilingual data comes from the English half of the SpiCE corpus, a new open-access bilingual speech corpus with conversational interviews that we've developed over the last few years here in Vancouver. The first of the comparison corpora is the Corpus of Transnational West Coast English. It's a collection of sociolinguistic interviews with English speakers in Seattle and Vancouver. We use a subset of talkers from Vancouver, that don't have any experience with any Chinese language—though many know some French. I'll refer to this as our "local comparison group." We also included a non-local comparison with the younger half of the Buckeye corpus, a widely used corpus of conversational monolingual English speech from the midwest of the United States. --- class: middle # Study 1: Intervocalic /bdg/ voicing ??? On to the first study, where we look at voicing for voiced intervocalic stops --- ## English-Cantonese voicing asymmetry .pull-left[ - English stop voicing is highly variable (Davidson, 2016) - Cantonese lacks voiced stops (Bauer & Benedict, 2011) - **Prediction:** Cantonese-English bilinguals will produce less stop voicing during closure in their English /bdg/ *in casual speech*, compared to local and non-local English speakers ] .pull-right[ <img src="index_files/figure-html/voicing-asymmetry-1.png" width="100%" /> ] ??? This is a clear example of a variability asymmetry across English and Cantonese. The literature on English stops describes their voicing as highly variable. While it certainly depends on prosodic and other factors, the clear takeaway is its variability. However, the place you're most likely to see full voicing is between vowels inside a word. On the other hand, Cantonese lacks voiced obstruents entirely. So in one language, we see everything from full voicing to full devoicing, while in the other, we primarily see the latter. Though it's worth noting that we don't really see much in the literature that actually engages with the question of whether or not Cantonese stops voice under certain conditions. In casual speech, I think it's safe to assume that it happens at least some of the time. This is why in the cartoon diagram on the right, you see a wide distribution for English and a narrower one for Cantonese. The prediction is that bilinguals will produce less stop voicing in their English than the comparison groups. It's a case of the variable group getting pulled towards the invariable one. --- ## Study 1: Methods - **Identify** intervocalic, word-medial /bdg/ - **Manually code** as (1) Good, (2) Deletion, or (3) Transcript error - **Measure** proportion of voicing during closure - **Model** proportion voiced as Beta-distributed response variable in GLMM <br> .small[ **lme4-style formula:** *Proportion Voiced ∼ Corpus × Place × Duration + Position + Stress Precedes + Stress Follows + Log Word Frequency + (1 | Word) + (1 + Place + Duration | Talker)* ] ??? So in this study, we identified all intervocalic, word-medial /bdg/ from all three corpora. We then manually coded tokens as good, deletion, or transcript error. While these coding labels were not crucial for the manually annotated Buckeye corpus, they bring the SpiCE and CTWCE corpora, which were force aligned, onto a level playing field. With our final set of "good" tokens, we then measured the proportion of voicing during closure and kept track of various control variables. Lastly, as the dependent variable here is a proportion derived from a continuous process, we modeled it as a Beta-distributed response variable in a Generalized linear mixed-effects model. I won't be going over model details in this talk. Instead, I refer you to our paper for more information on things like variable specification and coding. --- .left-column[ ## Study 1: Main Result All corpora have highly voiced stops, but bilinguals to lesser extent. The comparison is consistent and clear for /b/. ] .right-column[ <img src="index_files/figure-html/bdg-results-1.png" width="80%" /> ] ??? The main result from this study is consistent with our hypothesis. In this plot, each panel represents the distribution of proportion voiced for a particular corpus and stop. It does this with a pared boxplot and density plot. First off, this environment tends to be highly voiced, and that's apparent in every panel. That is a lot of fully voiced stops. However, it's also clear that talkers in the SpiCE corpus tend not to be quite as fully voiced. The distributions are more spread out, and the median value is lower. On the other hand, the Buckeye corpus seems to have far more consistently fully voiced stops, with the median being full voicing for two of the three stops. CTWCE, our local comparison, shows more voicing than SpiCE for /b/, the stop for which we would be most likely to observe this difference. The other SpiCE versus CTWCE comparisons were not significant, though CTWCE seems to fall somewhere in the middle of the other two in this plot. --- ## Study 1: Takeaways - Bilinguals produce less voicing during closure - Small effects, but environment strongly favors voicing - Control effects all pattern as expected ??? The takeaway from study 1 is that, yes, our prediction held up. Bilinguals produce less voicing during closure. This difference is most evident with the non-local comparison but shows up with the local comparison as well. The effect size is relatively small, but given that this environment strongly favors voicing, it's not necessarily surprisingly small. We're comforted that the control variables—things like speech rate and stress—all pattern as expected. This makes us confident in the result. Lastly, while I don't have a slide for it, we also looked at some language mode variables within the SpiCE corpus. --- ## Study 1: Takeaways .pull-left[ - Additional model demonstrates no significant effect of language mode - Interview order trend follows expected pattern ] .pull-right[ <img src="index_files/figure-html/interview-order-1.png" width="100%" /> ] ??? ... and in that additional model, there was little to no influence for either variable. We considered two things; first, the order of interview sessions in the SpiCE corpus. That is, whether an individual did the Cantonese session first or not. This was a marginal effect, and it patterned as expected, but it isn't something we can draw firm conclusions about. The second consideration was proximity to a code-switch, though there wasn't a ton of code-switching in the English interviews. We didn't see any effect in the model. --- class: middle # Study 2: Word-final /ptk/ releases ??? Ok, so moving onto the second study. It has a similar structure but focuses on a different asymmetry. --- ## English-Cantonese release rate asymmetry .pull-left[ - English stop releases are highly variable [(Davidson, 2011)]() and socially-conditioned [(Polinsky, 2018)]() - Cantonese final stops are categorically *unreleased* [(Bauer & Benedict, 2011)]() - **Prediction:** Bilinguals will be more likely to produce unreleased final stops in English /ptk/ in casual speech, compared to local and non-local English speakers ] .pull-right[ <img src="index_files/figure-html/release-asymmetry-1.png" width="100%" /> ] ??? Yet again, we have a clear example of asymmetry in which English is highly variable, and Cantonese is not. The example here final stop release rates. English stop releases are incredibly variable, which means you see clear releases, quasi-releases, non-releases, many forms of lenition, as well as deletion. This particular variable has some degree of social salience to it, so you'll see release rate vary across different social groups and speaking styles. Conversely, Cantonese stops are unreleased in final position. It's often discussed as a neutralization, though it's not entirely clear whether the long-lag or short-lag stops you see in initial position would be treated as the underlying form. Our prediction is that given the casual peer-to-peer speaking style, we would see Cantonese influence on English release patterns; specifically, bilinguals would be more likely to produce unreleased final stops, effectively mirroring their Cantonese. --- ## Study 2: Methods - **Identify** pre-vocalic and pre-approximant word-final /ptk/ - **Manually code** as (1) Released, (2) Unreleased, (3) Lenition, (4) Glottalization, (5) Deletion, or (6) Transcript error - **Exclude** errors, deletions, and all /p/ tokens - **Model** whether or not an item was *unreleased* using logistic GLMM <br> >.small[ **lme4-style formula:** Unreleased: True ∼ Corpus × Place × Rate + Vowel Precedes + Stress Precedes + Stress Follows + Word Frequency + (1 | Word) + (1 + Place × Rate | Talker) ] ??? Much like in Study 1, we identified the target items from all three corpora. These were word-final /ptk/, where the following word began with either a vowel or an approximant, as this environment encourages releases. We then manually coded the tokens into one of 6 categories. Released include clear and quasi releases. Unreleased tokens are characterized by clear closure but no release. While we coded for lenition and glottalization, we lumped these together in the analysis. Instances of deletion were removed, as were transcript errors. We also excluded all /p/ tokens from the analysis, as the counts were very low. We then modeled whether or not a stop surfaced as unreleased using a logistic mixed-effects model. While this might unfairly lump releases and lenition together, it gets at our core question of whether or not bilinguals are more likely to show Cantonese-like behavior. --- .left-column[ ## Study 2: Main Result - Unreleased more likely for bilinguals - No effect for interview order or code-switches ] .right-column[ <img src="index_files/figure-html/tkresults-1.png" width="100%" /> ] ??? The main result from Study 2 is that bilinguals are more likely to produce unreleased stops than either of the comparison groups. And notably, in the follow-up model, there was no effect of language mode as captured by interview order or proximity to a code switch. We also examined release rates for this study as opposed to the "unreleased rates just mentioned, and interestingly, we found that both Vancouver groups had remarkably low release rates, which points at a possible regional feature. --- ## Study 2: Takeaways - Predictions borne out: Bilingual more likely to product unreleased final stops - Small effects, but environment favors releases - Control effects all pattern as expected - Additional model (see paper) demonstrates no significant effect of language mode ??? So, the takeaway from Study 2 is that our hypothesis held up, albeit with small effect sizes. We anticipated that this environment would be conducive to releases and find that to be mostly true, except for the local groups' T productions. The control effects largely patterned as expected. T and K tokens interacted differently with duration, though we largely attribute this to the large percentage of Ks occurring in the word "like" -- and this is something we go into more in the paper. And lastly, there were not language mode effects detected, which is not to say they don't exist, but rather that our measures were perhaps inadequate for the purposes of detecting them. --- class: middle # Dicussion ??? There's a lot to unpack in this study, and this talk is focused on the main points. But there are two things that I want to highlight in the discussion of this work. --- ## Importance of corpus research - Speaking style and communicative context matter - our results contradict prior lab results [(Polinsky, 2018)]() - Capturing variable patterns requires... "natural" variation - Lab + corpus = better theories and models in bilingualism [(Travis & Cacoullos, 2013)](https://doi.org/10.1080/07268602.2013.814529) ??? The first implication of this work, and one that may not have been as readily apparent in the takeaways from each study, is the importance of doing this kind of work with speech corpora. While it seems that our results contradict prior work on bilinguals' stop releases, this difference can be easily explained by speaking style. Polinsky did a study with Cantonese-English bilinguals in California and found them more likely to produce stop releases. Polinsky argued that it reflected that bilinguals might anticipate misperception occurring and thus hyperarticulate to mitigate that. We found the opposite. And the difference comes down to context. Polinsky's participants produced formal, citation-style speech, and presumably, the experimenter was not a member of their speech community. On the other hand, our corpus work reflects peer-to-peer speech where the interviewer was a bilingual from the same speech community as the participant. This is precisely the kind of situation where clear speech and hyper-articulation aren't so necessary. And, we don't argue that releases never occur. They do, just less. So this feeds into the next point: to capture variable patterns, you need to have natural variation in the speech sample, which is harder to elicit in experimental work. Doing both lab studies and corpus studies represents a path to developing better theories and models. The second central part of the discussion is perhaps the more obvious one. --- ## Differences in degree of phonetic variation can account for crosslinguistic influence asymmetry - Builds on theoretical understanding of what happens to "linked" or "composite categories [(Flege & Bohn, 2021)]() - A **proof of concept** in successful conceptual adaptation of sound change model from [Harrington et al. (2018)]() <br>➞ Claims should be tempered by single direction & small effect sizes - **A testable, concrete mechanism** <br>➞ we want to see how it holds up! ??? And that's our mind finding. The difference in the degree of phonetic variation offers an explanation for why crosslinguistic influence is asymmetrical. This finding builds on our understanding of crosslinguisitic influence, particularly with what happens to linked categories. However, what we do in this research is perhaps best cast as a proof of concept. We successfully adapted a Harrington and colleagues model of sound change, at least on a conceptual level. There is plenty of room to build on this work! I say it's a proof of concept because there are a few limitations to this work that temper the strength with which we can make our claims. And that's the fact that we only look at influence in a single direction (Cantonese to English) and that many of the key results we presented ultimately stem from small effect sizes. So, while we present a testable, concrete mechanism, we're very interested to see how it holds up in our and others' future work. --- ## Acknowledgments Thank you to Angelina Lloy, Christina Sen, and Stephanie Chung for assistance with data coding; Amanda Cardoso, Gloria Mellesmoen, and the Speech in Context Lab for comments on this work at various stages; Ivan Fong, Nancy Yiu, Ariana Zattera, Christina Sen and many others for their work on the SpiCE corpus; and, to the reviewers and editors for our manuscript, who have helped us improve this work substantially. This work has been supported in part by funding from Canada’s [Social Sciences and Humanities Research Council](https://www.sshrc-crsh.gc.ca/) (MB), Canada’s [Natural Sciences and Engineering Research Council](https://www.nserc-crsng.gc.ca/) (MB), and a [UBC Public Scholars Initiative Award](https://www.grad.ubc.ca/psi) (KJ). ??? There are many people to thank for their contributions to this project, for helping create the SpiCE corpus, and for giving us feedback on the manuscript. --- class: top ### References + preprint <svg viewBox="0 0 512 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <g label="icon" id="layer6" groupmode="layer"> <path id="path2" d="m 255.9997,7.9999987 c -34.36057,0 -62.21509,27.8545563 -62.21509,62.2151643 0,20.303056 9.87066,38.160947 24.91769,49.517247 0.18814,-20.457899 16.79601,-36.993393 37.29685,-36.993393 20.50082,0 37.11091,16.535494 37.29909,36.993393 15.04533,-11.3563 24.9177,-29.212506 24.9177,-49.517247 C 318.21272,35.854555 290.35915,7.9999987 255.99915,7.9999987 Z M 293.29654,392.2676 c -0.18814,20.4601 -16.79601,36.99338 -37.29684,36.99338 -20.50082,0 -37.10922,-16.53551 -37.29684,-36.99338 -15.04759,11.35627 -24.91769,29.21246 -24.91769,49.51722 0,34.36059 27.85453,62.21518 62.2151,62.21518 34.36056,0 62.21508,-27.85459 62.21508,-62.21518 0,-20.30306 -9.87066,-38.16095 -24.91767,-49.51722 z M 441.78489,193.78484 c -20.30301,0 -38.16309,9.87068 -49.51717,24.91769 20.45786,0.18819 36.99333,16.79605 36.99333,37.29689 0,20.50085 -16.53547,37.11096 -36.9911,37.29916 11.35634,15.04533 29.21249,24.91769 49.51721,24.91769 C 476.14549,318.21327 504,290.35948 504,255.99942 504,221.6394 476.14549,193.78425 441.78489,193.78425 Z M 82.738898,255.99997 c 0,-20.50139 16.535509,-37.11096 36.993392,-37.29689 -11.35632,-15.04756 -29.214164,-24.91769 -49.517197,-24.91769 -34.36057,0 -62.2150945,27.85455 -62.2150945,62.21517 0,34.3606 27.8545245,62.21516 62.2150945,62.21516 20.303033,0 38.160877,-9.87068 49.517197,-24.91773 -20.457883,-0.18818 -36.993391,-16.796 -36.993391,-37.29688 z M 431.3627,80.636814 c -24.29549,-24.295544 -63.68834,-24.295544 -87.9844,0 -14.35704,14.357057 -20.00298,33.963346 -17.39331,52.633806 -0.0824,0.0809 -0.18198,0.13437 -0.26434,0.21491 -14.578,14.57799 -14.578,38.21689 0,52.79488 14.57797,14.57799 38.21681,14.57799 52.79484,0 0.0824,-0.0824 0.13455,-0.18198 0.21732,-0.26434 18.66819,2.60796 38.27445,-3.03799 52.63151,-17.39336 24.29378,-24.29778 24.29378,-63.68837 -0.003,-87.986153 z M 186.2806,378.51178 c 14.57798,-14.57799 14.57798,-38.21461 0,-52.79319 -14.57798,-14.57853 -38.21683,-14.57798 -52.79481,0 -0.0825,0.0824 -0.13448,0.18199 -0.21476,0.26215 -18.67046,-2.60795 -38.276723,3.03572 -52.63376,17.39505 -24.297753,24.29552 -24.297753,63.6884 0,87.98449 24.29551,24.29552 63.68833,24.29552 87.98439,0 14.35702,-14.35703 20.00297,-33.96333 17.39333,-52.63386 0.0848,-0.0786 0.18364,-0.13228 0.26672,-0.21505 z m 0,-245.02583 c -0.0826,-0.0824 -0.18198,-0.13436 -0.26445,-0.21494 2.60795,-18.66823 -3.038,-38.27452 -17.39332,-52.633811 -24.29777,-24.295544 -63.68832,-24.295544 -87.984405,0 -24.297747,24.297781 -24.297747,63.688381 0,87.986151 14.357042,14.35706 33.963315,20.00301 52.631515,17.39336 0.0808,0.0824 0.13447,0.18199 0.21475,0.26434 14.57799,14.57799 38.21684,14.57799 52.79482,0 14.57797,-14.57802 14.57797,-38.21689 0,-52.79488 z m 245.0821,209.89048 c -14.35703,-14.35703 -33.96329,-20.00301 -52.63378,-17.39505 -0.0809,-0.0824 -0.13228,-0.18199 -0.21506,-0.26215 -14.57797,-14.57799 -38.21685,-14.57799 -52.79482,0 -14.57797,14.57799 -14.57797,38.21461 0,52.79316 0.0827,0.0828 0.18198,0.13455 0.26434,0.21505 -2.60796,18.67053 3.03802,38.27683 17.39334,52.63386 24.29552,24.29552 63.68834,24.29552 87.98439,0 24.29775,-24.29552 24.29775,-63.68841 0.003,-87.98451 z" style="stroke-width:0.07717"></path> </g></svg> Johnson, K. A., & Babel, M. (Under review). Language contact within the speaker: Phonetic variation and crosslinguistic influence. <https://osf.io/jhsfc/> ### Use our data <svg viewBox="0 0 256 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <g label="icon" id="layer6" groupmode="layer"> <path id="path42" style="color:#000000;font-style:normal;font-variant:normal;font-weight:normal;font-stretch:normal;font-size:medium;line-height:normal;font-family:sans-serif;font-variant-ligatures:normal;font-variant-position:normal;font-variant-caps:normal;font-variant-numeric:normal;font-variant-alternates:normal;font-variant-east-asian:normal;font-feature-settings:normal;font-variation-settings:normal;text-indent:0;text-align:start;text-decoration:none;text-decoration-line:none;text-decoration-style:solid;text-decoration-color:#000000;letter-spacing:normal;word-spacing:normal;text-transform:none;writing-mode:lr-tb;direction:ltr;text-orientation:mixed;dominant-baseline:auto;baseline-shift:baseline;text-anchor:start;white-space:normal;shape-padding:0;shape-margin:0;inline-size:0;clip-rule:nonzero;display:inline;overflow:visible;visibility:visible;opacity:1;isolation:auto;mix-blend-mode:normal;color-interpolation:sRGB;color-interpolation-filters:linearRGB;solid-color:#000000;solid-opacity:1;vector-effect:none;fill-opacity:1;fill-rule:nonzero;stroke:none;stroke-width:0.264583;stroke-linecap:butt;stroke-linejoin:miter;stroke-miterlimit:4;stroke-dasharray:none;stroke-dashoffset:0;stroke-opacity:1;color-rendering:auto;image-rendering:auto;shape-rendering:auto;text-rendering:auto;enable-background:accumulate;stop-color:#000000;stop-opacity:1" d="m 128.23305,64.000002 c -56.460308,0 -102.670899,46.211658 -102.670899,102.672438 v 14.02215 h 41.118177 v -14.02215 c 0,-34.23923 27.314786,-61.55425 61.552722,-61.55425 34.23793,0 61.55272,27.31502 61.55272,61.55425 v 56.25186 c -18.11316,-10.82348 -39.2508,-17.06007 -61.78576,-17.06007 -66.62047,0 -121.0678874,54.44743 -121.0678874,121.06788 C 6.9321226,393.55258 61.37954,448 128.00001,448 c 66.62047,0 121.06787,-54.44742 121.06787,-121.06789 0,-23.29025 -6.66132,-45.0879 -18.16393,-63.59639 v -96.66328 c 0,-56.46078 -46.21063,-102.672437 -102.6709,-102.672438 z m -0.23301,182.982408 c 44.39846,0 79.94968,35.55122 79.94968,79.9497 0,44.39849 -35.55122,79.94973 -79.94968,79.94973 -44.398491,0 -79.949707,-35.55124 -79.949707,-79.94973 0,-44.39848 35.551216,-79.9497 79.949707,-79.9497 z m 0.39478,46.39298 c -18.7346,0 -33.921611,15.187 -33.921611,33.92161 0,18.73459 15.187011,33.92238 33.921611,33.92238 18.7346,0 33.92238,-15.18779 33.92238,-33.92238 0,-18.73461 -15.18778,-33.92161 -33.92238,-33.92161 z"></path> </g></svg> Johnson, K. A. (2021). SpiCE: Speech in Cantonese and English. Scholars Portal Dataverse, V1. <https://doi.org/10.5683/SP2/MJOXP3> ### Connect with us! <svg viewBox="0 0 576 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M532 386.2c27.5-27.1 44-61.1 44-98.2 0-80-76.5-146.1-176.2-157.9C368.3 72.5 294.3 32 208 32 93.1 32 0 103.6 0 192c0 37 16.5 71 44 98.2-15.3 30.7-37.3 54.5-37.7 54.9-6.3 6.7-8.1 16.5-4.4 25 3.6 8.5 12 14 21.2 14 53.5 0 96.7-20.2 125.2-38.8 9.2 2.1 18.7 3.7 28.4 4.9C208.1 407.6 281.8 448 368 448c20.8 0 40.8-2.4 59.8-6.8C456.3 459.7 499.4 480 553 480c9.2 0 17.5-5.5 21.2-14 3.6-8.5 1.9-18.3-4.4-25-.4-.3-22.5-24.1-37.8-54.8zm-392.8-92.3L122.1 305c-14.1 9.1-28.5 16.3-43.1 21.4 2.7-4.7 5.4-9.7 8-14.8l15.5-31.1L77.7 256C64.2 242.6 48 220.7 48 192c0-60.7 73.3-112 160-112s160 51.3 160 112-73.3 112-160 112c-16.5 0-33-1.9-49-5.6l-19.8-4.5zM498.3 352l-24.7 24.4 15.5 31.1c2.6 5.1 5.3 10.1 8 14.8-14.6-5.1-29-12.3-43.1-21.4l-17.1-11.1-19.9 4.6c-16 3.7-32.5 5.6-49 5.6-54 0-102.2-20.1-131.3-49.7C338 339.5 416 272.9 416 192c0-3.4-.4-6.7-.7-10C479.7 196.5 528 238.8 528 288c0 28.7-16.2 50.6-29.7 64z"></path></svg> khia.johnson@ubc.ca · @khia_johnson · [khiajohnson.com](https://khiajohnson.com) <br> molly.babel@ubc.ca · @UBCSpeeCon · [speechincontext.arts.ubc.ca](https://speechincontext.arts.ubc.ca) ??? Lastly, we want to refer you to the paper for more details and our complete list of references. We also hope that you are inspired to your our data. The SpiCE corpus is free and available for download. We hope to see you in the Q+A session and elsewhere during ISB.