A curated global dataset of social contact between diverse language communities

IRIS

The GramAdapt Social Contact Dataset is a curated dataset of 34 language pairs with qualitative and quantifiable data on social interaction and aspects of societal multilingualism. The language pairs were sampled globally to represent the world’s linguistic diversity. The dataset can be used to interrogate the social dimensions of language contact independently or in conjunction with appropriate linguistic data. The data were collected by distributing a questionnaire to experts who have experience with either one or both of the language communities of a pair. The data represent subjective expert assessments based on choices from predetermined answers which can be quantified. Authors 1, 2 and 3 manually checked the response to identify possible misjudgments or misunderstandings. This results in a dataset containing 13,493 data points. This dataset is a first of its kind in the field of linguistics, built upon wide findings from sociolinguistics, historical linguistics, psycholinguistics, and linguistic anthropology.

A curated global dataset of social contact between diverse language communities

Rosnátaly Avelino^{Data Curation};Sacha Beck^{Data Curation};Anna Berge^{Data Curation};Ana Blanco Pena^{Data Curation};Ross Bowden^{Data Curation};Nicolás Brid^{Data Curation};Joseph M. Brincat^{Data Curation};María Belén Carpio^{Data Curation};Alexander Cobbinah^{Data Curation};Paola Cúneo^{Data Curation};Deginet Wotango Doyiso^{Data Curation};Anne-Maria Fehn^{Data Curation};Saloumeh Gholami^{Data Curation};Arun Ghosh^{Data Curation};Hannah Gibson^{Data Curation};Elizabeth Hall^{Data Curation};Katja Hannß^{Data Curation};Hannah Haynie^{Data Curation};Jerry Jacka^{Data Curation};Matias Jenny^{Data Curation};Richard Kowalik^{Data Curation};Sonal Kulkarni-Joshi^{Data Curation};Maarten Mous^{Data Curation};Marcela Mendoza^{Data Curation};Cristina Messineo^{Data Curation};Francesca Romana Moro^{Data Curation};Hank Nater^{Data Curation};Michelle Ocasio^{Data Curation};Bruno Olsson^{Data Curation};Ana María Ospina Bozzi^{Data Curation};Agustina Paredes^{Data Curation};Admire Phiri^{Data Curation};Nicolas Quint^{Data Curation};Erika Sandman^{Data Curation};Dineke Schokkin^{Data Curation};Ruth Singer^{Data Curation};Ellen Smith-Dennis^{Data Curation};Lameen Souag^{Data Curation};Yunus Sulistyono^{Data Curation};Yvonne Treis^{Data Curation};Matthias Urban^{Data Curation};Jill Vaughan^{Data Curation};Georg Ziegelmeyer^{Data Curation};Veronika Zikmundová^{Data Curation};Ricardo Napoleão de Souza^Methodology;Kaius Sinnemäki^{Conceptualization}

2025-01-01

Abstract

The GramAdapt Social Contact Dataset is a curated dataset of 34 language pairs with qualitative and quantifiable data on social interaction and aspects of societal multilingualism. The language pairs were sampled globally to represent the world’s linguistic diversity. The dataset can be used to interrogate the social dimensions of language contact independently or in conjunction with appropriate linguistic data. The data were collected by distributing a questionnaire to experts who have experience with either one or both of the language communities of a pair. The data represent subjective expert assessments based on choices from predetermined answers which can be quantified. Authors 1, 2 and 3 manually checked the response to identify possible misjudgments or misunderstandings. This results in a dataset containing 13,493 data points. This dataset is a first of its kind in the field of linguistics, built upon wide findings from sociolinguistics, historical linguistics, psycholinguistics, and linguistic anthropology.

Scheda breve

Scheda completa

Scheda completa (DC)

Anno

2025

Appare nelle tipologie:

1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Kashima_et_al-2025-Scientific_Data.pdf accesso aperto Tipologia: Documento in Post-print Licenza: Creative commons Dimensione 2.44 MB Formato Adobe PDF Visualizza/Apri	2.44 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11574/250621

Citazioni

ND

social impact