CONTEXT ***Please read this entire file before working with this collection*** The data in this project were compiled through consultation with a single Kayan speaker speaker, Elrica anak Blabun. The consultation sessions were conducted online and are mostly elicitation between the researchers and consultant. Much of the data are in the form of elicited sentences, grammaticality judgements, and in some cases novel sentences volunteered by Elrica. The data are therefore outside of the context of typical Kayan speech, but the sentences should still be considered natural and grammatical Kayan sentences in the context of direct elicitation, except of course in cases where sentences were judged by Elrica as ungrammatical. Kayan is a language spoken in central Sarawak. It is internally diverse, with many different forms spoken in different areas. The variety in this archive is spoken in Uma Nyaving, a Kayan community in the Sungai Asap area of central Sarawak. Uma Nyaving Kayan is considered part of the larger Rejang River dialect, although these recordings should only be considered indicative of the Kayan spoken at Uma Nyaving. THE COLLECTION The collection consists of recordings of consultation sessions, transcriptions of the recordings, analyzed texts, and a vocabulary. A summary of each item is given below. Recordings: Recordings are roughly one-hour long sessions. The recordings were made on the zoom platform. Quality is generally good but do not expect very high quality audio. mp4 and wav audio files are available for each session. Transcriptions: Each session has a full transcription to provide appropriate context. The transcriptions are exported from ELAN as a txt file. The original ELAN file is also included. Analyzed texts: The texts are created from the transcription, but only include the relevant Kayan words and sentences. Analyzed texts include interlinear glossing with linguistic analysis and translation. The collection includes xml and txt versions of the analyzed texts. In the txt files, Transcriptions are in a tab- separated format. All sentences are time-stamped. The interlinearization of the analyzed texts in the txt file is organized as a four-line gloss as follows: line 1: Orthographic transcription of the Kayan sentence line 2: IPA version of transcription, tab-separated, with morphology indicated line 3: Aligned gloss with preliminary analysis line 4: Translation (followed by citation and context information) Following each translation are three items in parentheses in the following order: (File Name, Sentence Type, Timestamp in HH:MM:SS.xxx format) The file name gives the specific session as it appears in the archive, for example, ADS5-004. The sentence type specifies some of the context. We distinguish between three types: Elicited Sentences, Grammaticality Judgements, and Volunteered Sentences. Each is indicated with a single-letter abbreviation as follows: E = Elicited sentence (Elrica was asked to provide a certain Kayan sentence) J = Grammaticality judgement (Elrica was asked to judge the grammaticality of a novel Kayan sentence constructed by the investigator) V = Volunteered sentence (Elrica volunteered the sentence either as an alternative to the elicited sentence or as a spontaneous Kayan sentence) A sample analyzed sentence is shown below: nun ika' au kuman nun ikaʔ au kuman what 2s already eat What did you already eat? (ADS5-004, J, 00:27:34.178--00:27:36.628) This is a sentence from ADS5-004, it was a novel sentence provided by the investigator and judged as grammatical, and occurs from minute 27, second 34.178 until minute 27, second 36.628. Because of the tab-separation, interlinear glossing may not always align vertically with the Kayan word. Transferring the sentence to a spreadsheet or table will solve the visual discrepancy. Ungrammatical sentences are marked with *. Borderline sentences, sentences that have an other-than-intended meaning, or sentences that Elrica is unsure about are marked with #. All other sentences are considered grammatical and in an appropriate context. For any disagreements between the transcripts and the analyzed text with regard to the form of Kayan words or exact translations, use the analyzed text. The IPA uses some Austronesian conventions: y is used instead of true IPA [j] j is used instead of true IPA [ʤ] Vocabulary: The vocabulary is a compilation of all Kayan words and their definitions as provided in the sessions. The list is not meant to be a complete vocabulary of Kayan, but only a summary of the words provided during the project. All Kayan words found in the texts should have a corresponding entry in the vocabulary. ORGANIZATION OF THE COLLECTION ADS5-002 is the current wordlist. Both a txt and csv file is provided. ADS5-003 is all of the elicited sentences from every session. They are stored in both a txt file and in a FLEx-exported xml file. We include ADS5-003 as a convenient way to access relevant data from all sessions in the collection without having to go through each session individually. However, it is still recommended to access the individual session file to check the context of the data. ADS5-004 and all following sessions have session-specific files. These include the session recording (wav and mp4), transcription, analyzed text, and ELAN. In ADS5-004 (and all following sessions), there are two txt files. The first, ADS5-004-01, is the analyzed text, the second, ADS5-004-02, is the transcript. VERSIONS Updates will proceed monthly during the duration of the project, until Fall 2024. Current version and upload history is as follows: Current version: v003, 2024-07-06 (third deposit, ADS5-011 through ADS5-021) Past versions: v002, 2023-05-26 (second deposit, ADS5-007 through ADS5-010) v001, 2023-02-27 (initial deposit, ADS5-001 through ADS5-006) CITATION Please reference the collection as a whole and specify sessions in the in-text citation. Alexander D. Smith, Sommerlot, Carly J., Michael Yoshitaka Erlewine. 2023. Kayan Uma Nyaving Language Elicitation. Kaipuleohone Digital Language Archive. An example in-text citation referencing data from ADS5-006: (Smith et al. 2023: ADS5-006) If editors allow, list all names: (Smith, Sommerlot & Erlewine 2023: ADS5-006) SUPPORT This project was supported by the Singapore Ministry of Education under the grant “History and syntax of the languages of Borneo” (MOE-T2EP40121-0003, A-8000132-00-00). CONTACT Contact any of the researchers for inquiries: Alexander D. Smith - smith.alexander.david@gmail.com Carly J. Sommerlot - csommerlot@gmail.com Michael Yoshitaka Erlewine - mitcho@mitcho.com