Persistent Identifier
|
doi:10.18710/MGMEDP |
Publication Date
|
2024-03-20 |
Title
| Replication Data for: The trouble with phrases: Studies of Norwegian phraseology in norm and usage |
Author
| Rauset, Margunn (University of Bergen) - ORCID: 0000-0001-6158-858X |
Point of Contact
|
Rauset, Margunn (University of Bergen) |
Description
| This dataset contains the empirical data that are analysed and discussed in the doctoral dissertation "Frasar til besvær? Studiar av norm og bruk i norsk fraseologi" [The trouble with phrases: Studies of Norwegian phraseology in norm and usage]. The dataset consists of two collections. The first is a phrase collection which forms the empirical basis for the discussion in chapter 5 of the dissertation, and the second collection is discussed in chapter 6. The dataset has been manually curated.
The data in chapter 5 is a collection of 713 phrases and the different forms they are given in different dictionaries. The sources are two printed phraseological collections from 2011 ("Prikken over i-en og andre uttrykk" (Vannebo 2011) and "Ord og uttrykk på fire språk" (Erichsen 2011)) and the online dictionary Islex (islex.no) in 2014. The dataset also records whether, and in what form, the phrases extracted from these sources are included in the online version of Bokmålsordboka and Nynorskordboka (ordbok.uib.no) from 2014. The selection criteria for the first collection are described on pages 203–211 in the dissertation; these pages are also attached as an appendix below.
The data in chapter 6 is a collection of attestations of the phrase "kaste barnet ut med badevatnet" (‘throw out the baby with the bathwater’), with variations, creative modifications and divergent use, in 833 Norwegian texts. There are 460 unique examples in Bokmål and 373 examples in Nynorsk, excerpted from 10 different text corpora and other digital text collections in 2015 and 2016. The data were entered in table format (Excel sheet) and ordered with one attestation (one corpus instance) per line. For each attestation (i.e. for each line), attribute values were added in columns to describe and document the attestation further. The collection procedure and methodological considerations for the second collection are described on pages 302–306 in the dissertation. These pages are attached as an appendix below.
The dataset is replication data for the following publication: Rauset, Margunn (2022). "Frasar til besvær? Studiar av norm og bruk i norsk fraseologi." ISBN 9788230854143 (print), 9788230866634 (PDF). (2024-03-19) |
Subject
| Arts and Humanities |
Keyword
| linguistics
lingvistikk (Humord) http://data.ub.uio.no/humord/c05737
lexicography
leksikografi (Humord) http://data.ub.uio.no/humord/c06153
phraseology
fraseologi (Humord) http://data.ub.uio.no/humord/c05804
Norwegian language
norsk språk (Humord) http://data.ub.uio.no/humord/c10259
Bokmål (Humord) http://data.ub.uio.no/humord/c10260
Nynorsk språk http://data.ub.uio.no/humord/c10261 |
Related Publication
| Rauset, M. (2022). Frasar til besvær? Studiar av norm og bruk i norsk fraseologi [Doctoral thesis, The University of Bergen]. handle: 11250/2992943 https://hdl.handle.net/11250/2992943 |
Language
| English; Norwegian Bokmål; Norwegian Nynorsk; Norwegian |
Producer
| University of Bergen (UiB) https://www.uib.no/en |
Contributor
| Data Collector : Kinn, Torodd |
Distributor
| University of Bergen (University of Bergen) https://dataverse.no/dataverse/uib |
Depositor
| Rauset, Margunn |
Deposit Date
| 2022-04-22 |
Time Period
| Start Date: 1898-01-01 ; End Date: 2016-09-29 |
Date of Collection
| Start Date: 2013-09-01 ; End Date: 2017-09-01 |
Data Type
| Corpus data; Txt-files with text strings |
Data Source
| The phrase collection discussed in chapter 5 is taken from two printed phraseological collections and three online dictionaries:
* "Prikken over i-en og andre uttrykk". Vannebo, Kjell Ivar. 2011. Prikken over i-en og andre uttrykk: bruk, bakgrunn, betydning. Oslo: Cappelen Damm.
* "Ord og uttrykk på fire språk". Erichsen, Gerda Moter. 2011. Ord og uttrykk på fire språk. Oslo: Cappelen Damm akademisk.
* Islex, collected in 2014
* the online version of the two dictionaries Bokmålsordboka and Nynorskordboka (ordbok.uib.no), collected in 2014
The material discussed in chapter 6 is taken from the following sources, collected in 2015-2016: (in the list below the Norwegian name of the resource is given, followed by the corresponding English name, and finally with a link to the resource metadata for further information about the resource, terms of access, etc.)
Bokmål:
* Leksikografisk bokmålskorpus [The Corpus for Bokmål Lexicography]. Distributed by CLARINO Text Laboratory Centre: http://hdl.handle.net/11538/0000-000B-C022-5
* Norsk aviskorpus [Norwegian Newspaper Corpus Bokmål]. Distributed by the CLARINO UiB Portal: http://hdl.handle.net/11495/D9B5-0349-4330-0
* NoWaC [Norwegian Web as Corpus]. Distributed by CLARINO Text Laboratory Centre: http://hdl.handle.net/11538/0000-0005-E7C0-D
Nynorsk:
* Norsk aviskorpus [Norwegian Newspaper Corpus Nynorsk]. Distributed by the CLARINO UiB Portal: http://hdl.handle.net/11495/D9F7-42A1-ED26-0
* Norsk Ordboks nynorskkorpus [Norwegian Dictionary Corpus]. Distributed by the CLARINO UiB Portal: http://clarino.uib.no/korpuskel/landing-page?resource=nnk&view=short
* Oslo-korpuset av taggede norske tekster (nynorskdelen) [The Oslo Corpus, Nynorsk part]. Distributed by the CLARINO Text Laboratory Centre: http://hdl.handle.net/11538/0000-0005-E7C3-A
* Atekst Retriever (a collection of Norwegian newspapers): https://app.retriever-info.com/services/archive?languageCategory=lang_NO
* Google (the search engine): https://google.com
* Nasjonalbiblioteket [The National Library]: https://nb.no |