|
View: |
Part 1: Document Description
|
|
Citation |
|
|---|---|
|
Title: |
Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions |
|
Identification Number: |
doi:10.18710/SIPOUV |
|
Distributor: |
DataverseNO |
|
Date of Distribution: |
2025-01-15 |
|
Version: |
1 |
|
Bibliographic Citation: |
Hampe, Beate; Gries, Stefan Th., 2025, "Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions", https://doi.org/10.18710/SIPOUV, DataverseNO, V1 |
|
Citation |
|
|
Title: |
Replication Data for: Syntax from and for discourse II: More on complex sentences as meso-constructions |
|
Identification Number: |
doi:10.18710/SIPOUV |
|
Authoring Entity: |
Hampe, Beate (University of Erfurt) |
|
Gries, Stefan Th. (University of California at St. Barbara) |
|
|
Producer: |
University of Erfurt |
|
University of California at St. Barbara |
|
|
Software used in Production: |
Antconc |
|
Distributor: |
DataverseNO |
|
Distributor: |
The Tromsø Repository of Language and Linguistics (TROLLing) |
|
Access Authority: |
Hampe, Beate |
|
Depositor: |
Hampe, Beate |
|
Date of Deposit: |
2019-03-06 |
|
Holdings Information: |
https://doi.org/10.18710/SIPOUV |
|
Study Scope |
|
|
Keywords: |
Arts and Humanities, temporal adverbial clauses, usage-based models, complex-sentence constructions, meso-constructions, cognitive corpus linguistics, multinomial regression, English, complex sentences, subordination, British National Corpus, adverbial clauses, construction grammar |
|
Abstract: |
<p><b>Dataset abstract:</b> The corpus files employed are a subset of 812 files containing spoken language from the British National Corpus (World edition, Oct. 2000) capturing British English in the late 20th century. For a description of the corpus, see <a href="http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml">http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml</a>. A total of 740 files were chosen because their meta data marked them as belonging to one of the following genres: broadcast discussion; classroom; consultation; conversation; demonstration; interview; interview or oral history; meeting; parliament; public debate; tutorial; spoken unclassified. To these, we added 72 files with the genre descriptions: courtroom; speech unscripted; sports live. </p><p>From these files with spoken British English, all occurrences of adverbial clauses exhibiting one of the four subordinating conjunctions ‘before’, ‘after’, ‘once’, and ‘until’ were extracted. For the final analysis, 8 samples of equal size (together comprising 560 tokens) were created from this output by narrowing down the corpus output to sentence configurations with adverbial clauses with these conjunctions in either initial or final position, by retaining only complex sentence configurations showing both the adverbial clause and a matrix, and by finally selecting only 1 token per file following a randomizer. The size of each of the subsets (70 tokens) was dictated by the frequency of the most infrequent configuration (initial until-clauses). </p> |
|
<p></p><b>Article abstract:</b> This paper presents a direct continuation of preceding corpus-linguistic research on complex sentence constructions with temporal adverbial clauses in a cognitive and usage-based framework (Diessel 2008; Hampe 2015). Working towards a more systematic construction-based account of complex sentences with before-, after-, until- and once-clauses in spontaneously spoken English, Hampe (2015) hypothesised that the morpho-syntactic realisations of configurations with initial adverbial clauses systematically diverge from those of configurations with final ones as a reflection of the specific functionality of each and that usage properties that are found across instantiations with a coherent functional load are retained in the schematisations creating constructions. This paper employs a multinomial regression in order to test to which extent each of eight closely related complex-sentence constructions with either initial or final before-, after-, until- and once-clauses can be predicted from the realisation of a few key morpho-syntactic properties of the respective adverbial and matrix clauses involved. The results support an analysis of complex-sentence constructions as meso-constructions that are not only specific about the subordinator and the positioning of the adverbial clause, but also retain “traces” of characteristic usage properties. |
|
|
Time Period: |
1960-1994 |
|
Date of Collection: |
1960-1994 |
|
Kind of Data: |
corpus data |
|
Methodology and Processing |
|
|
Sources Statement |
|
|
Data Sources: |
<p>British National Corpus (BNC), see the official BNC website at <a href="http://www.natcorp.ox.ac.uk/">http://www.natcorp.ox.ac.uk/</a>.</p><p> The BNC II, ‘world edition’, edited by Lou Burnard, published for the British National Consortium by the Humanities Computing Unit at Oxford University Computing Services in October 2000) is described in detail under <a href="http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml">http://www.natcorp.ox.ac.uk/archive/worldURG/index.xml</a>.</p> <p>The extracted words that are contained in this dataset only represent non-substantial portions of the BNC corpus. Therefore, the reuse (including redistribution) of these excerpts is permitted by the exceptions rules in IPR and database protection regulations, such as Fair use (USA cf. <a href="https://www.copyright.gov/fair-use/more-info.html" title="Fair use" target="_blank">US Copyright Act</a>), Fair dealing (UK; cf. <a href="https://www.gov.uk/guidance/exceptions-to-copyright" title="Fair dealing" target="_blank">Exceptions to copyright</a>), the <a href="http://data.europa.eu/eli/dir/1996/9/2019-06-06" title="Lawful users" target="_blank">EU Database Directive</a> (cf. article 8 Rights and obligations of lawful users), "lover, forskrifter, rettsavgjørelser og andre vedtak av offentlig myndighet" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§14" title="offentlige vedtak" target="_blank">§ 14 in Åndsverkloven</a>), "uvesentlige deler av databaser" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§24" title="uvesentlige deler av databaser" target="_blank">§ 24 in Åndsverkloven</a>), "sitatretten" (Norway; cf. <a href="https://lovdata.no/lov/2018-06-15-40/§29" title="sitatretten" target="_blank">§ 29 in Åndsverkloven</a>).</p> |
|
Data Access |
|
|
Notes: |
<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a> |
|
Other Study Description Materials |
|
|
Related Publications |
|
|
Citation |
|
|
Title: |
Beate Hampe & Stefan Th. Gries (2018), Syntax from and for discourse II: More on complex sentences as meso-constructions. Yearbook of the German Cognitive Linguistics Association/Jahrbuch der Deutschen Gesellschaft für Kognitive Linguistik (GCLA) 6: 115-142 |
|
Identification Number: |
10.1515/gcla-2018-0006 |
|
Bibliographic Citation: |
Beate Hampe & Stefan Th. Gries (2018), Syntax from and for discourse II: More on complex sentences as meso-constructions. Yearbook of the German Cognitive Linguistics Association/Jahrbuch der Deutschen Gesellschaft für Kognitive Linguistik (GCLA) 6: 115-142 |
|
Label: |
00_Readme_Hampe_Gries.txt |
|
Text: |
Readme file with general and methodological information about the study, as well as an overview of the data and files |
|
Notes: |
text/plain |
|
Label: |
01_Data_Hampe_Gries_kwic.txt |
|
Notes: |
text/plain |
|
Label: |
02_Data_Hampe_Gries_coding.txt |
|
Notes: |
text/plain |