The function standardises treaty texts by removing punctuation and markers and by splitting each text into its articles and annexes.

Usage

standardise_treaty_text(textvar)

standardize_treaty_text(textvar)

Arguments

textvar

A text variable containing treaty texts, one text per element.

Value

A list of the same length as textvar, in which each element holds the sections of the corresponding treaty text.

Details

Treaty texts are not always formatted consistently when imported into R. Some contain paragraph markers, for example, while others arrive as a single chunk of text. standardise_treaty_text() cleans and annotates these texts so that information about their clauses can be retrieved at a later stage with the retrieve_clauses() function.
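To give a rough sense of what this kind of standardisation involves, the base R sketch below splits a single text at "Article" and "Annex" headings and then strips punctuation. It is an illustration only and not the package's implementation: split_into_articles() is a hypothetical helper, and both the heading pattern and the sample text are assumptions made for the example.

```r
# Hypothetical helper for illustration; NOT how standardise_treaty_text() is implemented.
split_into_articles <- function(text) {
  # Put a line break before every "Article <number>" or "Annex <id>" heading.
  # Assumption: headings follow this simple pattern, which real treaties may not.
  marked <- gsub("\\s*\\b(Article [0-9]+|Annex [A-Z0-9]+)\\b", "\n\\1",
                 text, perl = TRUE)
  # Split at the inserted breaks and drop empty pieces.
  parts <- strsplit(marked, "\n", fixed = TRUE)[[1]]
  parts <- parts[nzchar(trimws(parts))]
  # Remove punctuation, then collapse repeated whitespace.
  parts <- gsub("[[:punct:]]", "", parts)
  trimws(gsub("\\s+", " ", parts))
}

# Made-up sample text, used only to exercise the helper.
treaty <- paste(
  "Preamble. The Parties agree as follows.",
  "Article 1 Definitions apply.",
  "Article 2 Each Party shall report annually.",
  "Annex I List of species."
)
sections <- split_into_articles(treaty)

# Applying the helper over several texts yields a list as long as the input.
all_sections <- lapply(c(treaty, treaty), split_into_articles)
```

A pattern this simple also splits at in-text cross-references such as "pursuant to Article 1", which is one reason a dedicated function is preferable to ad hoc regular expressions.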

Examples

if (FALSE) {
  # Not run: standardise a random sample of 30 agreement texts from manyenviron
  standardise_treaty_text(sample(manyenviron::texts$AGR_TXT$Text, 30))
}