The function standardises treaty texts by removing punctuation and markers and by splitting each text into articles and annexes.
Details
Treaty texts do not always share the same format when imported into R.
Some treaty texts, for example, contain paragraph markers while others
come as a single chunk of text.
standardise_treaty_text() facilitates the cleaning and annotation of
these treaty texts so that information about clauses can be retrieved
at a later stage with the retrieve_clauses() function.
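
The kind of cleaning described above can be illustrated with a minimal base R sketch. It is not the implementation of standardise_treaty_text(). The helper split_treaty_sketch() is hypothetical, and the marker style (<p> tags) and the heading patterns ("Article" followed by a number, "Annex" followed by a label) are assumptions made only for this example.

```r
# Hypothetical helper, NOT part of the package: a rough illustration of
# standardising a treaty text and splitting it into articles and annexes.
split_treaty_sketch <- function(text) {
  # Replace paragraph markers with a space (assumes <p>...</p> style markers)
  text <- gsub("</?p>", " ", text)
  # Remove punctuation
  text <- gsub("[[:punct:]]", "", text)
  # Collapse runs of whitespace (including line breaks) into single spaces
  text <- trimws(gsub("\\s+", " ", text))
  # Put a line break before each assumed heading, then split on line breaks.
  # Limitation: in-text cross-references such as "see Article 1" also split.
  marked <- gsub("(Article [0-9]+|Annex [A-Z0-9]+)", "\n\\1", text)
  parts <- trimws(strsplit(marked, "\n", fixed = TRUE)[[1]])
  # Drop the empty element created before the first heading
  parts[nzchar(parts)]
}

# Made-up example text, with paragraph markers and an annex
raw <- paste0(
  "<p>Article 1 The Parties agree to cooperate.</p>",
  "<p>Article 2 This treaty enters into force upon signature.</p>",
  " Annex I List of protected species."
)
articles <- split_treaty_sketch(raw)
print(articles)
```

Each element of the resulting character vector holds one article or annex, which is the kind of structure that makes it possible to look up individual clauses later.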