Regular expression instructions

Regular expressions (regex) is a very useful means of working with serial data which repeats similar information using similar formats.

Regexr is a great place to learn about and try out regular expressions.

Contents

Using regex in Oxygen

You can use regex in the Find/Repace in Files tool when you enable the Regular Expression option in it. Some examples of commonly used regex codes:

To put <persName> around passenger names in a list: find Mr. [A-Z][a-z, 0-9]+, replace with <persName>$&</persName>.

Select all, copy, and paste results into atom text editor.
Open find and replace. Click the Regex option, then use this line to remove some of the results: Find XPath location: .+\nStart location: .+\nEnd location: .+\n\nSystem ID: Replace with (leave empty). Click Replace All.
Remove the file location that precedes the issue date: Find /Users/whanley/GitHub/DEG-content/ (this will be different on your computer–just select everything that comes before the date filename) Replace with (leave empty). Click Replace All.
Find .xml\nDescription: , Replace with \t. Replace All.
You’ll have a bit of garbage left over at the beginning and end of the file. Delete this. Now you will have a tab-separated, two-column table that you can paste into a spreadsheet.

Recipe to clean count results, giving a single column response: \n.+\nDescription:

Say you are trying to make a table of the results that you exported from Oxygen.

Import these results into a Word document. Then use Edit > Find > Advanced Find and Replace.