Automated data extraction, term recognition
The tool GTRI was using for Natural Language Processing was GATE, which an open source project (developed in Java) based at Sheffield. They have a set of workshop materials online at https://gate.ac.uk/wiki/
The equivalent to GATE in Python is NLTK. They don't have a set of workshops, but there is a good book online that can be used to teach yourself the tools. See http://www.nltk.org/ and http://www.nltk.org/book/ (don't buy the print edition until they've finished the revision).