We pre-processed the text in the transcripts and tokenized them using a Natural Language Toolkit to identify sentence endings and process the timestamps to match each statement First ...