Caption text is now returned clean, with all tags removed. Cue text can contain tags such as:
- timestamp tags: <00:19.000>
- class tags: <c.classname>text
- and others...
Important: every tag present in the cue text is currently removed, with no exceptions. For example, <b> is removed as well.
A new attribute, raw_text, is also available on captions to retrieve the text with its tags left in place.
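
To illustrate the difference between the cleaned text and raw_text, here is a minimal sketch using only Python's standard library. It is not the library's actual implementation: the function name strip_cue_tags and the regular expression are assumptions made for this example, which simply treats anything between "<" and the next ">" as a tag.

```python
import re

# Assumed pattern for illustration: anything from "<" up to the next ">"
# counts as a tag (timestamp tags, class tags, <b>, closing tags, ...).
TAG_PATTERN = re.compile(r"<[^>]*>")


def strip_cue_tags(raw_text: str) -> str:
    """Return raw_text with every <...> tag removed (hypothetical helper)."""
    return TAG_PATTERN.sub("", raw_text)


# Stand-in for the value the raw_text attribute would hold.
raw_text = "<c.loud>Hello</c> <00:19.000><b>world</b>"

# Stand-in for the cleaned caption text.
text = strip_cue_tags(raw_text)
print(text)  # Hello world
```

Because the pattern does not distinguish between tag types, <b> is stripped together with the timestamp and class tags, which matches the behavior described above.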