Multi-feature late fusion of NLP results (from text normalization and n-gram processing) with OpenSMILE embedding results.
Uses NLP transcript normalization (see methods) together with OpenSMILE; otherwise similar to Martinc 2021. Same gist, but different data preparation.
- Applied n-gram processing to the input features
- Used WordNet to replace words with their root forms
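
A rough sketch of the pipeline described above, to make the steps concrete. Everything here is an assumption for illustration: the function names, the tiny `TOY_ROOTS` table (a hand-written stand-in for a WordNet lookup), and the weighted average used as the fusion rule. The notes do not say which fusion rule, n-gram range, or classifiers the paper actually used.

```python
from collections import Counter

# Hypothetical stand-in for a WordNet lookup; a real pipeline would
# likely query WordNet rather than use a hand-written table.
TOY_ROOTS = {"cookies": "cookie", "running": "run", "was": "be"}


def normalize(text):
    """Lowercase, strip edge punctuation, and map tokens to root forms."""
    tokens = [t.strip(".,!?;:") for t in text.lower().split()]
    return [TOY_ROOTS.get(t, t) for t in tokens if t]


def ngrams(tokens, n):
    """Return all contiguous n-grams of the token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_counts(text, max_n=2):
    """Count every 1..max_n gram of the normalized transcript."""
    tokens = normalize(text)
    counts = Counter()
    for n in range(1, max_n + 1):
        counts.update(ngrams(tokens, n))
    return counts


def late_fusion(text_prob, audio_prob, text_weight=0.5):
    """Combine per-modality scores (not raw features) by weighted average.

    The weighted average is an assumed fusion rule, chosen for simplicity.
    """
    return text_weight * text_prob + (1.0 - text_weight) * audio_prob


# Text branch: normalized transcript turned into n-gram count features.
counts = ngram_counts("The boy was running. Cookies!")

# Fusion step: the two probabilities are made-up placeholders for the
# outputs of a text model and an OpenSMILE-based acoustic model.
fused = late_fusion(text_prob=0.8, audio_prob=0.6)
```

The point of late fusion is that each modality is scored by its own model first, and only the scores are merged; the n-gram counts and the acoustic features never share a feature vector.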