Datasets

Title Generation

Dataset obtained online from arXiv articles for the task of Abstract to Title Generation.

Pre-processing of COGNIMUSE (annotated music video dataset) for the task of Emotional Music Generation.

Obtains Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus.

Repository to make dataset with Speech-To-Text error by applying TTS and STT to text.

Extracts original and corrected essays from the FCE Corpus: XML to TXT format conversion.