Skip to content

A variety of machine learning projects based the Transformer model, including: Song Recognition, MIDI Song Extender, Poem Generation, and Sentiment Analysis

Notifications You must be signed in to change notification settings

umd-fire-coml/2022-t2-transformer

Repository files navigation

Augemented Song Recognition Project

by Spencer Lutz, Rishi Keshav Pradeep, and Saatvik Vadlapatla

Our project is a song recognition model which has the ability to take in augmented snippets of songs and predict what the actual song is. We built the dataset used to train the model by creating web crawler to pull songs from the iTunes library. The end user interacts with a React app. This UI allows the user to pick from a list of songs and apply augmentations to it such as pitch, speed, and noise. Using the librosa library, a random triplet of STFTs(Short-Term Fourier Transform) are generated for the chosen audio. The STFT data is then split into patches and then fed into the transformer sequentially. This data is passed through a MLP layer which generates an embedding. This embedding is compared to the other songs in the dataset to find the one which is most similar.

Go to our project here.

MIDI Song Extender

by Steven Zhang, Rahul Dagar, and Kofi Boateng

Our project is a MIDI generation model that takes in the beginning of a MIDI file as a sample and generates original music based off of the given sample. We use an encoder package to compress MIDIs into integer arrays to feed into the model. One note is encoded by 2 integers representing note type and duration. The model takes an input of 50 encoded notes and predicts the next note. To generate more, we loop this process by appending the generated note to the array and feeding the most recent 50 notes back into the model. To train the model, we used the adl-piano-midi dataset.

Check out our HuggingFace demo here!

You can also take a look by watching this demo video.

See our project details here.

Our Subprojects

Poem Generation

This project is a poem generator model. User will input the start of the poem i.e. a word or a sentence and the model will auto generate the next line of the peom. This model is based on distilGPT2 and finetuned with a Gutenberg Poetry Corpus.

See our project details (Poem Generation).

You can also take a look by watching this demo video.

Sentiment Analysis

This project is a sentiment analysis model. User will input a sentence and the model will output the sentiment (Positive or Negative) of the input sentence. This model is based on distilBERT and DistilRoberta, and finetuned with SST2 dataset.

See our project details (Sentiment Analysis).

You can also take a look by watching this demo video.

Combined GUI

We have a combined GUI for all of our three projects.

See details here

About

A variety of machine learning projects based the Transformer model, including: Song Recognition, MIDI Song Extender, Poem Generation, and Sentiment Analysis

Resources

Stars

Watchers

Forks

Packages

No packages published