Episodes Transcripts

In this section, we will tackle everything related with Natural Language Processing. We will talk about how we analyze song lyrics and episodes transcripts and then discuss the results!

Sentiment Analysis

The first type of analysis we ran through the episode transcripts, was a Sentiment Analysis. What is sentiment you may ask? Well when it comes to data science, the sentiment is a measure of how the word is perceived by people. Depending on the perception (positive or negative) each word is given a grade from 0 (very bad) to 9 (very good) a word is. For example "love" has a very high sentiment score, whereas "terror" has a very bad one.

So we used a labMT word data-base and their sentiment score to calculate the average sentiment score of each episode. This way we could have an idea of how Rick & Morty language would be perceived. We got the following results :

Average sentiment score per episode
Average Sentiment per seasons

So it appears that overall the sentiment score of each episode and season is just around the average. But the general reviews of Rick & Morty often shows that it's a pretty fun and happy show, so how could that be explained ?

There is a few things to consider, the first one being swearing, there is a lot of swearing in the show, and most of the time the sentiment score of a swear word isn't that good. Whereas the R&M audience might find this overuse of swearing pretty funny. So this is something that might have decreased the overall scores. But we will discuss swearing a lot more later on !
The second element to consider is also the word creativity is Rick & Morty, new words appear every episode, new puns, etc. All those new words are not taken into consideration when calculating sentiment scores, even though they are also well perceived and are a reason R&M is a highly rated show.

Then we can notice another trend while looking at the last numbers, over time the score is decreasing when it comes to Rick & Morty.
It might a hint, that overall the choices of words are changing and not in a good way. The words chosen by the writers are getting rougher over the course of time. But does that mean that the episodes are less appreciated?

Comparing Sentiment to IMDB Ratings

Sentiment Score vs. IMDB Ratings

To answer the last question we decided to fetch IMDB ratings of the show and to compare them to sentiment score to see if there were any correlation between the two.
And well, even though the sentiment score decreases over time, the ratings do not and they even reach their all-time high for the last season of the show. So maybe sentiment wasn't really the best way to analyze Rick & Morty language. But we'll give it one last chance :

The most positive character ?

We took a deeper look at the sentiment score for Rick & Morty especially. We ran the sentiment score algorithm only for their lines, to see who was contributing the most to the happiness of the show, and see who was the bummer.

Rick vs Morty sentiment scores over time

First, we can see that Rick & Morty's lines are aligned with the previous results we got, when their sentiment score decreases so does the overall shows'. Apart from that, both characters' way of expressing themselves is full of ups and downs. The only clear conclusion here is that the second half of the last season was the least sentimental one for Rick and Morty.

It's all about the words!!

Enough about sentiment, we clearly just realized that sentiment could not really give us the full taste of Rick & Morty. So we'll dive deeper into swear words and words creativity.

Most used words for Rick & Morty

A nice thing to do when you have access to the transcript is to look for the most frequent/important words. We did that by parsing transcripts and applying TF-IDF to their respective lines over all the episodes transcripts. TF-IDF is "a statistical measure used to evaluate how important a word is to a document in a collection or corpus". So we used TF-IDF most important words for Rick & Morty that we snapped into some schwifty word clouds:

Morty's word clouds

Morty's Top 3

ooh
poncho
bailed

As it turns out, the words with the highest TF-IDF scores describe quite well the characters. Morty's infamous 'ohh' is the most typical response of his - no matter if he's bored, happy, or terrified - his reaction usually comes down to emotionless 'ooh'. But then he apparently mentions "poncho" a lot too ?! And "bailed" isn't too surprising as Morty's word is often "Thanks for bailing me out Rick".

Rick's Top 3

burps
sauce
new
Rick's word clouds

For Rick, it's even more iconic. Maybe it's not a word but still - Rick burps all the time, and here we see that it was always carefully noted down in the transcript files. Rick also loves mentioning sauce and the world "new". The later being less suprising as he also invents or discovers new stuffs.

Most used words per seasons

We applied the same method mentioned for the last part to get the top words per season:

Season 1 Word Cloud

Top 10

meeseeks
needful
snuffles
bloom
jacob
pluto
dream
pancakes
doofus
science

Top 10

testicle
gary
jerrys
feels
purge
deer
doctor
poopybutthole
nathan
summers
Season 2 Word Cloud
Season 3 Word Cloud

Top 10

pickle
toxic
ants
citadel
wong
candidate
mechanical
campaign
manager
cornvelious

For each season we can see that the top words are heavily influenced by the theme of some episodes, for the first season words like meeseeks, dogs, and snuffles appear. Or even for the third season, clearly with the presence of one of the most famous quote of the show: "I'm Pickle Riiiiiiiick !".

Swearing Analysis

Rick & Morty is known for its rough language, so we decided to dig a little deeper into this, to find the most used swears, who uses the most and also their importance in the show's success & flow.

A little bit earlier, we introduced you to sentiment score, now let's talk about profanity score! We used the profanity check library to calculate it but what does it do? When given a string it analyzes it and returns a probability (score between 0 and 1) of the string to be offensive. So "F*** you!" would return 1, whereas "go to hell, you scum!" would return something around 0.8. So using this library on the transcript can tell you if they are more or less offensive.

Profanity score per episode
Profanity score per season

First we can notice that profanity score are pretty low on a scale to one, they are all around 0.12. This can be explained by the fact that we are calculating profanity score on average, and most of the transcript are not swears, so that would defenitely lower the score.
If we are now looking at scores over time they are all the same over the seasons, except for the last one who has slightly lower average, so that proably means less swear words. So if we think back at our sentiment analysis we can probably forget the hypothesis that stated that swear words would lower the score for the last season.

Comparing Profanity Scores & IMDB Ratings

Profanity Score vs. Ratings

As we did for the sentiment score let's compare profanity score and ratings to see if it actually matters. And it appears that it does, ratings and profanity score, for the most part, seems to be in sync, does that mean that the more swear words the better the episode? From our point of you, it's not the only parameters but yes it counts!

Swearing Contest !

It has become a tradition for us, to compare Rick & Morty's line. Let's do it again for swearing!

Sentiment Score vs. Profanity

The plot above gives an insight into how the vulgarity of Rick's and Morty's language was changing over time. On the x-axis, there are seasons (divided into halves: a and b) while the y-axis represents the profanity score. Morty's style is more or less consistent through-out 3 seasons with the peak somewhere between the end of season 2 and the beginning of season 3. When it comes to Rick, here the analysis becomes a bit more interesting. From the very beginning of the series till the first half of season 2 his profanity score fluctuates around the same values as Morty did. But in the second half of the last season, the number of swear words skyrockets, while Morty's score declines sharply. One could put forward a thesis that Rick's language became much vulgar under the influence of Morty, while Morty's temperament melted down.

Made Up Words!!

Another fact about Rick & Morty is their creativity when it comes to inventing new funny words, so let's see what we can find in there :

Songs

Aww Jeez, Rick & Morty songs, let's try and figure out what makes them so special !

Sentiment

Sentiment Score per songs

As it turns out, the sentiment score varies across the series' discography. The songs with the lowest scores are: 'Luppoo Luppoo' and 'I'm a Tax Attorney'. The first one consists only of made-up, meaningless words while in the second one a comedian is singing that no religion can be compared to him being a Tax Attorney - which also seems a bit emotionless. When it comes to the songs with the highest score, the winner is 'Handy Hands'. It was sung in the episode "Raising Gazorpazorp. When Morty was raising Morty Jr., he turned on the TV to a children show where a man was singing this tune. Later in the episode, the song was sung again by Morty to calm Morty Jr. down, which succeeded, and also made him sing along - could you imagine anything more sentimental than this?

Word Cloud

Songs Word Cloud

Top 10

schwifty
sung
butt
chaos
moonmen
dance
father
flaps
alien
summer

Conclusion

Wo weee ! So we've just taken a deeper look into Rick & Morty's transcripts. We've found that sentiment scores might be a little tricky to analyze a show such as R&M, as it contains a lot of swearing and puns that unfortunately sentiment scores cannot consider yet. Then we also looked at the swear words in the show, and we've discovered that it might be important for the fans even though their presence wasn't that important overall. We've also dug into the words of Rick & Morty and discovered that often they are influenced by their current adventures and that the main characters have their own famous lines!