The purpose of this report is to analyse how the music of the 100 greatest musical artists of all time, according to Rolling Stone Magazine, has endured over the past decade and what characteristics explain changes in popularity. The analysis is based primarily on the data from Spotify, with additional information from Wikipedia. My empirical strategy involves explaining the changes in engagement (as measured by the number of followers and popularity of the artist) with the characteristics of the music of that artist. I find that sadder songs, with explicit content and fewer spoken words, have endured better over the past decade, with genres such as heavy metal and hip hop gaining popularity.
The data used in this report comes from two sources. The first source is Spotify, which I accessed through the Spotify API. It provides information on the characteristics of each song of the artist, such as danceability, how many spoken words there are, energy, etc. I calculated the average of these indicators across all songs by artist. I also created a dummy variable indicating whether most of an artist’s tracks contain explicit content. Additionally, I accessed the data on each artist’s popularity and number of followers through the API and, based on this information, created new rankings which reflect the artist’s current popularity.
The second source is Wikipedia, which I accessed using Selenium. It provides information on the genre of the artist (Spotify API also provides this information, but the Wikipedia categories were more general and hence better suited for the analysis) and the year when they started their career. For each artist, I took the genre listed first in the information box and then changed it to a more general genre (eg. hip hop instead of political hip hop) or to a similar genre (eg. heavy metal instead of hard rock) when necessary. Given these changes, the genre variable is very general and may not reflect the exact musical style of the artists.
The final dataset contains artist-level information about the characteristics of their music, their current popularity and number of followers, and the change in their ranking over the past decade as measured by popularity and number of followers. It also contains information about the genre of the artist’s music and the year when they started their career.
Figures 1 and 2 depict the changes in rank (from the 2010 Magazine ranking) as measured by popularity and number of followers by artist, respectively. They are not precisely the same since popularity accounts for people who listen to an artist’s music but do not follow them. Nevertheless, the general trend is similar across both measures. Hip hop artists (Eminem, Jay-Z, Tupac) have become much more popular over the past decade, while 50s rock and roll artists (Little Richard, Bo Diddley, Chuck Berry) have become less popular.
To investigate which characteristics explain these changes in
popularity, I estimated two linear regression models. Figure 3 shows the
correlation matrix of the possible independent variables. I decided to
exclude mean loudness and mean acousticness from the equations since
they are strongly correlated with mean energy. I also excluded
categorical variables such as genre and key to avoid high dimensionality
and overfitting.
I estimated the same model twice using changes in rank as measured by popularity and the number of followers as the dependent variables. The models meet the assumptions of linear regression. Figure 4 shows the scaled coefficients of the two models with 95% confidence intervals. The coefficients suggest artists whose tracks score high one valence and speechiness (ie. are happy and have a lot of spoken words) have become less popular over the past decade. Moreover, artists whose majority of tracks are explicit have become more popular. Additionally, younger artists with more danceable tracks have likely become more popular - the coefficients for these variables are only marginally insignificant. The rest of the coefficients are not statistically significant.
Finally, Figure 5 depicts the average changes in rank by genre. Heavy metal and hip hop artists have become more popular, and blues and soul artists have become less popular. This pattern is consistent with the results of the linear models. Hip hop and heavy metal artists tend to have more explicit tracks, which are often sadder (hip hop artists often come from disadvantaged backgrounds, which their music reflects). Blues and soul artists tend to be older and produce happier but less danceable tracks (they are generally quite slow).