IMDB Top 250 outliers

On the IMDb top 250, you normally see a correlation between the number of votes and the rating for a movie. Better rated movies are more watched. The outliers are interesting.

IMDb: Correlation between number of votes and rating

The movies that are popular despite not having a high rating are:

I can understand why The Sixth Sense, Pirates of the Caribbean and especially The Matrix are on this list — geeks would have watched these and voted on IMDb, though their voting need not have been high. But why are Gladiator and Sixth Sense on that list?

Movies that are highly rated, but not as popular are:

Seven Samurai and The Good, The Bad, The Ugly probably didn’t get the votes they deserve because they’re written in their Japanese and Mexican names on IMDb. I hadn’t seen them for a long time for the same reason. As for The Godfather, I personally think it’s just overrated. But Rear Window? That’s a surprise. Hitchcock thriller with all the classic elements…

Another correlation is between the rating and the year of the movie. Early movies get lower ratings than recent movies. Technique could be the reason, but I doubt it. In any case, some movies stand out of their time.

IMDb: Correlation between rating and year of movie

I haven’t seen Metropolis or M. But among the others, I think Citizen Kane is the one that deserves to stand out, if only for portraying the anti-hero, and for not having a happy ending. The Shawshank Redemption was a bit of a surprise. Few people that I know have heard of it. And yet, there it is, right on top.

  1. Madhu says:

    Some analysis this:) were u consulting for IMDB sometime?:)

  2. S Anand says:

    Nah, just had some time on my hands early this morning!

  3. ritzkini says:

    😀 cool anal ! but..how did you get the raw data ?? is my question !

  4. S Anand says:

    Just cut and paste data on the IMDb top 250 page on Excel!

  5. Shankar says:

    Another conclusion one could draw: The conclusion that a movie is good seems representative of public opinion due to the high number of votes, but that a movie is bad is only the conclusion of a few, and hence may not be representative of public opinion. Is this a correct conclusion?

  6. S Anand says:

    I wouldn’t say that from this data. These represent the top rated movies on IMDb, i.e. the top 250 movies EVER. We can only say that high ratings are contributed by both large and small samples. Even that would be a weak statement because IMDb has a “minimum number of votes” cutoff for the top 250.

  7. Govar says:

    Another interesting thing I’ve noticed in IMDB ratings is that a lot of votes are meant either to push up or pull down. For example, almost every movie will have more number of people rating 1 than ratings 2,3 or 4. Which coudl mean only one thing: They want to bring down the rating. Similarly, lot of people go to the other extreme – 10 – instead of say 8 or 9.

  8. S Anand says:

    That’s an interesting observation… did you get that out of the raw IMDb data? I’ll have a look at that. Should be interesting analysis to do.

  9. fdf says:

    imdb top 250 is not good, don;t listen to it, pp;l vote up a film, and vote films down! godgather and LOTR is over rated!