Most Popular Youtube Channel Prediction¶

Motivation¶

Problem¶

Analyzing the general trends of popular youtube channel cateogories to determine whether or not a youtube channel will be popular can be hard, especially for those interested in starting a popular youtube channel. It becomes incredibly difficult to predict this given the frequent changes in societal interests.

Solution¶

Youtube is generally understood as one of the largest streaming platform services in the world, their platform acts as a rich database platform that provides insight into the popularity of a youtube channel and the type of content the channel creates. The primary goal of this project is to define the link between the primary content of the channel and the counts that Youtube has to measure the popularity of a channel's videos and account.

Impact¶

If the following project is successful in defining the link, it could provide insight and predict how popular specific videos or content of a channel might be. This is helpful is assessing interest in a channel before it it released so that content creators know what type of videos to create to cater to the interests of their audience.

Dataset¶

The Kaggle Dataset of Most Subscribed 1000 Youtube Channels will be used to assess the specific characteristics of each youtube channel.

The dataset notes each channel's:

  • Rank
  • Youtube Channel Name
  • Total Subscribers
  • Video Views
  • Video Count
  • Category of Channel
  • Start Date

The table provides a rank number, using the ranking system provided our project looks to use the ranks and predict which content categories are therefore the most popular.

Rank Youtube Channel Name Total Subscribers Video Views Video Count Category of Channel Start Date
1 T-Series 234,000,000 212,900,271,553 18,515 Music 2006
2 Youtube Movies 161,000,000 0 0 Film & Animation 2015
3 Cocomelon-Nursery Rhymes 152,000,000 149,084,178,448 846 Education 2006
4 SET India 150,000,000 137,828,094,104 103,200 Shows 2006

Potential Problems¶

Rank is not explicity defined as one of the variables for the channels, given this it is difficult to assert that Rank is objective rather than subjective. This can pose an issue given that we are not able to surely define that the rank of a channel is indeed ranked at a point that reflects the values of the public. How do we know that this is not the ranking preferences of a single person? We might be able to eliminate the possibilty of bias and subjectivity by looking at other sources that rank the top youtube channels

Method¶

We will look to graph the correlation between popularity of the channel and the type of content that it produces. This will be compared against the various measures that youtube has to assess the performance of the channel, we likely will average the values provided and assess how the channel's rank compares to these values. Given that the correlation graphs between a specific category of content is strong with the popularity of the channel, we can make predictions about trends for future channel and their content and how they will perform