

list files, the format of these files are not preferred for data analysis purposes. To work on IMDb, we first pulled the raw data from the large and enormous dataset of IMDB, the data which we extracted was raw and unclean, so the next step was to clean the data and make it structured to be used easily for analysis.Īfter cleaning and making the data unstructured, we arranged the data in the desired and suitable format for the analysis to be performed easily.Īfter the data has been cleaned and making the data unstructured into the desired format, we used algorithms to perform linear regression analysis and correlation analysis on the files generated.ĭata Fetching, Cleaning the data and converting it into structured data Our project also does the analysis on IMDB to predict the best director, actor, and movie. The project then examines the data set of IMDB and predicts the box office success of the movie. Our project takes the raw and tarnished data, cleans it up and makes sure to provide us with a clean and structured data for easy analysis. Looking into this matter, IMDB has gained lot of popularity and has been successful for providing the reviews to the users. In these recent times of Covid-19 pandemic, people are staying at home and looking for an entertainment unit to pass their time and have fun without getting bored at home. This has been made possible because of easy access to the large and packed data which can be accessed in a very secure manner.
#Imdb raw data set movie#
IMDB is one of such database that can provide us with an enormous information of every movie or TV show.ĭata Analytics is a process that is providing software tools to analyze, process and extract data from a very large data set with which normal working tools cannot deal easily.

In this project we are pulling the data from the IMDBs database and predicting the best movie, director and actor by analyzing the votes given by the people and Facebook likes all over the world.įirst, in these recent times of Covid-19 pandemic we all need a source of entertainment unit or a platform to let go of our boredom at home. After all the votes given by the people all over the world, IMDb calculates and tallies all the votes and rates the movie according to that. They are given the option to rate them from a scale of 1 to 10. People who registered themselves in IMDb have all the access to rate the TV program or the movie according to their likes or dislikes. People can also give their personal votes to their favorite movie or actor. People can access to the database and take a look at the information about movies, TV programs and video games. IMDb is a database, which is widely used all over the world and is the largest database which shows the data in relation to video games, movies, directors, actors, casts and TV programs. Due to this, theres a heavy demand or popularity of online database which can provide with a database of movies, directors and actors. People are using multiple online platforms for their entertainment purpose, such as IMDB, Instagram, youtube and so on. Nowadays, electronic devices are available at affordable prices. People are staying at home due to the pandemic and making use of the internet more frequently than before. Internet has become universal now days, especially in these recent times of covid-19 pandemic. Movies are kept in theatres for about two weeks or a month, after that time period, movies are removed from theatres and marketed on several media platforms. Movies are made to be shown or projected on big screens at movie theatres. Movies can be of different genres, for example, some people like funny movies, so they can watch hilarious movies, whereas some people like suspense movies, so they watch thriller movies and so on. People in every part of the world watch movies as a type of entertainment, a way to enjoy their day and have fun with their loved ones. Predictive Modelling on IMDB’s Movie DataĬompute Science and Engineering SRM Institute of Science and TechnologyĪssistant Professor, Computer Science and EngineeringĪbstract:- Movies, also known as films, are a type of videography or cinematography which uses moving pictures and sound to portrait a story about something or tell/teach people something.
