YahiaML/TMDb-movies-data-investigation

Movie Data Analysis

Project Overview

The Movie Data Analysis project is an exploration of the TMDb (The Movie Database) dataset using Python. The goal is to gain insights into various aspects of the movie industry, such as popular genres, correlations between revenue and popularity, and the impact of factors like budget, stars, and directors on movie ratings.

Tools and Technologies Used

  1. Python: The entire project is implemented using the Python programming language for its versatility in data analysis and manipulation.

  2. Pandas: Pandas, a powerful data manipulation library, is utilized for handling and organizing the TMDb dataset. It facilitates easy analysis, cleaning, and visualization of the data.

  3. Matplotlib and Seaborn: These libraries are employed for creating visualizations to better understand patterns and trends in the dataset.

Key Features and Functionality

  1. Genre Analysis: The project explores the popularity of different movie genres over the years, providing insights into evolving audience preferences.

  2. Correlation Analysis: The project examines the relationship between movie revenue and popularity to determine whether high revenue goes together with high popularity.

  3. Vote Average Investigation: The project investigates the factors that influence a movie's vote average, including popularity and budget.

  4. Star and Director Ratings: The project identifies the stars and directors with the highest-rated movies, measured by both total votes and average vote.
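
The features above map onto a few standard Pandas operations. The following is a minimal sketch rather than the project's actual code: the sample rows are made-up placeholders, and the column names (`genres`, `release_year`, `popularity`, `revenue`, `director`, `vote_average`) are assumptions about how the dataset is laid out.

```python
import pandas as pd

# Made-up placeholder rows, NOT real TMDb data.
# Column names are assumed; adjust them to match the actual dataset.
movies = pd.DataFrame({
    "genres": ["Action", "Action", "Drama", "Drama", "Comedy", "Comedy"],
    "release_year": [2000, 2001, 2000, 2001, 2000, 2001],
    "popularity": [1.0, 3.0, 2.0, 1.5, 0.5, 4.0],
    "revenue": [100.0, 320.0, 180.0, 150.0, 40.0, 410.0],
    "director": ["A", "A", "B", "B", "C", "C"],
    "vote_average": [6.0, 7.0, 8.0, 7.0, 5.0, 6.0],
})

# Genre analysis: mean popularity per genre and year,
# then the most popular genre in each year.
genre_by_year = movies.pivot_table(
    index="release_year", columns="genres",
    values="popularity", aggfunc="mean",
)
top_genre_per_year = genre_by_year.idxmax(axis=1)

# Correlation analysis: Pearson correlation between revenue and popularity.
revenue_popularity_corr = movies["revenue"].corr(movies["popularity"])

# Director ratings: mean vote average per director, highest first.
director_ratings = (
    movies.groupby("director")["vote_average"]
    .mean()
    .sort_values(ascending=False)
)

print(top_genre_per_year)
print(round(revenue_popularity_corr, 3))
print(director_ratings)
```

A correlation close to 1 on real data would suggest that revenue and popularity move together, but it would not by itself show that one causes the other.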

Conclusion

The Movie Data Analysis project shows how Python and its data analysis libraries can be used to explore the TMDb dataset. It covers genre trends, the correlation between revenue and popularity, and the influence of stars and directors on ratings, and it may be a useful resource for movie enthusiasts and data analysts alike.

Limitations

  • The analysis may be affected by rows dropped during cleaning, by missing budget and revenue values, and by the removal of every genre listed after the first pipe (|) character, which leaves each movie with a single genre.
  • Users are encouraged to explore the dataset further and consider potential biases introduced during data cleaning.
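
To make these limitations concrete, here is a small sketch of the kind of cleaning described above and of how much information it discards. It is an illustration under assumptions, not the project's code: the sample rows are invented, the column names (`genres`, `budget`, `revenue`) are assumed, and treating a zero budget or revenue as a missing value is also an assumption.

```python
import pandas as pd

# Invented placeholder rows, NOT real TMDb data. Column names are assumed.
raw = pd.DataFrame({
    "genres": ["Action|Adventure", "Drama|Romance", "Comedy", None],
    "budget": [100, 0, 50, 20],
    "revenue": [300, 120, 0, 60],
})

# Drop rows without genres, then rows whose budget or revenue is zero
# (assumption: zero stands for a missing value).
cleaned = raw.dropna(subset=["genres"])
cleaned = cleaned[(cleaned["budget"] > 0) & (cleaned["revenue"] > 0)].copy()

# Keep only the text before the first pipe, discarding secondary genres.
cleaned["primary_genre"] = cleaned["genres"].str.split("|").str[0]

rows_dropped = len(raw) - len(cleaned)
print(f"Dropped {rows_dropped} of {len(raw)} rows")

# Alternative that keeps every genre: one row per (movie, genre) pair.
exploded = (
    raw.dropna(subset=["genres"])
    .assign(genres=lambda df: df["genres"].str.split("|"))
    .explode("genres")
)
print(sorted(exploded["genres"].unique()))
```

Comparing `rows_dropped` against the total row count, and the genres in `exploded` against `primary_genre`, gives a rough sense of how much the cleaning steps might bias the results.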
