Movie Data Analysis

Project Overview

The Movie Data Analysis project is an exploration of the TMDb (The Movie Database) dataset using Python. The goal is to gain insights into various aspects of the movie industry, such as popular genres, correlations between revenue and popularity, and the impact of factors like budget, stars, and directors on movie ratings.

Tools and Technologies Used

Python: The entire project is implemented using the Python programming language for its versatility in data analysis and manipulation.
Pandas: Pandas, a powerful data manipulation library, is utilized for handling and organizing the TMDb dataset. It facilitates easy analysis, cleaning, and visualization of the data.
Matplotlib and Seaborn: These libraries are employed for creating visualizations to better understand patterns and trends in the dataset.

Key Features and Functionality

Genre Analysis: The project explores the popularity of different movie genres over the years, providing insights into evolving audience preferences.
Correlation Analysis: Examining the relationship between movie revenue and popularity to understand if high revenue correlates with high popularity.
Vote Average Investigation: Investigating the factors influencing movie vote averages, including the impact of popularity and budget.
Star and Director Ratings: Identifying the stars and directors with the highest-rated movies, both in terms of total votes and average votes.

Conclusion

The Movie Data Analysis project showcases the capabilities of Python and data analysis libraries in exploring and extracting valuable insights from the TMDb dataset. The project provides a comprehensive analysis of the movie industry. Whether you are interested in understanding genre trends, correlations between revenue and popularity, or the influence of stars and directors, this project serves as a valuable resource for movie enthusiasts and data analysts alike.

Limitations

The analysis might be impacted by dropped rows, missing budget and revenue values, and the removal of values after the pipe (|) characters in genres.
Users are encouraged to explore the dataset further and consider potential biases introduced during data cleaning.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
LICENSE		LICENSE
README.md		README.md
TMDb_movies_data_investigation.ipynb		TMDb_movies_data_investigation.ipynb
tmdb-movies.csv		tmdb-movies.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Data Analysis

Project Overview

Tools and Technologies Used

Key Features and Functionality

Conclusion

Limitations

About

Releases

Packages

Languages

License

YahiaML/TMDb-movies-data-investigation

Folders and files

Latest commit

History

Repository files navigation

Movie Data Analysis

Project Overview

Tools and Technologies Used

Key Features and Functionality

Conclusion

Limitations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages