News

The IMDb data was read from a public S3 bucket. PySpark was used for the analysis because the datasets are too large to process in memory. The use of Pandas and Matplotlib libraries ...