在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称:pritamd/SoccerTweetAnalysis开源软件地址:https://github.com/pritamd/SoccerTweetAnalysis开源编程语言:Jupyter Notebook 100.0%开源软件介绍:SoccerTweetAnalysisThis is an assignment from "Big Data Integration and Processing" Course from Coursera. The "Big Data Integration and Processing" is the 3rd course for the Big Data Specilization. As the Sports Analyst, you are very interested in reporting on the countries with the most popularity in Twitter. So a good way to approach this problem would be to find which countries were mentioned the most in the tweets in your dataset and to analyze what words are being used the most in these tweets. In addition to the CSV file you just exported from MongoDB, we give you a small dataset with the codes and names of some countries. To see this additional dataset, open the following file: Downloads/big-data-3/final-project/country-list.csv To get you started, we have prepared a Jupyter notebook template, and started a SparkSQL context for you. Please open the notebook in: Downloads/big-data-3/final-project/SoccerTweetAnalysis.ipynb. You will use this notebook to answer the questions below. So let’s get started. Question 1: As a Sports Analyst, you are interested in how many different countries are mentioned in the tweets. Use the Spark to calculate this number. Note that regardless of how many times a single country is mentioned, this country only contributes 1 to the total. Question 2: Next, compute the total number of times any country is mentioned. This is different from the previous question since in this calculation, if a country is mentioned three times, then it contributes 3 to the total. Question 3: Your next task is to determine the most popular countries. You can do this by finding the three countries mentioned the most. Question 4: After exploring the dataset, you are now interested in how many times specific countries are mentioned. For example, how many times was France mentioned? Question 5: Which country has the most mentions: Kenya, Wales, or Netherlands? Question 6: Finally, what is the average number of times a country is mentioned? |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论