Project description You work as an analyst for the telecom operator Megaline. The company offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget. Description of the plans Surf Monthly charge: $20 500 monthly minutes, 50 texts, and 15 GB of data After exceeding the package limits: - 1 minute: 3 cents - 1 text message: 3 cents - 1 GB of data: $10 Ultimate Monthly charge: $70 3000 monthly minutes, 1000 text messages, and 30 GB of data After exceeding the package limits: - 1 minute: 1 cent - 1 text message: 1 cent - 1 GB of data: $7 Instructions on completing the project Step 1. Open the data file and study the general information File path: /datasets/megaline_calls.csv /datasets/megaline_internet.csv /datasets/megaline_messages.csv /datasets/megaline_plans.csv /datasets/megaline_users.csv Step 2. Prepare the data Convert the data to the necessary types Find and eliminate errors in the data Explain what errors you found and how you removed them. For each user, find: The number of calls made and minutes used per month The number of text messages sent per month The volume of data per month The monthly revenue from each user (subtract the free package limit from the total number of calls, text messages, and data; multiply the result by the calling plan value; add the monthly charge depending on the calling plan) Step 3. Analyze the data Describe the customers' behavior. Find the minutes, texts, and volume of data the users of each plan require per month. Calculate the mean, variance, and standard deviation. Plot histograms. Describe the distributions. Step 4. Test the hypotheses The average revenue from users of Ultimate and Surf calling plans differs. The average revenue from users in NY-NJ area is different from that of the users from other regions. You decide what alpha value to use. Explain: How you formulated the null and alternative hypotheses. What criterion you used to test the hypotheses and why. Step 5. Write an overall conclusion Description of the data Megaline rounds seconds up to minutes, and megabytes to gigabytes. For calls, each individual call is rounded up: even if the call lasted just one second, it will be counted as one minute. For web traffic, individual web sessions are not rounded up. Instead, the total for the month is rounded up. If someone uses 1025 megabytes this month, they will be charged for 2 gigabytes. The users table (data on users): user_id — unique user identifier first_name — user's name last_name — user's last name age — user's age (years) The calls table (data on calls): duration — call duration (in minutes) user_id — the identifier of the user making the call The messages table (data on texts): The internet table (data on web sessions): The plans table (data on the plans): plan_name — calling plan name usd_monthly_fee — monthly charge in US dollars minutes_included — monthly minute allowance messages_included — monthly text allowance mb_per_month_included — data volume allowance (in megabytes) usd_per_minute — price per minute after exceeding the package limits (e.g., if the package includes 100 minutes, the 101st minute will be charged) usd_per_message — price per text after exceeding the package limits usd_per_gb — price per extra gigabyte of data after exceeding the package limits (1 GB = 1024 megabytes)