Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

Microsoft Word - ITECH1103 Analytic Group Assignment - Semester 2 2018 1 | P a g e ITECH1103- Big Data and Analytics Group Assignment – Semester 3, 2018 Worth – 30% ANALYTIC REPORT (20%- Due Week 11...

1 answer below »
Microsoft Word - ITECH1103 Analytic Group Assignment - Semester 2 2018

1 | P a g e
ITECH1103- Big Data and Analytics Group
Assignment – Semester 3, 2018
Worth – 30%

ANALYTIC REPORT (20%- Due Week 11 Sunday
11:55pm) and PRESENTATION (10% - Due Week
10 in Tutorial Time)

Analytic Report:
Learning Outcomes Assessed: A3, K3, K6, and S2:
Purpose: The purpose of this task is to provide students with practical experience in
working in teams to write a Data Analytical report to provide useful insights, pattern and
trends in the chosen/given dataset. This activity will give students the opportunity to
show innovation and creativity in applying Watson Analytics and designing useful
visualization solutions and predictive solutions for various analytics problems.

Group Presentation: Week 10 (Scheduled Laboratory) Learning Outcomes
Assessed: K4, A1, A2, V1, V2
Purpose: The purpose of the oral presentation is to provide an opportunity for students
to present the results of DATA Analysis and to share this knowledge while practicing
their ve
al communication skills

Project Details: Consider you are working as a Content Analyst in an ABC online
multimedia company and your task for this analytical project is to use analytical tool (i.e.
IBM Watson Analytics) to explore, analyse and visualize the given dataset. This dataset
eflects details about different videos, uploaded during the period from 2006 to 2018.
The original dataset is extracted from the Kaggle.com and then modified and uploaded
onto https:
data.world/iamdilan/youtube-dataset. Your primary goal is to
download the modified dataset and provide different and interesting insights in the
lights of 20 guided questions listed below along with advance insights . The dataset could
e downloaded from the following link

Dataset source: https:
data.world/iamdilan/youtube-dataset
Data Dictionary:
Video_id Unique identity of video
Trending_date trending date of video
Title Name of video
Channel_title Name of channel
Category_id : see category list below (table)
Publish_date The date on which the video was published
Time_frame The time at which the video was uploaded/published
Publish_day_of_week Day of the week video published
Publish_country Country in which video published
Tags Tags
https:
data.world/iamdilan/youtube-dataset
https:
data.world/iamdilan/youtube-dataset

2 | P a g e
Views Number of views of video
Likes Number of likes of video
Dislikes Number of dislikes of video
Comments_count Number of comment for a video
Comments_disable Whether comment is disable or not
Ratings_disabled Whether ratings is disabled or not
Video_e
or_or_removed Whether video has e
or or it is removed

YouTube Video Category Id list:
2 - Autos & Vehicles
1 - Film & Animation
10 - Music
15 - Pets & Animals
17 - Sports
18 - Short Movies
19 - Travel & Events
20 - Gaming
21 - Videoblogging
22 - People & Blogs
23 - Comedy
24 - Entertainment
41 - Thriller
42 - Shorts
43 - Shows
44 - Trailers
25 - News & Politics
26 – How to & Style
27 - Education
28 - Science & Technology
29 - Nonprofits & Activism
30 - Movies
31 - Anime/Animation
32 - Action/Adventure
33 - Classics
34 - Comedy
35 - Documentary
36 - Drama
37 - Family
38 - Foreign
39 - Ho
or
40 - Sci-Fi/Fantasy
3 | P a g e
You are expected to present the data findings in a visual forms (i.e., charts and graphs).
This is a group assignment. You will complete it with your team (max 3 members
enrolled in the same laboratory). It is expected that each team member will contribute
equally in the project. Each team will turn in one joint document and give a joint
presentation in Timetabled Laboratory class in Week 10. In addition, each individual
team member will write a short reflection as part of the report. You will receive feedback
on the draft about presentation choices, content, analysis, and style.

The Questions
Your job is to examine the dataset and present it in a set of informative graphs and text
y answering the following questions.

Guided Questions for Dataset
1. What is the total number of uploaded videos in this dataset?
2. How many different types of uploaded categories are there?
3. What is the number of countries in this dataset?
4. What is the number of (unique) channels in this dataset?
5. Which are the top three countries, according to number of channels, in this dataset?
6. What is the lowest number of channel by country?
7. How many different unique channels are there in the US?
8. Provide a list of the top 10 viewed video titles with respect to each country.
9. Provide a list of least 10 viewed video titles with respect to each country.
10. How many years of uploaded videos are there in the data file?
11. How many uploaded videos have there been in the last month? (Select the last month
of the year)
12. In which year, were the most videos uploaded in GB?
13. Which hour had the most uploaded videos in this dataset? Is there any differences
etween countries? (time_frame)
14. What are the top 3 viewed categories in terms of number of uploaded videos?
15. What are the least 3 viewed categories in terms of number of uploaded videos?
16. Which video has the highest percentage of likes?
17. Which video has the highest percentage of dislikes?
18. Which day has the highest uploads of videos?
19. Which day has least uploads of videos?
20. What is monthly
eakdown of published videos?
Task 1- Background information
Write a description of the selected dataset and project, and its importance for the firm.
Information must be appropriately referenced. [1 Page]
4 | P a g e
Task 2 – Reporting / Dashboards
For your project, perform the relevant data analysis tasks by answering the above
questions and, identify the visualization and dashboards you need to develop for the
Content Manager of the indicated firm. [2-3 Pages]
Task 3 – Advanced Insights: In addition to the guided questions, it is expected to
provide at least five (5) insights of the data. These insights will be judged in terms of
quality and complexity.
Task 4 – Research
Justify why these BI reporting solution/dashboards are chosen in Task 2 (Reporting /
Dashboards) and why that dataset attributes are present and laid out in the fashion you
proposed (feel free to include all other relevant justifications).
Note: To ensure that you discuss this task properly, you must include visual samples of
the reports you produce (i.e. the screenshots of the BI report/dashboard must be
presented and explained in the written report; use ‘Snipping tool’), and also include any
assumptions that you may have made about the analysis in your Task2 (i.e. the report to
the content manager of the company). [1-2 Pages]
Task 5 – Recommendations for Content Manager
The Content Manager would like to improve the multimedia operations. Based on your
BI analysis and the insights gained from the dataset in the lights of analysis performed
in previous tasks, make some logical recommendations to the Content Manager, and
justify why/how your proposal could enhance company’s multimedia operations and
could assist in achieving operational/strategic objectives with the help of appropriate
eferences from peer- reviewed sources. [1-2 Pages]
Task 6 – Cover letter
Write a cover letter to the Content Manager with the important data insights and
ecommendation to achieve operational/strategic objectives [1 page]
Task 7 - The Reflection: Each Team member is expected to write a
ief reflection about
this project in terms of challenges, learning and contribution.
Other Tasks –
Please refer to marking scheme at the end of the assignment for other tasks and
expectations.
Report Submission:

• Hard-copy to tutors/lecturers assignment box in week 10. Double- sided
printing for the hard-copy is encouraged in order to save paper.
• You will also submit a 7-8 pages report (about 1500 words not counting cover
page and references) of this project. At least 15 references in your report must
e from peer-reviewed sources. Include any and all sources of information
including any person(s) you interviewed for this project.
• Please note that all references must adhere to APA style. See
http:
owl.english.purdue.edu/owl
esource/560/01 and
http:
owl.english.purdue.edu/owl
esource/560/01

5 | P a g e
http:
www.apastyle.org/ for details on how to format a report and how to cite
eferences. Make sure your follow formal report structure with cover page,
introduction, use of headings, subheadings, conclusion sand reference section.
• You are reminded to read the “Plagiarism” section of the course description. Your
essay should be a synthesis of ideas from a variety of sources expressed in your
own words. All reports must use the APA referencing style. University
Referencing/Citation Style Guide: The University has published a style guide to
help students co
ectly reference and cite information they use in assignments
(American Psychological Association (APA) citation style,
http:
www.ballarat.edu.au/aasp/student/learning_support/generalguide/pri
n t/ch06s04.shtml or Australian citation style
• Reports are to be presented in hard copy in size 12 Arial Font and double spaced.
Your report should include a list of references used in the essay and a
ibliography of the wider reading you have done to familiarize yourself on the
topic.
• A passing grade will be awarded to assignments adequately addressing all
assessment criteria. Higher grades require better quality and more effort. For
example, a minimum is set on the wider reading required. A student reading
vastly more than this minimum will be better prepared to discuss the issues in
depth and consequently their report is likely to be of a higher quality. So before
submitting, please read through the assessment criteria very carefully.
http:
www.apastyle.org
http:
www.ballarat.edu.au/aasp/student/learning_support/generalguide/prin
http:
www.ballarat.edu.au/aasp/student/learning_support/generalguide/prin
Answered Same Day Jan 15, 2021 ITECH1103

Solution

Sundeep answered on Jan 19 2021
143 Votes
PowerPoint Presentation
YouTube analysis
Using IBM Watson analytics
Analysis of data
The dataset is a very raw dataset with multiple values and atti
utes that are used in the analysis of the project
There are multiple duplicate values that would be observed
2
The 55885 unique videos are published in 4 countries during the time period from 2006 to 2018
The 4 countries include Canada, US, France and GB
3
There are 12360 channels which are divided into 18 categories
The categories may overlap in the countries
4
There are 3 countries, USA, France and Canada that contribute maximum videos
GB has been the country that has uploaded least number of channels
5
The top 10 titles viewed by FRANCE are:
 
Malika LePen : Femme de Gauche – Traile
LA PIRE PARTIE ft Le Rire Jaune, Pie
e Croce, Fabien Olicard, Nad Rich' Hard, Max Bird, Studio Vrac
DESSINS ANIMÉS FRANÇAIS VS RUSSES 2 - Daniil le Russe
PAPY GRENIER - METAL GEAR SOLID
QUI SAUTERA LE PLUS HAUT ? (VÉLO SKATE ROLLER TROTTINETTE)
STRANGER JOKES : Jokes de Papa avec les teens de Stranger Things
De retour dans le Manoir hanté avec le Grand JD !!
T'es qui toi ? Squeezie, le youtubeur aux 4 milliards de vues - Salut les Te
iens
ON VOUS DÉVOILE NOTRE VRAI SALAIRE
Benzema balance ses dur vérités Deschamps et Les bleus Dans le CFC !
We are using France’s data since France has the max number of uploads among the 4 nations
The category and the upload reason could be understood by analysing the videos
6
. The monthly
eakdown of videos is as given:
Jan – 8308
Feb – 7640
March – 8294
April – 6608
May – 7791
June – 3163
July – 17
August – 15
Sept – 35
Oct – 40
Nov – 5580
Dec – 8397
There are different months and different occasions due to which people upload videos on the social media and YouTube. There may be certain reasons like Christmas in December and holidays in January due to which the video count upload by people on the YouTube is high and it leads to more interaction. There are some months in the year where there are very less uploads on the internet. Such months include July, August etc
7
insights
Entertainment, People’s blog and Sports are the main topics of interest that have been found out from analysis, these can be further
oken down into multiple sections such as comedy, action, ho
or, cricket, soccer, baseball and the types of blogs that people write. Further analysis can help understand what kind of movies work in a country and what is the prefe
ed Genre
The uploads differ from country to country and this may be due to several factors like elections, trade, entertainment industry, economic condition and growth
The working hours of people could be one reason that maximum uploads are done during the early evening of 4pm – 5pm
The likes and the comment count of GB is maximum among the countries while the upload count has been less
The uploaded data is only of the developed nations and no data has been provided for the developing...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here