Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

ECA template ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 1 of 6 ECA – July Semester 2019 ANL251 End-of-Course Assessment – July Semester 2019 XXXXXXXXXXPython...

1 answer below »
ECA template
ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 1 of 6
ECA – July Semester 2019







ANL251

End-of-Course Assessment – July Semester 2019

XXXXXXXXXXPython Programming

___________________________________________________________________________

INSTRUCTIONS TO STUDENTS:

1. This End-of-Course Assessment paper comprises SIX (6) pages (including the cover page).

2. You are to include the following particulars in your submission: Course Code, Title of the
ECA, SUSS PI No., Your Name, and Submission Date.

3. Late submission will be subjected to the marks deduction scheme. Please refer to the
Student Handbook for details.


IMPORTANT NOTE

ECA Submission Deadline: 4 September 2019, 12 noon
ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 2 of 6
ECA – July Semester 2019
ECA Submission Guidelines

Please follow the submission instructions stated below:

A - What Must Be Submitted

You are required to submit the following TWO (2) items for marking and grading:

• A report (you should submit this item first as it ca
ies the highest weightage).

• The Python notebook file & the used datasets


B - Submission Deadline

• The TWO (2) items of report, Python code and used datasets are to be submitted by 12
noon on the submission deadline.

• You are allowed multiple submissions till the cut-off date for each of the TWO (2) items.

• Late submission of any of the TWO (2) items will be subjected to mark-deduction scheme
y the University. Please refer to Section 5.2 Para 2.4 of the Student Handbook.


C - How the TWO (2) Items Should Be Submitted

• The Report: submit online to Canvas via TurnItIn (for plagiarism detection).
o please ensure that your Microsoft Word document is generated by Microsoft Word 2007
or higher.
o the report must be saved in .docx format.

• The Python code and used datasets:
o write all your code in one file (.ipynb file)
o compress the .ipynb file and used datasets as a .zip file
o submit the .zip file to Canvas

Avoid using a public WiFi connection for submitting large video files. If you are using public
wireless (WiFi) connection (e.g. SG Wireless at public areas), you might encounter a
eak in
the connection when sending large files.

Please verify your submissions after you have submitted the above TWO (2) items.


D – Please be Aware of the Following:

Submission in hardcopy or any other means not given in the above guidelines will not be
accepted. You do not need to submit any other forms or cover sheets (e.g. form ET3) with your
ECA.

ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 3 of 6
ECA – July Semester 2019
You are reminded that electronic transmission is not immediate. The network traffic may be
particularly heavy on the date of submission deadline and connections to the system cannot be
guaranteed. Hence, you are advised to submit your work early. Canvas will allow you to
submit your work late but your work will be subjected to the mark-deduction scheme. You
should therefore not jeopardise your course result by submitting your ECA at the last minute.

It is your responsibility to check and ensure that your files are successfully submitted to
Canvas.


E - Plagiarism and Collusion

Plagiarism and collusion are forms of cheating and are not acceptable in any form in a
student’s work, including this ECA. Plagiarism and collusion are taking work done by others
or work done together with others respectively and passing it off as your own. You can avoid
plagiarism by giving appropriate references when you use other people’s ideas, words or
pictures (including diagrams). Refer to the APA Manual if you need reminding about quoting
and referencing. You can avoid collusion by ensuring that your submission is based on your
own individual effort.

The electronic submission of your ECA will be screened by plagiarism detection software. For
more information about plagiarism and collusion, you should refer to the Student Handbook
(Section XXXXXXXXXXYou are reminded that SUSS takes a tough stance against plagiarism or
collusion. Serious cases will normally result in the student being refe
ed to SUSS’s Student
Disciplinary Group. For other cases, significant mark penalties or expulsion from the course
will be imposed.



Find a news article (that is of interest to you) from any trusted sources published in the last
month. Formulate a research question in order to support, object to, or expand on the claim(s)
in the selected news article. Find two (2) or more publicly accessible datasets on the web which
can be used to answer your research question.

You may need to go through the following tasks (a)-(d) multiple times in order to a
ive at a
meaningful research question and findings.

(a) Identify the key claim(s) in the selected news article. Identify two (2) or more datasets
that are publicly accessible. Analyse the dataset and answer the following questions.
What kind of information is present in the datasets? How is the data organised and what
common features can be used to relate the datasets? Are there data quality issues in the
datasets (such as e
oneous data, missing data, etc.)? Do you need to prepare (such as
clean, transform, or manipulate) the raw data for analysis?

(b) Describe the measures that you need to calculate in order to answer your research
question. Refer to Appendix 1 for an example.

(c) Apply your Python programming skills to generate summary statistics and graphical
plots (or other form of visualisations) to address the stated research question. Explain
your findings and observations.

ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 4 of 6
ECA – July Semester 2019
(d) Identify some possible limitations of your findings. For example, are they limited to a
certain city or country? Justify your assumptions about the data, if any.



Present your work for tasks (a)-(d) in your report using the template provided in Appendix
2. Provide screenshots of the relevant Python codes for each task and its output(s) where
appropriate. Keep your report concise and coherent as a self-contained entity. The
evaluation criteria also include logical flow of your explanation, variety of the
visualisations employed and appropriate summary statistic used. More marks will be
awarded to answers with in-depth analyses and practical recommendations. Limit your
eport file size to a maximum of 4M Bytes.

For a
eakdown of the marks, please refer to Appendix 2.
(80 marks)

Write the codes you used for implementing tasks (a)-(d) in an .ipynb file. The program should
have sufficient comments to describe the co
esponding steps and analyse the logical flow for
each task.
(20 marks)


Up to 25 marks of penalties will be imposed for inappropriate or poor paraphrasing. For serious
cases, they will be investigated by the examination department. More information on effective
paraphrasing strategies can be found on
https:
academicguides.waldenu.edu/writingcente
evidence/paraphrase/effective.








----- END OF ECA PAPER -----

https:
academicguides.waldenu.edu/writingcente
evidence/paraphrase/effective
ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 5 of 6
ECA – July Semester 2019
Appendix 1 Proposing Research Questions


Example: Do university towns have their mean housing prices less affected by recessions?

• The following datasets could be used:
• From the Zillow research data site (http:
www.zillow.com
esearch/data/) there is
housing data for the United States. In particular, the datafile for all homes at a city
level (http:
files.zillowstatic.com
esearch/public/City/City_Zhvi_AllHomes.csv),
has median home sale prices at a fine-grained level.
• From the Wikipedia page on college towns is a list of university towns in the
United States
(https:
en.wikipedia.org/wiki/List_of_college_towns#College_towns_in_the_Unit
ed_States).
• From Bureau of Economic Analysis, US Department of Commerce, the GDP over
time (http:
www.bea.gov/national/index.htm#gdp) of the United States in cu
ent
dollars (use the chained value in 2009 dollars), in quarterly intervals.

• You may, for example, compare the ratio of the mean price of houses in university
towns the quarter before the recession starts compared to the recession bottom. A
ecession is defined as starting with two consecutive quarters of GDP decline, and
ending with two consecutive quarters of GDP growth. A recession bottom is the
quarter within a recession which had the lowest GDP.


ANL251 Copyright © 2019 Singapore University of Social Sciences (SUSS) Page 6 of 6
ECA – July Semester 2019
Appendix 2 Report Template & Marks Breakdown


Title of the ECA project
your name, SUSS PI No. and submission date

Abstract – Briefly describe the dataset, research question, methods, and findings.
(maximum 150 words)
(10 marks)
1. Datasets
Document your work for Task (a) here.
Provide the web links to retrieve the used datasets.
Provide screenshots of the relevant Python code and its output.
(15 marks)
2. Research Question
Provide the web link to retrieve the news article of your interest.
Briefly describe the key claims and supporting reasons in the article. Explain how your
esearch question supports, objects to or expands the claims in your selected news article.
Document your work for Task (b) here.
(10 marks)
3. Analysis
Document your work for Tasks (c) here.
Provide screenshots of the relevant Python code and its output.
(30 marks)
4. Conclusions
Document your work for Task (d) here.
(10 marks)
Reference
(5 marks)
Appendix (optional)

    1
Answered Same Day Aug 10, 2021

Solution

Prasun Kumar answered on Sep 04 2021
149 Votes
Does Singapore’s GDP affect its taxi ridership?
Chay Whye Hoe, SUSS PI No. and September 4, 2019
Abstract - The hypothesis of this paper is that with a increasing economy, people tend to avoid travelling by taxis. To prove/disapprove this reasoning, datasets provided by 2 service providers have been explored (1) from Dept. of Statistics, Government of Singapore (GDP data) and from Land Transport Authority, Government of Singapore (transportation data - taxi and public transport). Using Python li
aries such as Tabula, Numpy, Matplotlib and Pandas, data scraping, cleaning and analysis has been done. Co
elation analysis of various parameters show that there is a significant co
elation between these variables.
1. Datasets
To support the hypothesis, 3 datasets from 2 sources have been used in the cu
ent study. The Land Transport Authority (LTA), under the Ministry of Transport, provides data on taxi ridership (LTA, Publications & Research, 2019) from different sources (such as metered taxis, ride-hail operators etc.) on a monthly basis. The dataset is provided in Adobe Acrobat (pdf) file format (taxi_info_*.pdf). It contains information such as average daily number of trips (for 1 shift and 2 shift taxis), number of monthly rides for different taxi fleets (such as comfort, citycab, trans-cab etc.) and information on taxi drivers’ vocational license. Figure 1 shows data scraping from pdf files using the Python module ‘Tabula’ into Pandas DataFrame.
Figure 1: Data scraping from a pdf file
LTA also provides monthly (which can be aggregated to quarterly and yearly) statistics on public transport (PT) ridership (LTA, PT Ridership, 2018), from 3 sources (Bus, Mass Rapid Transit (MRT) and Light Rail Transit (LRT)) through its alliance with Tableau. These were not available for download (Figure 2), hence, the data was aggregated at quarterly basis and scraped manually (public transport data.txt).
Figure 2: LTA PT Ridership Website Screenshot
The 3rd data source is the Department of Statistics (DOS), Government of Singapore which provides detailed quarterly Gross Domestic Product (GDP) data, bifurcated by industry and using different estimation strategies (Government of Singapore, DOS, 2019). This data is available as Microsoft Excel file (xlsx) format (gdp data.xlsx). For this analysis the “GDP at Cu
ent Prices, By Industry (SSIC 2015), Quarterly” has been used. Figure 3 shows reading the data in Python using Pandas.
Figure 3: Reading GDP data in excel file using...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here