Table of Contents

Project Description
Source
Structure
To start

Project Description

This project was completed for PDAS CA1, which tests basic competency in writing Python programs and in using Python packages such as NumPy and Matplotlib for data analysis and visualization.

The objective of the project is:

Are the more popular courses in ITE, polytechnics (Poly) and universities (UNI) correlated with better employment opportunities?

¹

Notes: Courses are grouped into course clusters based on the Graduate Employment Survey provided by MOE. The clusters are:

Fresh Graduates

Arts, Design & Media
Built Environment
Business
Dentistry
Education (NIE)
Engineering
Health Sciences
Humanities & Social Sciences
Information & Digital Technologies
Music
Sciences
Yale-NUS

Follow-up Graduates

Architecture
Biomedical Sciences and Chinese Medicine
Law
Medicine
Pharmacy
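
One way to approach the objective question is to treat each course cluster as one data point and measure how strongly intake (a proxy for popularity) moves with an employment measure. The following is a minimal sketch of that idea using NumPy, not the code from 'main.ipynb'. The function name `pearson_r` and every number in the example are assumptions made up for illustration; none of the values come from the MOE or Data.gov.sg datasets.

```python
import numpy as np


def pearson_r(x, y):
    """Return the Pearson correlation coefficient between two 1-D sequences.

    Hypothetical helper for illustration; it is not taken from main.ipynb.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape or x.size < 2:
        raise ValueError("x and y must be 1-D sequences of equal length >= 2")
    # np.corrcoef returns a 2x2 correlation matrix; the off-diagonal
    # entry is the correlation between x and y.
    return float(np.corrcoef(x, y)[0, 1])


# Placeholder values only, NOT figures from the actual datasets.
# Each position stands for one course cluster.
intake = [1200, 800, 3000, 150, 2100]
employment_rate = [85.0, 88.5, 90.2, 97.0, 93.1]

r = pearson_r(intake, employment_rate)
print(f"Pearson r between intake and employment rate: {r:.3f}")
```

A value of r near +1 or -1 would suggest a strong linear relationship, while a value near 0 would suggest little linear relationship. Correlation alone does not show that popularity causes better employment outcomes.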

Source

Dataset name	Link
Employment (2017-2019)	https://www.moe.gov.sg/-/media/files/post-secondary/joint-web-publication-6-aus-ges-2019.pdf?la=en&hash=DE36C0FF72D7FB96B7B29B96DBC8D67D03A7B3C3
Employment (2019-2021)	https://www.moe.gov.sg/-/media/files/post-secondary/ges-2021/joint-web-publication-4-aus-ges2021.ashx?la=en&hash=2CB3200A8C1B7D935D0253470072DE82DDF49B42
Intake by course (UNI)	https://data.gov.sg/dataset/universities-intake-enrolment-and-graduates-by-course
Intake by course (Poly)	https://data.gov.sg/dataset/polytechnics-intake-enrolment-and-graduates-by-course

The first two datasets are from MOE, while the remaining datasets are from Data.gov.sg.

Structure

File structure:

datasets_cleaned => contains the cleaned CSV files
datasets_src => contains the CSV files for all the original, uncleaned datasets
dataset.zip => contains a backup zip file of the datasets
clean.ipynb => cleans the data from datasets_src
main.ipynb => contains all the analysis and visualization code
README.md => lists the sources for the datasets

To start

Start by running:

pip3 install -r requirements.txt

Delete the entire contents of the 'datasets_cleaned' directory, then run the cells in 'clean.ipynb' to clean the data (this recreates the contents of 'datasets_cleaned').
Run the cells in 'main.ipynb' to see the analysis and visualizations.
For a summary of the graphs, see the PowerPoint slides.
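
The first step above, emptying 'datasets_cleaned' before re-running 'clean.ipynb', can also be scripted. The following is a minimal sketch using only the Python standard library. The helper name `clear_directory` is an assumption for illustration and is not part of this repository.

```python
from pathlib import Path
import shutil


def clear_directory(directory):
    """Delete everything inside `directory` but keep the directory itself.

    Hypothetical helper for illustration; returns the number of top-level
    entries removed, or 0 if the directory does not exist.
    """
    root = Path(directory)
    if not root.is_dir():
        return 0
    removed = 0
    for entry in root.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            # Remove sub-directories together with their contents.
            shutil.rmtree(entry)
        else:
            # Remove regular files and symbolic links.
            entry.unlink()
        removed += 1
    return removed
```

Calling `clear_directory("datasets_cleaned")` from the repository root would empty the directory, after which the cells in 'clean.ipynb' can be run to regenerate the cleaned files. The deletion is permanent, so check the path before running it.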

¹ The 2019 figures in the 2017-2019 employment dataset differ slightly from those in the 2019-2021 employment dataset. This is likely due to statistical noise added to protect the privacy of graduates.
