Justin

Experience

Junior Data Scientist, May 2024 - current
Govtech Nucleus Unit, Kementerian Digital, Kuala Lumpur

  • Developed web-scraping infrastructure for SiaranGovMy demo
    • A modular infrastructure to easily customize scraping logics across 30+ ministry/agency websites for media releases
  • Developed parser to convert pdf to markdown [demo]
    • Using Python and the PyMuPDF library to parse PDF files from urls to plain text, markdown and metadata
  • Developed AI-powered microservices:

Data Scientist I, Oct 2022 - May 2024
AirAsia SEA Sdn. Bhd., Selangor

  • Maintained the predictive booking pace curve model
  • Developed ETL pipelines
  • Developed Looker Studio dashboards
  • Routine exploratory and feasibility studies for new datasets, and problem statements

Research Assistant, Sep 2020 - Sep 2022
School of Data Science, Perdana University, Kuala Lumpur

  • Produce pipeline for preprocessing datasets, using multiple bioinformatics tools
  • Performing analysis on the preprocessed dataset, eg: differentially expressed gene analysis, machine learning for classification analysis
  • Generate progress reports

Internship, Oct-Dec 2018
B. Braun Medical Industries Sdn. Bhd., Penang

  • Measured and reported on workstation environment cleanliness and brightness
  • Supporting studies on the Automated Gluing Project for the Elastomeric Pump Assembly
  • Revision of the Sub-microbore Assembly Standard Operating Procedure

Education

Bachelor of Science (Hons) Biotechnology, June 2020
Universiti Tunku Abdul Rahman, Kampar, Perak

Thesis title: “Morphometric variability of the anatomical structures of monogeneans parasites (Capsalidae: Benedeniinae) from cultured groupers (Epinephelus coioides) of Hong Kong and Malaysia”

Advisor: Wong Wey Lim, Assoc. Professor, Universiti Tunku Abdul Rahman

Foundation in Science, 2017 Universiti Tunku Abdul Rahman, Kampar, Perak


Honours and Awards

Fundamental Research Grant Scheme, 2020 Perdana University, Kuala Lumpur

Student Assistantship, 2020 Universiti Tunku Abdul Rahman, Kampar, Perak


Computer Skills

Programming Languages : Python (Pandas, numpy, Sci-kit learn, Hyperopt-sklearn, Streamlit, FastAPI, Plotly, request-html, Pyspark), Vega/Vega-lite, Airflow, Terraform, Git, Snowflake SQL, R

Cloud Computing : Google Cloud Platform (BigQuery, Looker Studio, Google Kubernetes Engine, Google Cloud Storage), Vercel


Language Skills

English : Proficient (MUET Band 4)

Mandarin : Fluent

Malay : Conversational

Hokkien : Fluent

Japanese : Beginner


Analytics Bounty Experiences

Flipside Crypto (Profile: https://flipsidecrypto.xyz/tzeji)

  • Extracting relevant blockchain data from Flipside SQL databases
  • Produce dashboards to answer questions for specific blockchain ecosystems.

See dashboards I've created for Flipside here


Projects/Activities Involved

Contributor, May 2022-July 2022
MetricsDAO

  • Authoring weekly newsletter series by reporting happenings in the organization and announcement of upcoming events to community members

Working Committee of Bioinformatics Workshop Fiesta 2020/22, Oct 2020 - Sep 2022
Asia Pacific Bioinformatics Network

  • Coordinating workshop sessions
  • Creating adverts for each workshop
  • Managing participant registrations
  • Providing customer support
  • Managing workshop websites
  • Setting up and managing workshop Discord server
Download Resume