Experience
Data Engineer, October 2025 - curent
Bank Negara Malaysia, Kuala Lumpur
Junior Data Scientist, May 2024 - June 2025
Govtech Nucleus Unit, Kementerian Digital, Kuala Lumpur
- Developed web-scraping infrastructure for SiaranGovMy demo
- A modular infrastructure to easily customize scraping logics across 30+ ministry/agency websites for media releases
- Developed parser to convert pdf to markdown [demo]
- Using Python and the PyMuPDF library to parse PDF files from urls to plain text, markdown and metadata
- Developed AI-powered microservices:
- Generative UI components using the Malaysian Design System library [demo]
- Image alt text generator [demo]
Data Scientist I, Oct 2022 - May 2024
AirAsia SEA Sdn. Bhd., Selangor
- Maintained the predictive booking pace curve model
- Developed ETL pipelines
- Developed Looker Studio dashboards
- Routine exploratory and feasibility studies for new datasets, and problem statements
Research Assistant, Sep 2020 - Sep 2022
School of Data Science, Perdana University, Kuala Lumpur
- Produce pipeline for preprocessing datasets, using multiple bioinformatics tools
- Performing analysis on the preprocessed dataset, eg: differentially expressed gene analysis, machine learning for classification analysis
- Generate progress reports
Internship, Oct-Dec 2018
B. Braun Medical Industries Sdn. Bhd., Penang
- Measured and reported on workstation environment cleanliness and brightness
- Supporting studies on the Automated Gluing Project for the Elastomeric Pump Assembly
- Revision of the Sub-microbore Assembly Standard Operating Procedure
Education
Bachelor of Science (Hons) Biotechnology, June 2020
Universiti Tunku Abdul Rahman, Kampar, Perak
Computer Skills
Programming Languages : Python (Pandas, numpy, Sci-kit learn, Hyperopt-sklearn, Streamlit, FastAPI, Plotly, request-html, Pyspark), Vega/Vega-lite, Airflow, Terraform, Git, Snowflake SQL, R
Cloud Computing : Google Cloud Platform (BigQuery, Looker Studio, Google Kubernetes Engine, Google Cloud Storage), Vercel