Skills
LLMs
Retrieval Augmented Generation (RAG)
Vector Search and Storage
Data Processing
HTTP/API Web Scraping
SQL
Python
Airflow/Prefect
dbt
Analytics
Tableau
Power BI
Excel
Superset
Metabase
Data Quality
OpenMetadata
AWS Glue
Cloud Services
AWS
GCP
Terraform
Kubernetes
Docker
Flask
Other proficiencies
CI/CD
Git
Networking
Cyber Security
Linux
Machine Learning
Statistics
Work
-
Razer, Big Data Engineer, 2024 - Present
- End-to-end management of the company's data pipelines, platform infrastructure, and data quality
- - Initiator and owner of internal data catalog platform (open-source OpenMetadata) deployed on EKS
- - Wrote a custom library to integrate our pipelines with the OpenMetadata SDK
- - Developed a templatized repository for deploying data pipelines for RAG and vector databases
-
GlaxoSmithKline (GSK) Vaccines, Data Analyst, 2022 - 2024
- Building data and ML apps to improve manufacturing process control
- - Built data model and automated pipeline for complex data extraction, aggregation, and visualization of Quality Control (QC) data.
- - Created real-time performance monitoring of the site’s 40+ vaccine products via a central control tower app with ML capabilities.
- - Contributing member of the in-house data intelligence service team, whose service is used by 5 other GSK sites globally to build custom ML apps.
-
GlaxoSmithKline (GSK) Vaccines, Manufacturing Science Specialist, 2021 - 2022
- Statistical analysis and engineering project management
- - Improved routine monitoring and statistical process control of product quality attributes through automated analysis and report generation.
-
Adra AI, Technical Consultant, 2021 - 2023
- Clinical trial data analysis and protocol writing for AI radiology software
- - Cleared U.S. FDA and Singapore HSA clinical trial submissions
-
Freelance, Data Consultant, 2022 - Present
- Develop data pipelines and visualisations for small companies
- - N/A
-
National University of Singapore (NUS), Bachelor of Engineering (Honours) in Chemical Engineering, 2017 - 2021