My Cloud Resume

Skills

LLMs

Retrieval Augmented Generation (RAG)
Vector Search and Storage

Data Processing

HTTP/ API Webscraping
SQL
Python
Airflow/ Prefect
dbt

Analytics

Tableau
PowerBI
Excel
Superset
Metabase

Data Quality

OpenMetadata
AWS Glue

Cloud Services

AWS
GCP
Terraform
Kubernetes
Docker
Flask

Other proficiencies

CICD
Git
Networking
Cyber Security
Linux
Machine Learning
Statistics

Work

  • Razer
    2024 - Present
    Big Data Engineer
    • End-to-end management of company's data pipelines, platform infrastructure, and quality
    • - Initiator and owner of internal data catalog platform (open-source OpenMetadata) deployed on EKS
    • - Wrote custom library to enable integration between our pipelines and OpenMetadata SDK
    • - Developed templatized repository for the deployment of data pipelines for RAG and vector databases.
  • Data Analyst
    • Building data and ML apps to improve manufacturing process control
    • - Built data model and automated pipeline for complex data extraction, aggregation, and visualization of Quality Control (QC) data.
    • - Created real-time performance monitoring of the site’s 40+ vaccine products via central control tower app with ML capabilities.
    • - Contributing member of in-house data intelligence service team. Used by 5 other GSK sites globally to build custom ML apps.
  • Manufacturing Science Specialist
    • Statistical analysis and engineering project management
    • - Improved routine monitoring and statistical process control of product quality attributes through automated analysis and report generation.
  • Adra AI
    2021 - 2023
    Technical Consultant
    • Clinical trial data analysis and protocol writing for AI radiology software
    • - Cleared U.S. FDA, and Singapore HSA clinical trial submissions
  • Freelance
    2022 - Present
    Data Consultant
    • Develop data pipelines and visualisations for small companies
    • - N/A