Lead a team of three engineers, educate and support data science team to achieve faster delivery
Collaborate with pre-sales and data science team to ensure the feasibility of incoming projects
Improve day-to-day operations within the consulting team to speed up processes
Within 8 months since onboarding, managed to have the data science team who previously exclusively develop on Google Colab / Vertex Notebook do code versioning, pull requests, and python venv (via poetry). More than 30+ pull requests have been made during this period
Oversee technical direction and delivery for projects. This involves model development (MLOps) and application deployment (DevOps), while utilizing a common foundational blocks to reduce implementation overhead (platform engineering)
Lead a team of three engineers, building and maintaining infrastructure for five products, centralized data platform and MLOps platform.
Collaborate with engineering, product and data team to align the platform directions that satisfy all parties
Use Terraform with Infracost to assess infrastructure cost to prune unused/underutilized resources, in turn reduce 33% of cloud cost per month (FinOps)
Design and implement end-to-end cloud-native application deployment (Terraform for infra/secrets, Helm for application deployment, GitHub Actions for CI/CD, Kubernetes for container orchestration)
Lead a major core product refactoring across ops, engineering and data (data engineering, data science and machine learning engineering). Affected areas: data pipelines, model training pipelines, real-time inference endpoints, database performance optimization, deployment pipelines, local development workflow
Lead a cloud migration from AWS to GCP (AWS ECS to GCP GKE)
Alternative / self-hosted version for popular subscription services: Netflix, Spotify, LastPass, Trello, Dropbox, NordVPN, etc
Workloads are managed via Docker Compose and Helm
Terraform for managing Cloudflare and Kubernetes
Caddy for reverse-proxy
SSO via Authentik
Measure latency, RPS and CPU & memory utilization for a hello world api in various languages
Measure overheads incurred from using C FFI in various languages
Run spark jobs on kubernetes, which can be used both locally and on production environment