Karn Wong's Blog

Book Highlights - Babel by Gaston Dorren Apr 22, 2026
LLM tracing with MLflow AI Gateway Apr 10, 2026
Garmin development setup on Linux Apr 6, 2026
Development environment setup with flox and direnv Mar 31, 2026
Observability with eBPF and OTel Mar 27, 2026
Classical models vs neural networks for tabular data Mar 26, 2026
Multi-account git setup Mar 1, 2026
Book Highlights - Of Monsters and Mainframes by Barbara Truelove Feb 9, 2026
Reduce workload attack surface with distroless image Feb 3, 2026
ONNX multi-platform benchmark Jan 7, 2026
Multi-machine work setup Sep 11, 2025
Spatial join performance Aug 30, 2025
Static lib linking in Go program Aug 26, 2025
AWS cost optimization tactics Aug 4, 2025
Terraform project structure Jul 22, 2025
The true cost of deploying on EC2 Jul 18, 2025
Dataframe processing benchmarks (Thai) Jul 8, 2025
Dataframe processing benchmarks Jul 8, 2025
Simplify SSH (and HTTP & Postgres) access management with Warpgate Jun 28, 2025
Using Nix for CI, but with a twist Jun 20, 2025
Hello Garage, goodbye MinIO Jun 13, 2025
Cloudflare's terraform v5 provider migration strategy Jun 10, 2025
Model inference in ONNX Jun 3, 2025
My code search setup throughout the years May 31, 2025
WASM FFI performance benchmark Apr 15, 2025
WASM deployment benchmark Mar 10, 2025
Migrate from Docker Desktop to Podman on Darwin Mar 2, 2025
Cross-platform keyring-like setup (also works without desktop environment) Feb 13, 2025
My journey for fully switching from Windows over to Linux Jan 25, 2025
Information gathering infrastructure Dec 30, 2024
Native implementation vs WASM for Go, Python and Rust benchmark Dec 4, 2024
Various notes on career-related stuff Oct 19, 2024
Calling C from Go, Python and Rust benchmark Oct 12, 2024
LLM serving latency benchmark Oct 9, 2024
AWS IAM credentials best practices Oct 5, 2024
When (not) to use Terraform Oct 5, 2024
Hello World API Performance Benchmark (Go, Node, Python, Rust) Sep 20, 2024
Simplify self-hosting backups to S3 with docker Sep 7, 2024
Streamlit load test performance Sep 7, 2024
Reasons why you shouldn't use programming languages for IaC Aug 5, 2024
GCP's service account credentials can be a security risk. Here's how to mitigate them. Jul 14, 2024
Thoughts on summarization service system design Jun 9, 2024
Faster spark workloads with Comet Apr 7, 2024
Slim down Python Docker image size with Poetry and Pip Apr 7, 2024
Dataframe write performance to Postgres Mar 17, 2024
How to connect to Cloud SQL from Cloud Run (no, you don't need a VPC) Feb 10, 2024
What is platform engineering? Jan 21, 2024
Collaboration model for data science projects Jan 20, 2024
Should data scientists deploy models to production? Dec 30, 2023
Setting up Postgres locally, what could go wrong? Dec 23, 2023
Things to watch out for GCP SSL with Cloudflare DNS Dec 18, 2023
Serverless real-time machine learning inference with AWS Nov 28, 2023
Some problems can be solved with workflows Nov 24, 2023
Using Apache Iceberg to reduce data lake operations overhead Nov 15, 2023
Reduce operational costs with Terraform Nov 4, 2023
Spark on Kubernetes Sep 12, 2023
Data Engineering Resources Sep 9, 2023
A networking God tale: all I want is to run a speedtest behind a firewall Aug 27, 2023
Spatial data to QGIS server playbook (yes, this is for prod) Aug 10, 2023
Create Kubernetes service accounts with Terraform Aug 1, 2023
pglogical setup Jul 20, 2023
Book Highlights - Atlas of the Heart by Brene Brown Jul 6, 2023
Book Highlights - Build by Tony Fadell Jul 6, 2023
Hassle-free Kubernetes monitoring with Coroot Jun 9, 2023
Book Highlights - Managing Transitions: Making the Most of Change by William Bridges Apr 27, 2023
Use SQL against CSV (or other hard files) without CLI Apr 25, 2023
DevX starts at your local machine Apr 22, 2023
The mythical ChatOps in action Apr 18, 2023
DuckDB vs Polars vs Spark! Apr 7, 2023
Cost optimization with Kubernetes Apr 1, 2023
Kubernetes with Grafana Cloud Apr 1, 2023
Bare metal works, until it doesn't. Hello, cloud. Mar 24, 2023
Google Analytics v4 ingestion via BigQuery Mar 19, 2023
SecretOps with Teller Mar 19, 2023
Data transformation - Python vs SQL showdown Mar 18, 2023
Terraform RDS module with DNS setup Mar 17, 2023
Book Highlights - Engineering Management for the Rest of Us by Sarah Drasner Mar 16, 2023
Book Highlights - The Staff Engineer's Path by Tanya Reilly Mar 15, 2023
Load balancer 101, thou shalt not be alone Dec 10, 2022
Cross-platform package (+env) management with Nix Dec 3, 2022
Load credentials into your shell via Bitwarden CLI - Fish edition Nov 29, 2022
Cloudflare DNS management with Terraform Nov 20, 2022
Deploy more efficiently with templating Nov 5, 2022
IaC is cool, until someone specified the wrong machine size 💸 Nov 3, 2022
Deploy static site with branch preview via Cloudflare Pages Oct 5, 2022
Terraform with ECS task on EC2 backend Oct 4, 2022
Intro to Dagster Cloud Sep 27, 2022
Intro to Pulumi Sep 26, 2022
Data engineer archetypes Aug 26, 2022
Minimal ECS task with Fargate backend Aug 26, 2022
What SQL can't do for data engineering May 15, 2022
Use SSH key during Docker Build without embedding the key via ssh-agent Feb 6, 2022
Use Pyspark locally with Docker Dec 21, 2021
Reduce Docker image size with Alpine Dec 19, 2021
Secrets management with SOPS, AWS Secrets Manager and Terraform Nov 30, 2021
Run GitHub Actions faster with cache for Pipenv and Docker build Nov 9, 2021
ecs-cli snippets Oct 8, 2021
Self-hosting primer Aug 22, 2021
Python venv management Jul 2, 2021
Don't write large table to Postgres with Pandas Jun 27, 2021
Data engineering toolset (that I use) glossary Jun 4, 2021
Automatic Scrapy deployment with GitHub actions Jun 2, 2021
Elasticsearch with custom dictionary May 3, 2021
Shapefile to data lake Apr 23, 2021
Spark join OOM fix Apr 11, 2021
Add Ghost content to Hugo Mar 31, 2021
Hello Caddy Mar 7, 2021
Password auth with apache2 reverse-proxy Feb 22, 2021
Buying tea when you have Celiac Feb 14, 2021
Workarounds for archiving large shapefile in data lake Jan 31, 2021
Mongodb export woes Jan 27, 2021
CPU upgrade is a breeze, only if you know how Dec 20, 2020
There are caveats behind every statement Dec 1, 2020
Visualizing map region prefix/suffix Sep 3, 2020
Impute pipelines May 22, 2020
Word-based analysis with song lyrics Apr 15, 2020
The headache of transliterating Thai script into Roman letters Jan 6, 2020
Loanwords are okay Nov 6, 2019
Indic name mistransliteration in Thai version of Harry Potter Oct 10, 2018
Definition of 'a Word' Jul 30, 2018
Resettled refugees in Sweden Jul 30, 2018
How angur became องุ่น Nov 23, 2017
The confusing case of Thai romanization system Sep 14, 2017

Posts