https://karnwong.me/posts/rss.xml
# data-engineeringAll Tags
Spatial join performance 2025-08-30 Dataframe processing benchmarks (ภาษาไทย) 2025-07-08 Dataframe processing benchmarks 2025-07-08 Faster spark workloads with Comet 2024-04-07 Dataframe write performance to Postgres 2024-03-17 Using Apache Iceberg to reduce data lake operations overhead 2023-11-15 Spark on Kubernetes 2023-09-12 Data Engineering Resources 2023-09-09 DuckDB vs Polars vs Spark! 2023-04-07 Google Analytics v4 ingestion via BigQuery 2023-03-19 Data transformation - Python vs SQL showdown 2023-03-18 Intro to Dagster Cloud 2022-09-27 Data engineer archtypes 2022-08-26 What SQL can't do for data engineering 2022-05-15 Use Pyspark locally with Docker 2021-12-21 Don't write large table to Postgres with Pandas 2021-06-27 Data engineering toolset (that I use) glossary 2021-06-04 Shapefile to data lake 2021-04-23 Spark join OOM fix 2021-04-11 Workarounds for archiving large shapefile in data lake 2021-01-31 Mongodb export woes 2021-01-27