# data-engineering
Spatial join performance Aug 30, 2025 Dataframe processing benchmarks (ภาษาไทย) Jul 8, 2025 Dataframe processing benchmarks Jul 8, 2025 Faster spark workloads with Comet Apr 7, 2024 Dataframe write performance to Postgres Mar 17, 2024 Using Apache Iceberg to reduce data lake operations overhead Nov 15, 2023 Spark on Kubernetes Sep 12, 2023 Data Engineering Resources Sep 9, 2023 DuckDB vs Polars vs Spark! Apr 7, 2023 Google Analytics v4 ingestion via BigQuery Mar 19, 2023 Data transformation - Python vs SQL showdown Mar 18, 2023 Intro to Dagster Cloud Sep 27, 2022 Data engineer archtypes Aug 26, 2022 What SQL can't do for data engineering May 15, 2022 Use Pyspark locally with Docker Dec 21, 2021 Don't write large table to Postgres with Pandas Jun 27, 2021 Data engineering toolset (that I use) glossary Jun 4, 2021 Shapefile to data lake Apr 23, 2021 Spark join OOM fix Apr 11, 2021 Workarounds for archiving large shapefile in data lake Jan 31, 2021 Mongodb export woes Jan 27, 2021
https://karnwong.me/posts/rss.xml