Choosing-Tools

Table Format: Hudi vs. Iceberg vs. Delta Lake

Jan 1, 0001

Streaming SQL Engine Comparison

Our Goal: User-Facing Analytics. The target here is user-facing analytics, not a BI data warehouse engine such as Trino, Dremio, or BigQuery. User-facing analytics needs a database that supports sub-second query responses, near-real-time updates, and high QPS (queries per second) under concurrent load.
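As a rough illustration of how the "sub-second" and "high QPS" requirements above can be quantified, here is a minimal Python sketch. The function name `summarize`, the sample latencies, and the 1-second threshold are assumptions made for this example, not measurements or tooling from the post.

```python
def summarize(latencies_s, window_s):
    """Return (qps, p99_latency) for queries completed within one time window.

    latencies_s: per-query latencies in seconds (hypothetical sample data).
    window_s:    length of the observation window in seconds.
    """
    ordered = sorted(latencies_s)
    n = len(ordered)
    # Nearest-rank 99th percentile, using integer math: ceil(0.99 * n).
    rank = (99 * n + 99) // 100
    p99 = ordered[rank - 1]
    qps = n / window_s
    return qps, p99


# Hypothetical workload: 100 queries in 10 seconds, one slow outlier.
sample = [0.1] * 99 + [2.0]
qps, p99 = summarize(sample, window_s=10)

# "Sub-second" is read here as p99 < 1.0 s; the threshold is an assumption.
meets_latency_goal = p99 < 1.0
```

Judging by a tail percentile rather than the mean keeps a few slow queries from being hidden by many fast ones, which matters when every query is tied to a user waiting on a page.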

Jan 1, 0001

Database Comparison: PostgreSQL vs. MySQL

TLDR: Use PostgreSQL unless you have no choice, for example because you work at a company that already uses MySQL.

References:
https://www.reddit.com/r/node/comments/rv6u8u/why_do_you_choose_mysql_over_postgres/
https://www.reddit.com/r/PostgreSQL/comments/tldork/in_what_circumstances_is_mysql_better_than/
https://www.reddit.com/r/PostgreSQL/comments/xblooo/convince_me_to_choose_postgresql_over_mysql/
https://www.reddit.com/r/golang/comments/16hn0u3/mysql_or_postgres/
https://www.bytebase.com/blog/postgres-vs-mysql/
https://www.integrate.io/blog/postgresql-vs-mysql-which-one-is-better-for-your-use-case/
https://www.datacamp.com/blog/postgresql-vs-mysql

Popularity. Why? Better community support.
Google Trends
Big Company Adoption
Distribution of Commits: MySQL development is dominated by Oracle, much as Delta Lake development is dominated by Databricks.

Jan 1, 0001

Data Format: Avro vs. Protobuf vs. Parquet

Theory: What is a data format? A data format is a specific representation or arrangement of data. It defines how information is structured, encoded, and transmitted. Data formats specify: Structure: how the data is organized.
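To make the definition above concrete, the sketch below encodes one record in two ways using only the Python standard library. It does not use Avro, Protobuf, or Parquet themselves. The record fields and the `LAYOUT` string are hypothetical, chosen only to show that the same information can have different structure and encoding.

```python
import json
import struct

# Hypothetical record used only for illustration.
record = {"user_id": 42, "score": 3.5}

# Text encoding: self-describing, the field names travel with every record.
as_json = json.dumps(record).encode("utf-8")

# Binary encoding: the layout lives in a schema agreed on out of band.
# "<If" means little-endian, one unsigned 32-bit int, one 32-bit float.
LAYOUT = "<If"
as_binary = struct.pack(LAYOUT, record["user_id"], record["score"])

# Reading the binary form back requires knowing the same layout.
user_id, score = struct.unpack(LAYOUT, as_binary)
decoded = {"user_id": user_id, "score": score}
```

The binary form is smaller because the field names are not repeated in each record, but it cannot be read without the schema. Schema-based formats such as Avro and Protobuf make a similar trade-off, with their own rules for how the schema is defined and shared.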

Jan 1, 0001

Compute Engine: Spark vs. Flink

Jan 1, 0001

Business Intelligence: Superset vs. Metabase vs. Redash

References: https://atwong.medium.com/best-open-source-replacements-for-business-intelligence-tools-power-bi-tableau-looker-3857ea58737d

Jan 1, 0001