r/Clickhouse

paradedb/benchmarker: a workload agnostic, multi-backend benchmarking tool.

Hi r/postgresql!

We just open sourced ParadeDB Benchmarker, a multi-backend benchmarking framework built on top of the excellent Grafana k6 (blog post).

One of the goals was avoiding a shared query abstraction layer. PostgreSQL queries stay PostgreSQL queries, with their own driver and native SQL.

Supports PostgreSQL, Elasticsearch, OpenSearch, ClickHouse, MongoDB, and ParadeDB with:

  • mixed read/write workloads
  • support for docker-compose profiles per backend
  • dataset loader
  • config and setup capture
  • live metrics + exported reports
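To make the "no shared query abstraction" idea concrete, here is a minimal sketch of a mixed read/write workload in which each backend keeps its own native query text. This is not the Benchmarker's actual API: the `WORKLOADS` table, the `docs` table, the queries, and `pick_operation` are all hypothetical names chosen for illustration.

```python
import random

# Hypothetical workload table: every backend carries its own native query
# text, so nothing is translated through a shared query layer.
WORKLOADS = {
    "postgresql": {
        "read": "SELECT id, title FROM docs WHERE title ILIKE $1 LIMIT 10",
        "write": "INSERT INTO docs (title, body) VALUES ($1, $2)",
    },
    "clickhouse": {
        "read": "SELECT id, title FROM docs WHERE title ILIKE {q:String} LIMIT 10",
        "write": "INSERT INTO docs (title, body) VALUES",
    },
}


def pick_operation(backend, read_ratio, rng):
    """Pick a read or write for one iteration of a mixed workload.

    read_ratio is the fraction of iterations that should be reads (0.0 to 1.0).
    Returns (kind, native_query_text) for the chosen backend.
    """
    kind = "read" if rng.random() < read_ratio else "write"
    return kind, WORKLOADS[backend][kind]


if __name__ == "__main__":
    rng = random.Random(42)  # seeded so a run is repeatable
    for _ in range(5):
        print(pick_operation("clickhouse", 0.8, rng))
```

The point of the sketch is only that the mix ratio is shared across backends while the query text is not.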

One of the aha moments I had building this was using the pgx Go driver in anger for the first time. I'm a Rust guy, but I'm seriously impressed with pgx and what it can do.

Any comments are welcome. We will be using this to benchmark ParadeDB, but you can write your own datasets and workloads that have nothing to do with full-text search.

github.com
u/jamesgresql — 20 hours ago

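If you're consuming from Kafka and writing into ClickHouse, synchronous inserts at high message rates will hurt you, because every small insert creates a new part that background merges then have to clean up. Async insert mode helps a lot, but its buffering and deduplication behavior isn't always obvious.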
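As a rough illustration, async inserts can be switched on per request by passing settings as URL parameters to the ClickHouse HTTP interface. The sketch below only builds the request URL and body and does not send anything. The host, the `events` table, and the timeout and size values are assumptions, and setting names and defaults can differ between ClickHouse versions, so check them against your server.

```python
import json
from urllib.parse import urlencode


def build_async_insert_url(base_url, table, *, wait=True, deduplicate=False,
                           busy_timeout_ms=1000, max_data_size=10_000_000):
    """Build an HTTP insert URL that enables async inserts for this request.

    wait=True asks the server to acknowledge only after the buffer is flushed,
    so flush errors still reach the client. wait=False returns sooner, but the
    client no longer sees failures that happen at flush time.
    """
    params = {
        "query": f"INSERT INTO {table} FORMAT JSONEachRow",
        "async_insert": 1,
        "wait_for_async_insert": 1 if wait else 0,
        # The buffer is flushed when either limit is reached first.
        "async_insert_busy_timeout_ms": busy_timeout_ms,
        "async_insert_max_data_size": max_data_size,
        # Deduplication of async inserts is a separate, opt-in setting.
        "async_insert_deduplicate": 1 if deduplicate else 0,
    }
    return f"{base_url}/?{urlencode(params)}"


def encode_rows(rows):
    """Serialize a batch of Kafka messages (as dicts) to JSONEachRow bytes."""
    return "\n".join(json.dumps(row) for row in rows).encode("utf-8")


if __name__ == "__main__":
    print(build_async_insert_url("http://localhost:8123", "events"))
    print(encode_rows([{"id": 1}, {"id": 2}]))
```

The URL and body would then be sent as an HTTP POST by whatever client the consumer already uses.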

Wrote this up from my experience building a stream processing pipeline.

Curious how others are handling the Kafka → ClickHouse write path.

u/Marksfik — 1 day ago

ClickHouse is a beast for observability, but dumping raw, un-enriched OTel data into it can lead to massive storage costs and messy queries. We just launched a native OTLP connection for GlassFlow that moves that processing upstream, enriching and filtering OTel spans before they hit the table.

The goal is to keep the dashboards fast without the overhead of massive background merges or complex SQL views. Check out the setup we’re using for enriched OTel pipelines. What’s your biggest bottleneck when querying raw OTel data in ClickHouse? 🤔
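For anyone wondering what "enrich and filter upstream" means in practice, here is a minimal generic sketch of the idea. It is not GlassFlow's API: spans are assumed to be flattened dicts, and `filter_and_enrich`, the `team` field, the owner lookup, and the dropped span name are all hypothetical.

```python
def filter_and_enrich(spans, service_owners, drop_names=frozenset({"GET /healthz"})):
    """Drop noisy spans and attach an owner before anything is written.

    spans: iterable of flattened span dicts (assumed shape, not the OTLP wire format).
    service_owners: hypothetical lookup from service name to owning team.
    Yields new dicts, so the input spans are left unmodified.
    """
    for span in spans:
        if span.get("name") in drop_names:
            continue  # filtered upstream, never reaches the table
        enriched = dict(span)
        enriched["team"] = service_owners.get(span.get("service.name"), "unknown")
        yield enriched


if __name__ == "__main__":
    raw = [
        {"name": "GET /healthz", "service.name": "api"},
        {"name": "POST /orders", "service.name": "api"},
    ]
    for row in filter_and_enrich(raw, {"api": "checkout"}):
        print(row)
```

Rows that arrive already filtered and tagged like this can be queried directly, without a join or view to add the missing context at read time.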

u/Marksfik — 9 days ago

Hello,

I'm having a hard time connecting my in-house ClickHouse to my Tableau server. I'm not sure whether I'm using the correct driver, and the documentation hasn't been much help either.

Appreciate anyone's support on this.

reddit.com
u/Slow-Leave3149 — 13 days ago