The Square-Root Rule for Commit Delays Collapses Above This Load Threshold

arxiv.org · @systems_wire · 5 hours ago · Systems Engineering · 3 comments

A closed-loop analysis shows that the greedy-pipelined commit policy matches the best-tuned timer within 0.1%, making commit_delay=0 optimal above a device-set load threshold.

postgresql · aws · ebs gp3 · nvme · database systems · group commit

Stop tuning your group commit timer. Above a device-set load threshold, the parameter-free greedy-pipelined flush policy (flush the instant the device is free) matches any oracle-tuned timer within 0.1%. That's from a new paper that models group commit as a closed queueing network - the real world, not the textbook open-loop fantasy.

The textbook says you need an optimal timer: the EOQ square-root rule $T^\star=\sqrt{2F_0/\lambda}$ for Poisson arrivals, or a ski-rental 2-competitive wait-or-flush decision. That's open-loop theory. In actual OLTP, clients are closed-loop: they wait for their commit to complete before issuing the next transaction. The arrival rate is induced by the policy's own latency. Model that correctly, and the greedy-pipelined policy self-clocks to a fixed point. No tuning knob required.
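The self-clocking behaviour can be illustrated with a toy simulation. This is a minimal sketch, not the paper's model: it assumes one flush in flight at a time, a constant flush time, and exponentially distributed client think times, and the function name `simulate_greedy` and all parameter values are hypothetical.

```python
import heapq
import random


def simulate_greedy(n_clients, flush_time, mean_think, n_flushes, seed=0):
    """Toy closed-loop group commit: flush the instant the device is free.

    Assumptions of this sketch (not taken from the paper): a single flush
    in flight, constant flush_time, exponential think time (mean_think > 0).
    """
    rng = random.Random(seed)
    # Each client owns exactly one entry: when its next commit request arrives.
    arrivals = [rng.expovariate(1.0 / mean_think) for _ in range(n_clients)]
    heapq.heapify(arrivals)

    device_free_at = 0.0
    committed = 0
    latency_sum = 0.0

    for _ in range(n_flushes):
        # Greedy rule: start as soon as the device is free and a commit is pending.
        start = max(device_free_at, arrivals[0])
        batch = []
        while arrivals and arrivals[0] <= start:
            batch.append(heapq.heappop(arrivals))
        done = start + flush_time
        for arrived in batch:
            latency_sum += done - arrived
            # Closed loop: the client only thinks about its next transaction
            # after this commit has completed.
            heapq.heappush(arrivals, done + rng.expovariate(1.0 / mean_think))
        committed += len(batch)
        device_free_at = done

    return {
        "throughput": committed / device_free_at,
        "mean_latency": latency_sum / committed,
        "mean_batch": committed / n_flushes,
    }


if __name__ == "__main__":
    for n in (1, 8, 64):
        print(n, simulate_greedy(n, flush_time=1e-3, mean_think=5e-4, n_flushes=5000))
```

In this toy model there is no timer anywhere in the loop: the batch size tends to grow with the number of clients on its own, while each commit's latency stays between one and two flush times.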

The Device-Set Load Threshold That Makes Tuning Vacuous

The key quantity is the device-set load threshold $\lambda^\star=2/F_0$, the arrival rate at which the square-root timer $T^\star$ shrinks to the fsync time $F_0$ itself. Above it, the optimal timer collapses onto zero - the greedy policy. Below it, the clean theory applies, but in practice most production databases run above $\lambda^\star$ on modern storage. The paper measures fsync distributions on two AWS storage classes, EBS gp3 and instance NVMe, spanning a 25x range in latency. Both confirm the effect: the threshold is easily exceeded under realistic loads.
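For readers who want the arithmetic, here is one hedged way to see where a threshold of this kind could come from. It assumes the standard EOQ cost model (a fixed flush cost $F_0$ amortized over a window $T$, plus an average wait of $T/2$ for each of the $\lambda$ arrivals per unit time) and reads the threshold as the load at which the optimal window equals $F_0$. The paper's exact objective and definition may differ.

$$
C(T)=\frac{F_0}{T}+\frac{\lambda T}{2},\qquad
C'(T)=-\frac{F_0}{T^2}+\frac{\lambda}{2}=0
\;\Longrightarrow\;
T^\star=\sqrt{\frac{2F_0}{\lambda}}
$$

$$
T^\star=F_0
\;\Longleftrightarrow\;
\frac{2F_0}{\lambda}=F_0^{2}
\;\Longleftrightarrow\;
\lambda=\lambda^\star=\frac{2}{F_0}
$$

Under this reading, any $\lambda>\lambda^\star$ gives $T^\star<F_0$: the prescribed wait is shorter than a single flush, so a timer cannot do much that flushing on device-free does not already do.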

PostgreSQL Confirms: commit_delay=0 is Competitive

They tested directly on PostgreSQL, whose commit_delay parameter sets how long a commit waits before the WAL flush is initiated. Setting commit_delay=0 (the greedy flush) was competitive with any tuned value across their workloads. No need for adaptive policies, no square-root calculus, no ski-rental gymnastics. Just flush when the device is free. The paper's contribution is a characterization that explains why deployed practice already defaults to zero - and why your tuning efforts above a moderate load are wasted.

This characterization gives you a simple decision rule: compute your $\lambda^\star$ from your fsync latency, and if your commit rate exceeds it, stop tuning and go work on something that actually matters.
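The decision rule can be sketched in a few lines. This is an illustration under a stated assumption, not the paper's procedure: it takes the threshold to be $\lambda^\star = 2/F_0$ with $F_0$ the measured fsync latency in seconds, and the function names and example latencies are hypothetical.

```python
import math


def sqrt_rule_timer(f0_s: float, rate_per_s: float) -> float:
    """Open-loop EOQ timer T* = sqrt(2 * F0 / lambda), in seconds."""
    return math.sqrt(2.0 * f0_s / rate_per_s)


def load_threshold(f0_s: float) -> float:
    """Assumed threshold lambda* = 2 / F0, in commits per second."""
    return 2.0 / f0_s


def timer_tuning_worthwhile(f0_s: float, rate_per_s: float) -> bool:
    """True only below the assumed threshold; above it, leave the delay at 0."""
    return rate_per_s < load_threshold(f0_s)


if __name__ == "__main__":
    # Hypothetical fsync latencies, chosen 25x apart; not measurements.
    for label, f0_s in (("slow device", 1e-3), ("fast device", 4e-5)):
        print(label, "threshold (commits/s):", load_threshold(f0_s))
    print(timer_tuning_worthwhile(f0_s=1e-3, rate_per_s=5000.0))
```

With a hypothetical 1 ms fsync, the assumed threshold works out to roughly 2,000 commits per second, so a system committing 5,000 times per second would already sit in the regime where the article says timer tuning stops paying off.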

Source: Group Commit Self-Clocks: Why Tuning Is Unnecessary Above a Device-Set Load Threshold
Domain: arxiv.org
