Exploiting a Natural Network Effect for Scalable, Fine-grained Clock Synchronization 论文阅读

最新推荐文章于 2024-10-27 21:21:19 发布

braveTester

最新推荐文章于 2024-10-27 21:21:19 发布

阅读量565

点赞数

CC 4.0 BY-SA版权

分类专栏：论文阅读博士 NSDI

本文链接：https://blog.csdn.net/braveTester/article/details/97621129

本文探讨了高精度时钟同步在一致性、事件排序、因果性和任务资源调度等方面的重要性。针对传统方法存在的精度与易部署性的权衡问题，介绍了论文'Huygens'提出的一种方法，它能在数据中心环境中实现纳秒级精度且易于部署。通过利用网络效应，该算法能够在考虑路径噪声和传播延迟的情况下，有效地校正时钟同步误差。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Exploiting a Natural Network Effect for Scalable, Fine-grained Clock Synchronization

Introduction

Usage of synchronizing clocks

Usage of synchronizing clocks:

consistency
event ordering
causality
scheduling of tasks and resources

Challenges for high precision synchronized clocks

Challenges for high precision synchronized clocks:

Uncertainty of clock is comparable to propagation delay of the network. The common used clocks (implemented by a quartz crystal oscillator) may drift from true time at the rate of 6-10 microseconds/sec. But the one-way delay (OWD), defined as the raw propagation (zero-queuing) time between sender and receiver, in high-performance data centers is under 10μs.
Path noise. Path noise (due to small fluctuations in switching times, path asymmetries
(e.g., due to cables of different length) and clock timestamp noise) is in the order of 10s-100s of ns, and is hard to measure → hard to have ns level clocks.

Current limitation

Current limitation: trade-off between easy deployability and precision.

Huygens (this paper)

Huygens (this paper) achieves 10s of nanoseconds precision, and is easy to be deployed.

Literature survey

Two methods to determining OWD:

Determining the time spent by the probe at each element en route from A to B.
By estimating the RTT (where B sends a probe back to A). In this case, assuming the OWD is equal in both directions, halving the estimated RTT gives the OWD.

NTP

Picking the three with the smallest RTTs along multiple probe-echo pairs.

10s of ms in WAN, 10s of μs in DCN.

PTP

Switches record the ingress and egress time of a packet to accurately obtain packet dwell times at switches.

Advanced hardware + dedicated network → < 1ns

conventional fully “PTP-enabled network” → 10s-100s of ns

not fully “PTP-enabled network” → 1000x worse, 10s-100s of μs

high load network → performance degradation

DTP

Use PHY synchronization mechanism defined in IEEE 802.3 Ethernet. It is fine-grained, and is not load-dependent.

In 10Gbps, it can achieve 25.6ns (6.4ns * 4) for a single hop.

Need special extra hardware.

PPS

Use GPS, all communication cost is precisely measured.

Very expensive to deploy at scale.

Our approach

Data center features

Symmetric, multi-level, fat-tree topology.
Propagation times are small, well-bounded by 25-30μs. Abundant bisection bandwidth + multiple path → a reasonably good chance probes can traverse the network without encountering queueing delays (really?).
Many servers → possible to synchronize them in concert.

Algorithms and techniques

Coded probes. A pair of probe packets going from server $i$ to $j$ with a small inter-probe transmission time spacing of $s$ . Only take coded probes which keep the $s$ into account (“pure” coded probes).
Support Vector Machines (SVM).
Network effect. from Wikipedia, A network effect (also called network externality or demand-side economies of scale) is the effect described in economics and business that an additional user of a good or service has on the value of that product to others.