Exploiting a Natural Network Effect for Scalable, Fine-grained Clock Synchronization 论文阅读

本文探讨了高精度时钟同步在一致性、事件排序、因果性和任务资源调度等方面的重要性。针对传统方法存在的精度与易部署性的权衡问题,介绍了论文'Huygens'提出的一种方法,它能在数据中心环境中实现纳秒级精度且易于部署。通过利用网络效应,该算法能够在考虑路径噪声和传播延迟的情况下,有效地校正时钟同步误差。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Exploiting a Natural Network Effect for Scalable, Fine-grained Clock Synchronization

Introduction

Usage of synchronizing clocks

Usage of synchronizing clocks:

  • consistency
  • event ordering
  • causality
  • scheduling of tasks and resources

Related paper:

  • LAMPORT, L. Time, clocks, and the ordering of events in a distributed
    system. Communications of the ACM 21, 7 (1978), 558–
    565.
  • LISKOV, B. Practical uses of synchronized clocks in distributed
    systems. Distributed Computing 6, 4 (1993), 211–219.

Useful in many fields: finance and e-commerce, distributed database, SDN, congestion control.

Accuracy will influence the performance of some systems.

Challenges for high precision synchronized clocks

Challenges for high precision synchronized clocks:

  • Uncertainty of clock is comparable to propagation delay of the network. The common used clocks (implemented by a quartz crystal oscillator) may drift from true time at the rate of 6-10 microseconds/sec. But the one-way delay (OWD), defined as the raw propagation (zero-queuing) time between sender and receiver, in high-performance data centers is under 10μs.
  • Path noise. Path noise (due to small fluctuations in switching times, path asymmetries
    (e.g., due to cables of different length) and clock timestamp noise) is in the order of 10s-100s of ns, and is hard to measure → hard to have ns level clocks.

Current limitation

Current limitation: trade-off between easy deployability and precision.

Huygens (this paper)

Huygens (this paper) achieves 10s of nanoseconds precision, and is easy to be deployed.

Literature survey

Two methods to determining OWD:

  1. Determining the time spent by the probe at each element en route from A to B.
  2. By estimating the RTT (where B sends a probe back to A). In this case, assuming the OWD is equal in both directions, halving the estimated RTT gives the OWD.

NTP

Picking the three with the smallest RTTs along multiple probe-echo pairs.

10s of ms in WAN, 10s of μs in DCN.

PTP

Switches record the ingress and egress time of a packet to accurately obtain packet dwell times at switches.

Advanced hardware + dedicated network → < 1ns

conventional fully “PTP-enabled network” → 10s-100s of ns

not fully “PTP-enabled network” → 1000x worse, 10s-100s of μs

high load network → performance degradation

DTP

Use PHY synchronization mechanism defined in IEEE 802.3 Ethernet. It is fine-grained, and is not load-dependent.

In 10Gbps, it can achieve 25.6ns (6.4ns * 4) for a single hop.

Need special extra hardware.

PPS

Use GPS, all communication cost is precisely measured.

Very expensive to deploy at scale.

Our approach

Data center features

  1. Symmetric, multi-level, fat-tree topology.
  2. Propagation times are small, well-bounded by 25-30μs. Abundant bisection bandwidth + multiple path → a reasonably good chance probes can traverse the network without encountering queueing delays (really?).
  3. Many servers → possible to synchronize them in concert.

Algorithms and techniques

  1. Coded probes. A pair of probe packets going from server iii to jjj with a small inter-probe transmission time spacing of sss. Only take coded probes which keep the sss into account (“pure” coded probes).
  2. Support Vector Machines (SVM).
  3. Network effect. from Wikipedia, A network effect (also called network externality or demand-side economies of scale) is the effect described in economics and business that an additional user of a good or service has on the value of that product to others.

Clocks in the real-world

Following is the illustration of a coded probe (a pair of probe packets).

signaling-between-clocks

Let Δ\DeltaΔ be the relative difference between clocks of server A and server B (If server A’s clock is ttt, then the B’s is t+Δt + \Deltat+Δ).

We have

{ RXB=TXA+OWDA→B+ΔrxA=txB+OWDB→A−Δ \begin{cases} RX_B = TX_A + \text{OWD}_{A \rightarrow B} + \Delta \\ rx_A = tx_B + \text{OWD}_{B \rightarrow A} - \Delta \end{cases} { RXB=TXA+OWDAB+ΔrxA=txB+OWDBAΔ

Bert是一种在自然语言处理中被广泛使用的模型,其在各种任务中表现出了出色的性能。然而,对于方面级情感分析,Bert并不直接适用。因此,需要对Bert进行利用,并通过修改和扩展来适应这一任务。 端到端(end-to-end)的方面级情感分析是指通过一个模型直接从文本中提取方面和情感信息。为了利用Bert进行端到端的方面级情感分析,首先需要对数据进行预处理,并将其转换成Bert模型所接受的输入格式。这包括将文本分段、添加特殊标记以及填充序列等操作。 在Bert模型的基础上,需要添加相关的层来实现方面级情感分析。一种常见的方法是利用注意力机制来捕获方面词与其他词之间的关系。通过计算不同词之间的注意力权重,可以将方面词的相关信息传递给其他词,从而更好地理解整个文本。另外,也可以添加一些分类层来预测每个方面的情感。 为了更好地利用Bert,还可以使用领域特定的语料库来进行预训练。通过在大规模的语料库上进行预训练,模型可以更好地理解特定领域的文本,并提升方面级情感分析的性能。 此外,还可以通过调整Bert模型的超参数来进一步改善性能。例如,可以调整学习率、批大小和训练周期等超参数,以获得更好的结果。 总之,“exploiting bert for end-to-end aspect-based sentiment analysis”意味着通过对Bert进行修改和扩展,将其应用于端到端的方面级情感分析任务中,以提升模型的性能和效果。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值