No physical address for x, dropping message when a node in a JGroups cluster is restarted

Problem description:

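When one of the nodes in the cluster is restarted, my node logs are flooded with the warning message "dropping unicast message to wrong destination". We are using JGroups over TCP, version jgroups-3.4.1.Final. My servers do not come up, and the logs are continuously flooded with these warnings.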

Below are the warning messages:

WARN [TransferQueueBundler,H-Broadcast,H-13] [TCP] JGRP000032: H-13: no physical address for 8281f201-7fb1-f6ac-faf3-d6837bc39087, dropping message

WARN [INT-1,H-Broadcast,H-13] [TCP] JGRP000031: H-13: dropping unicast message to wrong destination d205fcba-151C-ad58-8323-fe4f49117f88

Please let me know how to resolve this issue.

Thanks, Nivedita

<TCP loopback="true" 
    recv_buf_size="${tcp.recv_buf_size:20M}" 
    send_buf_size="${tcp.send_buf_size:640K}" 
    discard_incompatible_packets="true" 
    max_bundle_size="64K" 
    max_bundle_timeout="5" 
    enable_bundling="true" 
    use_send_queues="true" 
    sock_conn_timeout="300" 
    timer_type="new" 
    timer.min_threads="4" 
    timer.max_threads="10" 
    timer.keep_alive_time="3000" 
    timer.queue_max_size="500" 
    thread_pool.enabled="true" 
    thread_pool.min_threads="4" 
    thread_pool.max_threads="10" 
    thread_pool.keep_alive_time="5000" 
    thread_pool.queue_enabled="true" 
    thread_pool.queue_max_size="100000" 
    thread_pool.rejection_policy="discard" 
    oob_thread_pool.enabled="true" 
    oob_thread_pool.min_threads="1" 
    oob_thread_pool.max_threads="8" 
    oob_thread_pool.keep_alive_time="5000" 
    oob_thread_pool.queue_enabled="false" 
    oob_thread_pool.queue_max_size="100" 
    oob_thread_pool.rejection_policy="discard" 
    bind_addr="${hybris.jgroups.bind_addr}" 
    bind_port="${hybris.jgroups.bind_port}" /> 
<TCPPING timeout="3000" 
    initial_hosts="xxx.xx.xx.4[7800],xxx.xx.xx.5[7800],xxx.xx.xx.6[7800], xxx.xx.xx.7[7800], xxx.xx.xx.8[7800], xxx.xx.xx.9[7800], xxx.xx.xx.10[7800], xxx.xx.xx.11[7800], xxx.xx.xx.12[7800], xxx.xx.xx.13[7800], xxx.xx.xx.68[7800], xxx.xx.xx.69[7800], xxx.xx.xx.70[7800], xxx.xx.xx.4[7800], xxx.xx.xx.5[7800], xxx.xx.xx.6[7800]" 
    num_initial_members="16"/> 

<MERGE2 min_interval="10000" max_interval="30000" /> 
    <FD_SOCK /> 
    <FD timeout="3000" max_tries="3" /> 
    <VERIFY_SUSPECT timeout="1500" /> 
    <BARRIER /> 
    <pbcast.NAKACK use_mcast_xmit="false" exponential_backoff="500" discard_delivered_msgs="true" /> 
    <UNICAST2 /> 
    <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000" max_bytes="4M" /> 
    <pbcast.GMS print_local_addr="true" join_timeout="3000" view_bundling="true" /> 
    <UFC max_credits="20M" min_threshold="0.4" /> 
    <MFC max_credits="20M" min_threshold="0.4" /> 
    <FRAG2 frag_size="60K" /> 
    <pbcast.STATE_TRANSFER />

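Answer

Thanks a lot for your suggestions. The cluster nodes healed themselves once one of the problematic nodes was shut down (unlike the other nodes, it could not be reached via telnet).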

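I could not understand this answer clearly. I am getting the same problem! How should I configure the JGroups cluster? – NandhaKumar

One of the nodes in the cluster had a network problem, and we could not telnet to port 7800 on it. Once the faulty node was removed from the cluster, the remaining nodes healed themselves and joined the cluster. –

Anyway, thanks for your solution! In my case I can connect to the nodes using telnet, but they cannot join the cluster. I don't know where the problem is. – NandhaKumar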

Answer

I assume you are using TCP:TCPPING? Did you list all members in TCPPING.initial_hosts? This is the most likely cause of the warnings above.
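As a rough illustration of what "listing all members" could look like for the configuration in the question, here is a sketch of a TCPPING element in which each host appears exactly once. It assumes that the 13 distinct placeholder addresses in the posted initial_hosts are the complete membership (the posted list repeats the .4, .5 and .6 entries, so it has 16 entries for 13 hosts). Setting num_initial_members to 13 is likewise an assumption that simply follows that count; it is not a confirmed fix for the warnings.

```xml
<!-- Sketch only: every cluster member is listed once in initial_hosts. -->
<!-- The xxx.xx.xx.N placeholders are copied from the question, not real addresses. -->
<!-- num_initial_members="13" is an assumption: the number of distinct hosts listed. -->
<TCPPING timeout="3000"
    initial_hosts="xxx.xx.xx.4[7800],xxx.xx.xx.5[7800],xxx.xx.xx.6[7800],xxx.xx.xx.7[7800],xxx.xx.xx.8[7800],xxx.xx.xx.9[7800],xxx.xx.xx.10[7800],xxx.xx.xx.11[7800],xxx.xx.xx.12[7800],xxx.xx.xx.13[7800],xxx.xx.xx.68[7800],xxx.xx.xx.69[7800],xxx.xx.xx.70[7800]"
    num_initial_members="13"/>
```

The port in brackets has to match the bind_port that each node's TCP transport actually uses, which in the posted configuration comes from the hybris.jgroups.bind_port property.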

There is a cache that maps UUIDs (the internal representation of JGroups cluster members) to the physical address of each member.

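You can look at its contents via JMX or with probe.sh uuids. On h13 there should be a mapping for 8281f201-7fb1-f6ac-faf3-d6837bc39087, but it is missing. Again, the most likely cause is that h13 is not listed in TCPPING.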

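You could also try an alternative discovery protocol (e.g. MPING if IP multicast is supported, FILE_PING, which requires a shared file system, TCPGOSSIP with an external lookup service, etc.). See the manual for details.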
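To make the alternatives a bit more concrete, the fragment below sketches how two of them might replace the TCPPING element in a TCP-based stack when multicast is unavailable. The host name gossip-host, the port 12001 and the directory /mnt/shared/jgroups are hypothetical values chosen for illustration. The timeout and num_initial_members values are assumptions carried over in the style of the posted configuration, so check the attribute names against the manual for your JGroups version before relying on them.

```xml
<!-- Sketch only: use ONE of the following in place of TCPPING. -->

<!-- Option 1: TCPGOSSIP looks members up through an external GossipRouter. -->
<!-- gossip-host[12001] is a hypothetical address of a GossipRouter process -->
<!-- that must be started separately and be reachable from every node. -->
<TCPGOSSIP timeout="3000"
    initial_hosts="gossip-host[12001]"
    num_initial_members="13"/>

<!-- Option 2: FILE_PING stores member addresses as files in a shared directory. -->
<!-- /mnt/shared/jgroups is a hypothetical path that every node can read and write. -->
<FILE_PING timeout="3000"
    location="/mnt/shared/jgroups"
    num_initial_members="13"/>
```

With either option, the nodes no longer need a static list of every member, which may help when the set of hosts changes or when one listed host is unreachable.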

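Yes, we have listed all hosts in the initial_hosts attribute in jgroups-tcp.xml. We are on the Azure cloud, which does not support multicast, so we cannot use multicast. Please find the jgroups-tcp.xml configuration below –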

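Pasted the jgroups-tcp.xml configuration into the question –

Tried the probe command, but it did not return any results. If probe uses multicast, I think it will not work. Could you please help if there is any other mechanism I can use? –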
